RCR Databases

The RCR maintains local copies of databases of DNA and protein sequences. These databases are updated regularly (daily or weekly) and integrated with our GCG sequence analysis system. Use the following tables as a guide to choose the appropriate database (or database sub-set) to make your sequence searching most efficient. In order to search a database with GCG's FASTA program, type the short name of that database followed by a colon and an asterisk, i.e. pr:*

DNA databases (GenBank Subdivisions)

Genbank nameShort nameDescription
gbgbAll GenBanks sections except est, sts, gss, htg
gbpgbp, gbplusAll GenBank sections including est, gss, htg, sts
gb_baba, bct, bactBacterial
gb_inin, invInvertebrate
gb_omom, mamOther Mammalian (non-rodent, non-primate)
gb_ovov, vertOther Vertebrate (non-mammalian vertebrates)
gb_oror, orgOrganelle
gb_patpatPatents
gb_phph, phg, phagePhage
gb_plpl, pln, plantPlant
gb_prpr, pri, primPrimate
gb_roro, rodRodent
gb_stst, strStructural RNA
gb_sysy, syn, synthSynthetic sequences (recombinant constructs, etc.)
gb_unun, unanUnannotated
gb_vivi, virViral
gb_est#estExpressed Sequence Tags (short cDNAs)
gb_stsstsSequence Tagged Sites
gb_gssgssGenomic Survey Sequences (large genomic contigs)
gb_htghtgHigh Throughput Genomic sequences (unannotated single pass sequences produced by the genome projects)
gb_tagstagsESTs, STS, gss, and htg


Protein Databases

Database NameShort nameDescription
GenPeptgptranslations of all GenBank sequences
Human ProteinshspRefSeq human proteins
SwissProtsp, swissall annotated protein sequences
TREMBLtrembltranslations of all EMBL sequences
Swissprot+TREMBLsptrembl, tptranslated EMBL + SwissProt
PIRpirComplete NBRF protein database
pir1pir1annotated NBRF sequences
pir2pir2new NBRF sequences
pir3pir3unannotated NBRF sequences
pir4pir4unencoded NBRF sequences
PDBpdb, nrl_3dproteins with known 3-D structures