Database Divisions and Homology Search Files: A Guide for the Perplexed

Table 2. Relationships Between Divisions and Homology Search Files

| Database division | BLAST databases at NCBI | FASTA databases at EBI | Location of “finished” HTG records |
|-------------------|-------------------------|------------------------|------------------------------------|
| BCT               | nr, month               | emall, emnew, ebact    |                                    |
| PRO               |                         | emall, emnew, epro     |                                    |
| FUN               |                         | emall, emnew, efun     |                                    |
| HUM               |                         | emall, emnew, ehum     | H. sapiens (EMBL)                  |
| PRI               | nr, month               |                        | H. sapiens (GenBank)               |
| ROD               | nr, month               | emall, emnew, erod     |                                    |
| MAM               | nr, month               | emall, emnew, emam     |                                    |
| VRT               | nr, month               | emall, emnew, evrt     |                                    |
| INV               | nr, month               | emall, emnew, einv     | C. elegans and D. melanogaster     |
| PLN               | nr, month               | emall, emnew, epln     | A. thaliana                        |
| ORG               |                         | emall, emnew, eorg     |                                    |
| VRL               | nr, month               | emall, emnew, evrl     |                                    |
| PHG               | nr, month               | emall, emnew, ephg     |                                    |
| RNA               | nr, month               | emall, emnew, erna     |                                    |
| SYN               | nr, month               | emall, emnew, esyn     |                                    |
| UNA               | nr, month               | emall, emnew, euna     |                                    |
| EST               | dbest, month            | eest                   |                                    |
| STS               | dbsts, month            | ests                   |                                    |
| GSS               | dbgss, month            | emall, emnew           |                                    |
| HTG               | htgs, month             | emall, emnew           | Includes all “unfinished” HTG      |
| PAT               | nr, month               | emall, emnew, epat     |                                    |

  • (month) A rolling-month database of nucleotide or protein sequences added to nr in the last 28 days.

  • (nr) A nonredundant nucleotide (or protein) database of all sequences, excluding ESTs, STSs, GSSs, and HTGs.

  • (emnew) New EMBL entries since the latest release.

  • (emall) All EMBL entries: the latest release plus new entries. Other FASTA database acronyms are derived from the EMBL division to which they correspond.

  • NCBI offers ecoli as a separate BLAST database for queries against the Escherichia coli genome and protein sequences.

  • NCBI offers yeast as a separate BLAST database for queries against the Saccharomyces cerevisiae genome and protein sequences.

  • NCBI plans to split dbest into three files: human only, mouse only, and all other (nonhuman, nonmouse) ESTs.
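
For readers who want to consult the table programmatically, the mapping can be sketched as a small lookup structure. The Python below is only an illustration. It transcribes five rows of Table 2 by hand, and the names SEARCH_FILES, search_files, and in_rolling_month are hypothetical, not part of any NCBI or EBI tool. The 28-day check assumes the window is counted in whole days and includes both endpoints, which the footnote does not specify.

```python
from datetime import date, timedelta

# A hand-transcribed subset of Table 2 (hypothetical structure, for illustration).
# An empty list means the table lists no search file for that site.
SEARCH_FILES = {
    "PRI": {"ncbi": ["nr", "month"], "ebi": []},
    "HUM": {"ncbi": [], "ebi": ["emall", "emnew", "ehum"]},
    "ROD": {"ncbi": ["nr", "month"], "ebi": ["emall", "emnew", "erod"]},
    "EST": {"ncbi": ["dbest", "month"], "ebi": ["eest"]},
    "HTG": {"ncbi": ["htgs", "month"], "ebi": ["emall", "emnew"]},
}


def search_files(division, site):
    """Return the search files listed for a division at 'ncbi' or 'ebi'."""
    return SEARCH_FILES[division.upper()][site.lower()]


def in_rolling_month(added, today, window_days=28):
    """Return True if `added` falls within the last `window_days` days of `today`.

    Assumption: both endpoints are included; the source only says "last 28 days".
    """
    return timedelta(0) <= today - added <= timedelta(days=window_days)


if __name__ == "__main__":
    print(search_files("EST", "ncbi"))
    print(search_files("HUM", "ebi"))
    # Illustrative dates only, not taken from the article.
    print(in_rolling_month(date(1997, 9, 1), date(1997, 9, 20)))
```

A dictionary keyed by division code keeps the empty cells explicit: a primate (PRI) query has no division-specific FASTA file at EBI, and a HUM query has no BLAST counterpart at NCBI, which mirrors the gaps in the table.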

This Article

  1. Genome Res. 7: 952-955
