AnoEST: Toward A. gambiae functional genomics
Abstract
Here, we present an analysis of 215,634 EST and cDNA sequences of Anopheles gambiae, a major vector of human malaria, structured into the AnoEST database. The expressed sequences are grouped into clusters using the genomic sequence as a template and are associated with inferred functional annotation, including the following: the corresponding Ensembl gene prediction, putative orthologous genes in other species, homology to known proteins, protein domains, associated Gene Ontology terms, and the corresponding classification into broad GO-slim functional groups. AnoEST is a vital resource for interpreting expression profiles derived using recently developed A. gambiae cDNA microarrays. Using these cDNA microarrays, we have experimentally confirmed the expression of 7961 clusters during mosquito development. Of these, 3100 are not associated with currently predicted genes. Moreover, we found that clusters with confirmed expression are not biased with respect to the current gene annotation or homology to known proteins. Consequently, we expect that many as yet unconfirmed clusters are likely to be actual A. gambiae genes. [AnoEST is publicly available at http://komar.embl.de, and is also accessible as a Distributed Annotation Service (DAS).]
Footnotes
- Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3756405. Article published online ahead of print in May 2005. Freely available online through the Genome Research Immediate Open Access option.
- 1 These authors contributed equally to this work.
- 2 Corresponding author. E-mail zdobnov{at}embl.de; fax 49-6221-387-517.
- Accepted April 13, 2005.
- Received January 26, 2005.
- Cold Spring Harbor Laboratory Press











