TY  - JOUR
A1  - Henikoff, Jorja G.
A1  - Henikoff, Steven
T1  - Drosophila Genomic Sequence Annotation Using the BLOCKS+ Database
Y1  - 2000/04/01
JF  - Genome Research
JO  - Genome Research
SP  - 543
EP  - 546
DO  - 10.1101/gr.10.4.543
VL  - 10
IS  - 4
UR  - http://genome.cshlp.org/content/10/4/543.abstract
N2  - A simple and general homology-based method for gene finding was applied to the 2.9-Mb Drosophila melanogaster Adh region, the target sequence of the Genome Annotation Assessment Project (GASP). Each strand of the entire sequence was used as query of the BLOCKS+ database of conserved regions of proteins. This led to functional assignments for more than one-third of the genes and two-thirds of the transposons. Considering the enormous size of the query, the fact that only two false-positive matches were reported emphasizes the high selectivity of protein family-based methods for gene finding. We used the search results to improve BLOCKS+ by identifying compositionally biased blocks. Our results confirm that protein family databases can be used effectively in automated sequence annotation efforts.
ER  -