RT Journal A1 Benos, Panayiotis V. A1 Gatt, Melanie K. A1 Murphy, Lee A1 Harris, David A1 Barrell, Bart A1 Ferraz, Concepcion A1 Vidal, Sophie A1 Brun, Christine A1 Demaille, Jacques A1 Cadieu, Edouard A1 Dreano, Stephane A1 Gloux, Stéphanie A1 Lelaure, Valerie A1 Mottier, Stephanie A1 Galibert, Francis A1 Borkova, Dana A1 Miñana, Belen A1 Kafatos, Fotis C. A1 Bolshakov, Slava A1 Sidén-Kiamos, Inga A1 Papagiannakis, George A1 Spanos, Lefteris A1 Louis, Christos A1 Madueño, Encarnación A1 de Pablos, Beatriz A1 Modolell, Juan A1 Peter, Annette A1 Schöttler, Petra A1 Werner, Meike A1 Mourkioti, Fotini A1 Beinert, Nicole A1 Dowe, Gordon A1 Schäfer, Ulrich A1 Jäckle, Herbert A1 Bucheton, Alain A1 Callister, Debbie A1 Campbell, Lorna A1 Henderson, Nadine S. A1 McMillan, Paul J. A1 Salles, Cathy A1 Tait, Evelyn A1 Valenti, Phillipe A1 Saunders, Robert D.C. A1 Billaud, Alain A1 Pachter, Lior A1 Glover, David M. A1 Ashburner, Michael T1 From First Base: The Sequence of the Tip of the X Chromosome of Drosophila melanogaster, a Comparison of Two Sequencing Strategies JF Genome Research JO Genome Research YR 2001 FD May 01 VO 11 IS 5 SP 710 OP 730 DO 10.1101/gr.173801 UL http://genome.cshlp.org/content/11/5/710.abstract AB We present the sequence of a contiguous 2.63 Mb of DNA extending from the tip of the X chromosome ofDrosophila melanogaster. Within this sequence, we predict 277 protein coding genes, of which 94 had been sequenced already in the course of studying the biology of their gene products, and examples of 12 different transposable elements. We show that an interval between bands 3A2 and 3C2, believed in the 1970s to show a correlation between the number of bands on the polytene chromosomes and the 20 genes identified by conventional genetics, is predicted to contain 45 genes from its DNA sequence. We have determined the insertion sites ofP-elements from 111 mutant lines, about half of which are in a position likely to affect the expression of novel predicted genes, thus representing a resource for subsequent functional genomic analysis. We compare the European Drosophila Genome Project sequence with the corresponding part of the independently assembled and annotated Joint Sequence determined through “shotgun” sequencing. Discounting differences in the distribution of known transposable elements between the strains sequenced in the two projects, we detected three major sequence differences, two of which are probably explained by errors in assembly; the origin of the third major difference is unclear. In addition there are eight sequence gaps within the Joint Sequence. At least six of these eight gaps are likely to be sites of transposable elements; the other two are complex. Of the 275 genes in common to both projects, 60% are identical within 1% of their predicted amino-acid sequence and 31% show minor differences such as in choice of translation initiation or termination codons; the remaining 9% show major differences in interpretation.[All of the sequences analyzed in this paper have been deposited in the EMBL-Bank database under the following accession nos.: AL009146,AL009147, AL009171, AL009188–AL009196, AL021067, AL021086,AL021106–AL021108, AL021726, AL021728, AL022017, AL022018, AL022139,AL023873, AL023874, AL023893, AL024453, AL024455–AL024457, AL024485,AL030993, AL030994, AL031024–AL031028, AL031128, AL031173, AL031366,AL031367, AL031581–AL031583, AL031640, AL031765, AL031883, AL031884,AL034388, AL034544, AL035104, AL035105, AL035207, AL035245, AL035331,AL035632, AL049535, AL050231, AL050232, AL109630, AL121804, AL121806,AL132651, AL132792, AL132797, AL133503–AL133506, AL138678, AL138971,AL138972, and Z98269. A single file (FASTA format) of the 2.6-Mb contig is available fromftp://ftp.ebi.ac.uk/pub/databases/edgp/contigs/contig_1.fa.]