TY - JOUR A1 - Benos, Panayiotis V. A1 - Gatt, Melanie K. A1 - Murphy, Lee A1 - Harris, David A1 - Barrell, Bart A1 - Ferraz, Concepcion A1 - Vidal, Sophie A1 - Brun, Christine A1 - Demaille, Jacques A1 - Cadieu, Edouard A1 - Dreano, Stephane A1 - Gloux, Stéphanie A1 - Lelaure, Valerie A1 - Mottier, Stephanie A1 - Galibert, Francis A1 - Borkova, Dana A1 - Miñana, Belen A1 - Kafatos, Fotis C. A1 - Bolshakov, Slava A1 - Sidén-Kiamos, Inga A1 - Papagiannakis, George A1 - Spanos, Lefteris A1 - Louis, Christos A1 - Madueño, Encarnación A1 - de Pablos, Beatriz A1 - Modolell, Juan A1 - Peter, Annette A1 - Schöttler, Petra A1 - Werner, Meike A1 - Mourkioti, Fotini A1 - Beinert, Nicole A1 - Dowe, Gordon A1 - Schäfer, Ulrich A1 - Jäckle, Herbert A1 - Bucheton, Alain A1 - Callister, Debbie A1 - Campbell, Lorna A1 - Henderson, Nadine S. A1 - McMillan, Paul J. A1 - Salles, Cathy A1 - Tait, Evelyn A1 - Valenti, Phillipe A1 - Saunders, Robert D.C. A1 - Billaud, Alain A1 - Pachter, Lior A1 - Glover, David M. A1 - Ashburner, Michael T1 - From First Base: The Sequence of the Tip of the X Chromosome of Drosophila melanogaster, a Comparison of Two Sequencing Strategies Y1 - 2001/05/01 JF - Genome Research JO - Genome Research SP - 710 EP - 730 DO - 10.1101/gr.173801 VL - 11 IS - 5 UR - http://genome.cshlp.org/content/11/5/710.abstract N2 - We present the sequence of a contiguous 2.63 Mb of DNA extending from the tip of the X chromosome ofDrosophila melanogaster. Within this sequence, we predict 277 protein coding genes, of which 94 had been sequenced already in the course of studying the biology of their gene products, and examples of 12 different transposable elements. We show that an interval between bands 3A2 and 3C2, believed in the 1970s to show a correlation between the number of bands on the polytene chromosomes and the 20 genes identified by conventional genetics, is predicted to contain 45 genes from its DNA sequence. We have determined the insertion sites ofP-elements from 111 mutant lines, about half of which are in a position likely to affect the expression of novel predicted genes, thus representing a resource for subsequent functional genomic analysis. We compare the European Drosophila Genome Project sequence with the corresponding part of the independently assembled and annotated Joint Sequence determined through “shotgun” sequencing. Discounting differences in the distribution of known transposable elements between the strains sequenced in the two projects, we detected three major sequence differences, two of which are probably explained by errors in assembly; the origin of the third major difference is unclear. In addition there are eight sequence gaps within the Joint Sequence. At least six of these eight gaps are likely to be sites of transposable elements; the other two are complex. Of the 275 genes in common to both projects, 60% are identical within 1% of their predicted amino-acid sequence and 31% show minor differences such as in choice of translation initiation or termination codons; the remaining 9% show major differences in interpretation.[All of the sequences analyzed in this paper have been deposited in the EMBL-Bank database under the following accession nos.: AL009146,AL009147, AL009171, AL009188–AL009196, AL021067, AL021086,AL021106–AL021108, AL021726, AL021728, AL022017, AL022018, AL022139,AL023873, AL023874, AL023893, AL024453, AL024455–AL024457, AL024485,AL030993, AL030994, AL031024–AL031028, AL031128, AL031173, AL031366,AL031367, AL031581–AL031583, AL031640, AL031765, AL031883, AL031884,AL034388, AL034544, AL035104, AL035105, AL035207, AL035245, AL035331,AL035632, AL049535, AL050231, AL050232, AL109630, AL121804, AL121806,AL132651, AL132792, AL132797, AL133503–AL133506, AL138678, AL138971,AL138972, and Z98269. A single file (FASTA format) of the 2.6-Mb contig is available fromftp://ftp.ebi.ac.uk/pub/databases/edgp/contigs/contig_1.fa.] ER -