Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines
- Wen Huang1,
- Andreas Massouras2,
- Yutaka Inoue3,
- Jason Peiffer1,
- Miquel Rámia4,
- Aaron Tarone5,
- Lavanya Turlapati1,
- Thomas Zichner6,
- Dianhui Zhu7,
- Richard Lyman1,
- Michael Magwire1,
- Kerstin Blankenburg7,
- Mary Anna Carbone8,
- Kyle Chang7,
- Lisa Ellis5,
- Sonia Fernandez7,
- Yi Han7,
- Gareth Highnam9,
- Carl Hjelmen5,
- John Jack1,
- Mewish Javaid7,
- Joy Jayaseelan7,
- Divya Kalra7,
- Sandy Lee7,
- Lora Lewis7,
- Mala Munidasa7,
- Fiona Ongeri7,
- Shohba Patel7,
- Lora Perales7,
- Agapito Perez7,
- LingLing Pu7,
- Stephanie Rollmann1,
- Robert Ruth7,
- Nehad Saada7,
- Crystal Warner10,
- Aneisa Williams7,
- Yuan-Qing Wu7,
- Akihiko Yamamoto1,
- Yiqing Zhang7,
- Yiming Zhu7,
- Robert Anholt1,
- Jan Korbel6,
- David Mittelman9,
- Donna Muzny7,
- Richard Gibbs7,
- Antonio Barbadilla4,
- Spencer Johnston5,
- Eric Stone1,
- Stephen Richards7,
- Bart Deplancke2 and
- Trudy Mackay1,11
- 1 North Carolina State University;
- 2 Ecole Polytechnique Fédérale de Lausanne;
- 3 Osaka University;
- 4 Universitat Autónoma de Barcelona;
- 5 Texas A&M University;
- 6 European Molecular Biology Laboratory;
- 7 Baylor College of Medicine;
- 8 North Carolina State Univerity;
- 9 Virginia Tech;
- 10 Baylor Colege of Medicine
- ↵* Corresponding author; email: trudy_mackay{at}ncsu.edu
Abstract
The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates, and stronger purifying selection on deletions. Weaker selection on insertions than deletions is consistent with our observed distribution of genome size determined by flow cytometry, which is skewed towards larger genomes. Insertion/deletion and single nucleotide polymorphisms are positively correlated with each other and with local recombination, suggesting their non-random distributions are due to hitchhiking and background selection. Our cytogenetic analysis identified 16 polymorphic inversions in the DGRP. Common inverted and standard karyotypes are genetically divergent and account for most of the variation in relatedness among the DGRP lines. Intriguingly, variation in genome size and many quantitative traits are significantly associated with inversions. Approximately 50% of the DGRP lines are infected with Wolbachia, and four lines have germline insertions of Wolbachia sequences, but effects of Wolbachia infection on quantitative traits are rarely significant. The DGRP complements ongoing efforts to functionally annotate the Drosophila genome. Indeed, 15% of all D. melanogaster genes segregate for potentially damaged proteins in the DGRP, and genome wide analyses of quantitative traits identify novel candidate genes. The DGRP lines, sequence data, genotypes, quality scores, phenotypes and analysis and visualization tools are publicly available.
- Received December 20, 2013.
- Accepted April 1, 2014.
- Published by Cold Spring Harbor Laboratory Press
This manuscript is Open Access.
This article, published in Genome Research, is available under a Creative Commons License (Attribution 3.0 Unported), as described at http://creativecommons.org/licenses/by/3.0/.











