TY - JOUR A1 - Truong, Duy Tin A1 - Tett, Adrian A1 - Pasolli, Edoardo A1 - Huttenhower, Curtis A1 - Segata, Nicola T1 - Microbial strain-level population structure and genetic diversity from metagenomes Y1 - 2017/02/06 JF - Genome Research JO - Genome Research DO - 10.1101/gr.216242.116 SP - gr.216242.116 UR - http://genome.cshlp.org/content/early/2017/02/06/gr.216242.116.abstract N2 - Among the human health conditions linked to microbial communities, phenotypes are often associated with only a subset of strains within causal microbial groups. While it has been critical for decades in microbial physiology to characterize individual strains, this has been challenging when using culture-independent high-throughput metagenomics. We introduce StrainPhlAn, a novel metagenomic strain identification approach, and apply it to characterize the genetic structure of thousands of strains from >125 species in >1,500 gut metagenomes drawn from populations spanning North/South American, European, Asian, and African countries. The method relies on per-sample dominant sequence variant reconstruction within species-specific marker genes. It identified primarily subject-specific strain variants (<5% inter-subject strain sharing), and we determined that a single strain typically dominated each species and was retained over time (for >70% of species). Microbial population structure was correlated in several distinct ways with the geographic structure of the host population. In some cases discrete subspecies (e.g. for Eubacterium rectale and Prevotella copri) or continuous microbial genetic variations (e.g. for Faecalibacterium prausnitzii) were associated with geographically distinct human populations, whereas few strains occurred in multiple unrelated cohorts. We further estimated the genetic variability of gut microbes, with Bacteroides species appearing remarkably consistent (0.45% median number of nucleotide variants between strains) whereas P. copri was among the most plastic gut colonizers. We thus characterize here the population genetics of previously inaccessible intestinal microbes, providing a comprehensive strain-level genetic overview of the gut microbial diversity. ER -