Comparative RNA sequencing reveals substantial genetic variation in endangered primates

George H. Perry; Páll Melsted; John C. Marioni; Ying Wang; Russell Bainer; Joseph K. Pickrell; Katelyn Michelini; Sarah Zehr; Anne D. Yoder; Matthew Stephens; Jonathan K. Pritchard; Yoav Gilad

doi:10.1101/gr.130468.111

Comparative RNA sequencing reveals substantial genetic variation in endangered primates

¹Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA;
²Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois 60637, USA;
³Duke Lemur Center, Duke University, Durham, North Carolina 27705, USA;
⁴Department of Biology, Duke University, Durham, North Carolina 27708, USA;
⁵Department of Evolutionary Anthropology, Duke University, Durham, North Carolina 27708, USA;
⁶Department of Statistics, University of Chicago, Chicago, Illinois 60637, USA

↵7 These authors contributed equally to this work.
↵8 Present address: Faculty of Industrial Engineering, Mechanical Engineering, and Computer Science, University of Iceland, 107 Reykjavik, Iceland.

Next Section

Abstract

Comparative genomic studies in primates have yielded important insights into the evolutionary forces that shape genetic diversity and revealed the likely genetic basis for certain species-specific adaptations. To date, however, these studies have focused on only a small number of species. For the majority of nonhuman primates, including some of the most critically endangered, genome-level data are not yet available. In this study, we have taken the first steps toward addressing this gap by sequencing RNA from the livers of multiple individuals from each of 16 mammalian species, including humans and 11 nonhuman primates. Of the nonhuman primate species, five are lemurs and two are lorisoids, for which little or no genomic data were previously available. To analyze these data, we developed a method for de novo assembly and alignment of orthologous gene sequences across species. We assembled an average of 5721 gene sequences per species and characterized diversity and divergence of both gene sequences and gene expression levels. We identified patterns of variation that are consistent with the action of positive or directional selection, including an 18-fold enrichment of peroxisomal genes among genes whose regulation likely evolved under directional selection in the ancestral primate lineage. Importantly, we found no relationship between genetic diversity and endangered status, with the two most endangered species in our study, the black and white ruffed lemur and the Coquerel's sifaka, having the highest genetic diversity among all primates. Our observations imply that many endangered lemur populations still harbor considerable genetic variation. Timely efforts to conserve these species alongside their habitats have, therefore, strong potential to achieve long-term success.

Comparative genomics is a powerful approach to study evolutionary processes, often used to identify functionally constrained genomic regions (Bejerano et al. 2004; Alexander et al. 2010) or to infer species-specific adaptations and the associated biological mechanisms (Oleksiak et al. 2002; Abzhanov et al. 2006; Gilad et al. 2006; Blekhman et al. 2008). The power of the comparative genomic approach increases with the number of species studied (Drosophila 12 Genomes Consortium 2007). Comparative genomic studies of primates, however, have so far focused mostly on the few species for which complete reference genome sequences are available, namely humans, chimpanzees, orangutans, and rhesus macaques (e.g., Caceres et al. 2003; Khaitovich et al. 2005; The Chimpanzee Sequencing and Analysis Consortium 2005; Gilad et al. 2006; Jiang et al. 2007; Rhesus Macaque Genome Sequencing and Analysis Consortium 2007; Locke et al. 2011).

Genomic data are particularly limited for lemurs (Horvath and Willard 2007), which represent a major primate radiation exclusive to the biodiversity and conservation hotspot of Madagascar (Brooks et al. 2002) and whose habitats have been shrinking rapidly over the past century due to deforestation (Green and Sussman 1990; Harper et al. 2007). Many of the 97 currently recognized lemur species are considered endangered or critically endangered (Mittermeier et al. 2008; International Union for Conservation of Nature 2010). We have very little knowledge of nuclear genetic diversity for any of these endangered species, yet such data are critical for planning conservation efforts because genetic diversity is associated with the risk of extinction (Frankham 2005; Palstra and Ruzzante 2008).

We sought to establish a more comprehensive primate comparative genomic database while simultaneously generating genetic diversity data that would benefit the conservation of endangered species. Since sequencing complete mammalian genomes from a large number of individuals remains prohibitively expensive and because effective DNA capture strategies (e.g., Gnirke et al. 2009)—especially for comparative genomic analysis—require a priori reference genome sequences, we chose an alternative approach for our study. Specifically, we used RNA-sequencing (RNA-seq) combined with a de novo gene assembly strategy to characterize liver transcriptomes from multiple individuals from each of 16 mammalian species, including 12 primates (Fig. 1A). The primates include five lemur species (aye-aye, Coquerel's sifaka, black and white ruffed lemur, crowned lemur, and mongoose lemur) and two other strepsirrhine primates (slow loris and Moholi bushbaby). Since little or no genomic information was previously available for most of the species in our study, we developed a new de novo assembly algorithm, which facilitated comparisons of gene sequences and expression levels within and between individuals and species.

View larger version:

Download as PowerPoint Slide

Figure 1.

Transcript assembly and phylogenetic reconstruction from RNA-seq data. (A) Typical example of an assembled gene, SNF8, with complete cross-species exon conservation. (Red bars) Identified homologies to the human SNF8 RefSeq coding sequence that were used to isolate the appropriate region of the de Bruijn graph during the assembly process. Divergence times are approximate and based on consensus estimates from previous studies. Photos of strepsirrhine primates were kindly provided by David Haring, Duke Lemur Center. (B) Neighbor-joining trees estimated from nucleotide sequence and gene expression data. Nucleotide sequence distance matrix was computed from concatenated multispecies alignments of coding sequences of 515 genes that were assembled for all 16 species. Gene expression pairwise correlation distance matrix was computed for species mean expression estimates using all genes assembled in at least six species (6494 genes). As expected, the known primate phylogeny was recapitulated perfectly from the nucleotide sequence data (see Supplemental Fig. S7 for the tree, also including bushbaby), with the only discrepancy among nonprimate mammals being the juxtaposition of the mouse and armadillo branches, likely explained by long branch attraction that is a common issue in phylogenetic analyses that include rodents (Cannarozzi et al. 2007). Variation in the expression data also follows a phylogenetic pattern but with slow loris erroneously placed outside all other primates and the misplacement of armadillo.

Our approach, therefore, allowed us to analyze nucleotide sequence, expression level, exon structure, and genetic diversity data from thousands of genes per species, using a cost-effective strategy. Using these data, we were able to identify signatures of positive and directional selection in extant and ancestral primate lineages and to examine the relationship between endangered status and genetic diversity across an extensive primate phylogeny.

Previous Section Next Section

Results

To collect comparative genomic diversity data on a large panel of species, we used RNA-seq combined with a de novo gene assembly strategy. We prepared RNA-seq libraries from liver samples from four unrelated individuals for each of 15 species and from two armadillos (Supplemental Methods). Each library was sequenced using one lane of the Illumina Genome Analyzer IIx with paired-end, 76-bp reads (2 × 76 bp). We obtained, on average, 16.4 million (±4.8M) 76-bp paired-end RNA sequencing reads per individual (2.5 Gb of nucleotide sequence per individual). For transcript assembly, we combined the sequence reads from all individuals for each species to generate consensus gene nucleotide sequences. For gene expression and genetic diversity analyses, we considered the sequence data from each individual separately.

De novo transcript assembly

Since sequenced genomes were not available for most of the species in our study, we developed a de Bruijn graph-based approach (Pevzner et al. 2001) for de novo assembly of the transcriptome of each species and simultaneous matching of gene orthologs. Our assembly process is described in detail in the Supplemental Methods (the transcript assembly code is available at http://pritch.bsd.uchicago.edu/software.html). Briefly, for each species we searched the de Bruijn graph for small-scale similarity (in 39-bp windows) to human RefSeq gene sequences (Fig. 1A). These homologous regions were used to set general expectations for transcript coverage levels and to isolate the portion of the graph likely to contain each gene sequence. While this step of our approach relies on the maintenance of sequence similarity between species, simulations demonstrate that our approach is robust to internal exon gains and losses in nonhuman species (Supplemental Methods). Exploring the subgraphs for each gene, we then filtered contigs with lower-than-expected coverage to remove intronic sequences and sequencing errors (Supplemental Fig. S1). We next aligned each remaining path through the graph to the corresponding human RefSeq gene and selected the sequence with the most aligned nucleotides, effectively removing erroneous paths through repetitive elements. We required the presence of at least 50% of the coding region (compared to the corresponding human RefSeq gene) to classify a gene as assembled and to include it in subsequent analyses. Finally, potentially paralogous gene sequences were identified and removed, and the expression levels of the remaining genes were estimated, independently for each sample, based on the number of sequence reads mapped to them. Using this approach, we assembled between 4789 and 5924 gene sequences for 15 species (but only 2680 genes from the bushbaby, probably due to RNA degradation in the bushbaby liver samples [Supplemental Fig. S2]; bushbaby samples were thus excluded from subsequent analyses of the gene expression data).

The availability of high-quality sequenced genomes for six of the species (human, chimpanzee, rhesus macaque, marmoset, mouse, and gray short-tailed opossum) allowed us to test the accuracy of our assembly approach. To do so, we compared gene sequences and estimates of gene expression levels based on the de novo assembly to estimates based on a more conventional genome alignment approach (Supplemental Methods). On average, 98.1% (±1.9%) of the corresponding pairs of de novo assembly and reference genome transcripts had identical or near-identical sequences (≥97%, allowing for polymorphisms) (Supplemental Fig. S3). Estimates of expression levels from the assembled genes and the genome alignment approach were also highly correlated (mean Spearman rank correlation coefficient r > 0.90 for all comparisons) (Supplemental Fig. S4). These observations indicate that the quality of the assembled data is high. This conclusion is further supported by our ability to nearly perfectly recapitulate the known primate phylogeny (Perelman et al. 2011) based on either the sequence data or the estimates of gene expression levels (Fig. 1B; Supplemental Figs. S5–S7).

Genetic diversity and endangered status

We used the RNA-seq data to identify single nucleotide polymorphisms (SNPs) in genes with sequence coverage levels that were sufficient for the accurate identification of heterozygous sites (minimum 15× per strand, 30× total, per individual coverage, for each individual in a species) (Supplemental Methods). On average, we obtained genotypes for 787,744 bp (±341,326 bp) per species from the coding regions of an average of 1170 genes. We used nucleotide diversity at synonymous sites to estimate putatively neutral levels of genetic diversity for each of the 16 species (Supplemental Table S1). To our knowledge, these are the first published estimates of nuclear genome genetic diversity for all but five of the 16 species in our study.

We used several quality control analyses to test the quality of our SNP genotype calls. For human, chimpanzee, rhesus macaque, aye-aye, and mouse, our genetic diversity estimates are generally comparable to those that have been published previously (Yu et al. 2003; Fischer et al. 2004; Voight et al. 2005; Baines and Harr 2007; Hernandez et al. 2007; Perry et al. 2007; Wall et al. 2008; Perry et al. 2010) (Supplemental Methods). We confirmed that our SNP calling strategy is highly accurate (99.4%) by comparing the human genotypes inferred using our approach to genotypes collected using the Illumina 1M-Duo SNP array platform, with the same human samples (Supplemental Methods; Supplemental Fig. S8). Our human sample includes two European Americans and one individual each of East Asian and African ancestry (Supplemental Fig. S9), and as expected, heterozygosity (based on our RNA-seq SNP calls) was highest in the individual of African descent (synonymous sites: 0.126% vs. 0.086%, 0.087%, and 0.091%) (Supplemental Table S2). We also used traditional Sanger sequencing to validate small subsets of SNPs in four species (21/23 human SNPs, 15/16 rhesus macaque SNPs, 21/23 Coquerel's sifaka SNPs, and 15/19 black and white ruffed lemur SNPs were successfully validated [see Supplemental Methods; Supplemental Table S3]), and we evaluated genotype accuracy more generally by assessing consistency among SNPs identified from subsampled sets of reads for each individual of each species (Supplemental Table S4). Finally, we observed an inverse relationship between synonymous site diversity and the ratio of nonsynonymous to synonymous site diversity within each species, as predicted by Nearly Neutral theory (Kimura et al. 1963) (Supplemental Methods; Supplemental Fig. S10). Put together, these analyses suggest that our SNP calling approach performs well.

We then focused on the relationship between nucleotide diversity and conservation status. The conservation status of the species in our study ranges from Least Concern to Critically Endangered, according to the International Union for Conservation of Nature (IUCN) Red List of Threatened Species (International Union for Conservation of Nature 2010). In general, we found no obvious relationship between genetic diversity and conservation status (Fig. 2). The two most endangered primates in our study, the black and white ruffed lemur and the Coquerel's sifaka, have the highest levels of genetic diversity, 3.1 and 5.7 times that of human (synonymous site π = 0.375% from 137,141 synonymous sites, and π = 0.681% from 156,121 sites), respectively. Genetic diversity in humans (π = 0.119% from 167,756 sites) is relatively low compared to other primates. However, the genetic diversity estimate for aye-ayes is substantially lower than that of humans (π = 0.073% from 197,784 sites). Intra-individual estimates of heterozygosity (Supplemental Table S2) for wild-caught animals among each of our Coquerel's sifaka, black and white ruffed lemur, and aye-aye samples suggest that our observations for these species cannot be explained by population structure or by captive population outbreeding strategies (Supplemental Methods).

View larger version:

Download as PowerPoint Slide

Figure 2.

Relationship between genetic diversity and IUCN Red List endangered status. We show average pairwise nucleotide diversity, π, for synonymous sites, as an estimate of neutral levels of genetic diversity for each species. With the exception of the aye-aye, the lemurs in our study tend to have high levels of genetic diversity relative to other primates. The two species in our study considered most endangered by the IUCN, the black and white ruffed lemur and Coquerel's sifaka, have the highest levels of estimated genetic diversity among primates. The relatively low observed genetic diversity estimates for marmoset, armadillo, and opossum may not reflect those that might otherwise be obtained from natural populations, because the individuals from these species in our study are from managed laboratory research colonies.

Gene structure evolution

We proceeded to study patterns of inter-species divergence in exon usage by searching multiple alignments of all available gene sequences across the 16 species for gaps ≥ 50 bp. Since the gene sequences were assembled from RNA sequencing reads, such gaps may indicate fixed inter-species differences in gene/exon structure. Considering the large total divergence time among the species in our study, we were surprised to observe near complete conservation of exon structure among the assembled genes. Specifically, we found only 308 potential exon structure changes across the entire phylogeny. Further analysis of the de Bruijn graph data and multispecies alignments for these genes (Supplemental Methods) suggested that 304 of these gaps were either associated with evidence for alternative splicing or could be explained as alignment artifacts. For example, exon 8 of the KIAA0494 gene was missing from the assembly of all five lemur species in the study, but our analysis of the de Bruijn graph suggested that this result was due to alternative splicing rather than a fixed difference in gene structure between lemurs and other primates. For validation, we sequenced KIAA0494 exon 8 from genomic DNA of lemurs. Alignments of the RNA-seq reads from each species to the predicted exon junctions (Fig. 3A), supported by quantitative PCR experiments (Supplemental Fig. S11), show that exon 8 is usually, but not always, skipped in lemurs, in contrast to the splicing pattern observed in other species.

View larger version:

Download as PowerPoint Slide

Figure 3.

Exon structure divergence and evolution. (A) Phylogenetic shift in splicing and exon usage in the KIAA0494 gene. For each species, the y-axis depicts the number of RNA-seq reads spanning junctions of exons 6–9 (x-axis) based on human reference genome exon positions. Lines representing the number of reads spanning the exon 7 to 9 junction, observed in the overwhelming majority of inferred transcripts in lemurs but only rarely in other species, are highlighted in red. Junctions representing the most common transcript in each species are bolded. (B) Extreme divergence in exon skipping is rare. We mapped our RNA-seq read data against the human and rhesus macaque reference genome sequences to assess patterns of exon usage divergence independently of our assembled gene database (see Supplemental Methods). Shown is a heatmap depicting human vs. rhesus macaque exon skip rates. Included in this plot are all exons with at least 10 reads covering junctions, summed across all individuals of both species, and at least eight reads entering, exiting, or skipping the exon in each species. The number of exons with significant, complete divergence skip rates (i.e., exons always skipped in one species and never skipped in the other; three total), are shown by arrows in the upper left and lower right boxes of the heatmap. (C) Density plot comparing the absolute difference in human versus rhesus macaque exon skip rates to estimated expression levels (human) for the gene containing that exon, for all identified exons with evidence of alternative splicing or differential exon usage, regardless of expression level. Mean and 95th/fifth percentiles are depicted as solid and dashed red lines, respectively. Lower-expressed genes are more likely to harbor exons with larger between-species exon usage differences, reflecting either statistical artifacts or relatively lower constraint on exon structure and splicing on lower-expressed genes, or both.

Thus, using these approaches, we could find only four examples of actual fixed inter-species changes in exon structure in liver-expressed genes, in which certain exons are always skipped in at least one species but never in others. An independent analysis, restricted to species for which sequenced genomes were available, yielded similar results of strong exon structure conservation (Fig. 3B,C; Supplemental Methods). Our results suggest that the absolute gain or loss of individual, nonrepetitive exons has occurred only rarely among single-copy, intermediately and highly expressed genes in primate evolution.

Natural selection at the gene regulatory and sequence levels

Finally, we identified patterns of within- and between-species variation in the sequence and gene expression data that were consistent with the action of positive or directional selection. These analyses were based on lineage-specific ratios of the rates of nonsynonymous to synonymous substitution (d_N/d_S) estimated by maximum-likelihood (Yang 2007) and by testing for relatively large lineage-specific changes in gene expression levels using a Brownian motion model of gene expression evolution (e.g., Bedford and Hartl 2009), respectively (Supplemental Methods). Importantly, our sampling scheme allowed us to infer the action of natural selection on both external and ancestral branches of the phylogeny (for examples, see Supplemental Fig. S12). Overall, we identified 499 candidate genes whose rapid sequence or regulatory evolution may have played important roles in the adaptations of individual species or the ancestors of subsets of those species (see Supplemental Tables S5, S6 for a complete gene list). While it is unlikely that all 499 candidate genes were subjected to positive or directional selection at the amino acid sequence or regulatory levels, this set of candidates is likely enriched for such genes. Given the important metabolism and detoxification functions of the liver, some of these changes could reflect adaptations related to the extensive dietary diversity among the species in our study.

The relevant fossil record is particularly limited for ancestral primates (Tavare et al. 2002). Therefore, identifying conspicuous signatures of natural selection on this branch was of particular interest. For example, we found a strong signal of positive selection in the ancestral primate lineage in the gamma-glutamyl hydrolase (GGH) gene (Fig. 4A). The GGH enzyme is critical for folate metabolism and homeostasis and was previously shown to have exopeptidase activity in humans but endopeptidase activity in rodents, along with other enzymatic activity differences (Yao et al. 1996). Thus, the human-rodent functional differences in this protein might be explained by adaptive nucleotide substitutions that occurred in ancestral primate lineages. At the gene regulatory level, of the 33 top-ranked genes with relatively large ancestral primate lineage shifts in expression levels, nine are involved in peroxisome functioning, corresponding to an 18-fold enrichment over that expected by chance alone (based on Gene Ontology functional annotations; FDR = 7 × 10⁻⁹; genes PEX7, HACL1, IDE, SCP2, PEX13, LONP2, ACOX3, MGST1, and PHYH) (Fig. 4B; Supplemental Fig. S12). Peroxisomes are organelles that function in the breakdown of long-chain fatty acids by β-oxidation, the detoxification of hydrogen peroxide by catalase, the synthesis of bile acids, and cholesterol homeostasis in general (Islinger et al. 2010). We note that we were unable to identify any experimental-based evidence in the literature of peroxisomal functioning for the product of MGST; the GO functional annotation in this case might be erroneous.

View larger version:

Download as PowerPoint Slide

Figure 4.

Positive and directional selection in the ancestral primate branch. (A) Ratios of the maximum likelihood-estimated (Yang 2007) rates of nonsynonymous (amino acid changing) to synonymous substitution (d_N/d_S) for the GGH gene shown directly above each branch. Values of d_N/d_S > 1, highlighted in red and with the number of estimated nonsynonymous (N) and synonymous (S) substitutions shown, are consistent with the past action of positive selection on several ancestral branches of the tree. (B) Relative gene expression branch lengths estimated from 4562 genes without peroxisomal functions and from 60 peroxisomal genes, considering genes with sufficient species representation for analysis of the ancestral primate branch (see Supplemental Methods). The ancestral primate branch, highlighted in red, is relatively 4.4 times longer among the peroxisomal gene set. Nine of the 33 top-ranked genes for patterns of expression consistent with directional selection on the ancestral primate lineage function play roles in the functioning of the peroxisome, significantly more than expected by chance (FDR = 7 × 10⁻⁹). The two phylogenies are plotted such that the sums of all branch lengths, excepting the ancestral primate lineage, are equal. The relative lengths of the ancestral primate branches of each phylogeny are shown (the value for the nonperoxisomal genes phylogeny was set to 1.0).

Previous Section Next Section

Discussion

We collected RNA-seq data from the liver transcriptomes of multiple individuals from each of 16 mammalian species, including 12 primates, and performed de novo assembly of an average of 5721 genes per species. For many of the primate species in our study, our effort represents the first opportunity to examine nucleotide sequence, gene expression, exon structure, and genetic diversity data on a genomic scale.

We developed a new transcriptome assembly algorithm, primarily because none were available when we initiated our study. Several alternative algorithms that can be used for transcriptome assembly have been released recently, including Trans-ABySS (Robertson et al. 2010) and Oases (http://www.ebi.ac.uk/∼zerbino/oases/), which function by interpreting output from the whole genome assemblers, ABySS (Simpson et al. 2009) and Velvet (Zerbino and Birney 2008), respectively, and Trinity (Grabherr et al. 2011), which directly performs de novo transcriptome assembly. All of these algorithms, including ours, use the de Bruijn graph framework (Pevzner et al. 2001).

We have not evaluated and compared the performance of the different algorithms, as this is beyond the scope of our study. Our assembly method differs from other existing tools in several respects, as described in the Supplemental Methods. In particular, our algorithm was specifically developed to facilitate subsequent comparative genomic analyses; it is unique in its use of a sequence similarity-based comparative assembly approach, thereby establishing multispecies gene orthology as a property of the initial assembly. This aspect of our approach facilitates direct inter-species comparison of gene sequences and expression levels in an evolutionary framework.

Comparative primate genomics

Whereas previous primate comparative genomic studies have focused mainly on humans, apes, and Old World monkeys, we were able to examine the evolutionary histories of gene sequences and expression levels in the context of a relatively comprehensive primate phylogeny. Our sample of species included representatives from both primate suborders: haplorhines (humans, chimpanzees, Old and New World monkeys) and strepsirrhines (lemurs and lorises). Thus, an important property of our study design is that it provided one of the first opportunities to identify evolutionary patterns both among lemurs and in ancestral primate lineages, without the need for full genome sequences from these species.

To limit errors in the de novo assembly and orthologous gene identification process, it was necessary to discard data from duplicated genes. Additionally, we assembled genes from nonhuman species on the basis of small-scale sequence similarity to human RefSeq genes. Our analyses, therefore, were focused on single-copy genes expressed in the liver and present in the human genome. Of such genes, our set of 499 candidate genes provides an important starting point for developing hypotheses concerning the adaptive evolutionary histories of previously unstudied extant species and ancestral primate lineages. For example, our observation of an 18-fold enrichment of peroxisomal genes among those whose regulation possibly evolved under directional selection in the ancestral primate lineage may be of particular interest. While there are known functional differences between macaque and rodent peroxisomes (Hoivik et al. 2004), comparative data from dogs suggested that those differences are likely explained by derived changes in rodents, not primates (Foxworthy et al. 1990). Differences have also been observed in peroxisomal gene functioning and peroxisomal lipid metabolism between apes or humans and other primates (Somel et al. 2008; Keebaugh and Thomas 2010; Watkins et al. 2010). In contrast, our results suggest a different, major biological distinction in the regulation of peroxisome-related genes between all primates and other mammals, possibly driven by adaptive events that occurred in the ancestral primate lineage. Therefore, characterization of the functional consequences of this regulatory difference may ultimately lead to new insights concerning a little understood, but critical, time period in primate evolution.

Lemur genetic diversity

The recent history of rapid deforestation, habitat loss, and political instability in Madagascar has placed many lemurs at particular risk. Prior to this study, nuclear genetic diversity data based on nucleotide sequence data were not available for any lemur besides the aye-aye (Perry et al. 2007), although genetic diversity estimates based on microsatellite data are available for several other species (e.g., Fredsted et al. 2005; Louis et al. 2005; Lawler 2008; Pastorini et al. 2009; Quemere et al. 2010; Razakamaharavo et al. 2010). Genetic diversity data can have high importance in developing informed and effective conservation strategies, due to the association between genetic diversity and the risk of extinction (Frankham 2005; Palstra and Ruzzante 2008). For example, conservation biologists are faced with particular challenges when working with species with low genetic diversity (e.g., the cheetah) (O'Brien et al. 1983, 1985; O'Brien and Johnson 2005).

When we compared levels of neutral genetic diversity estimated from synonymous sites to the conservation status established by the IUCN for each species (International Union for Conservation of Nature 2010), we did not observe a clear pattern of association (Fig. 2). This result is not necessarily a surprise for the lemur species in this study, considering that the most extreme deforestation and habitat loss in Madagascar occurred only in the last 50 yr, likely too recent to alone induce dramatic effects on lemur genetic diversity. Yet, observations of unusually low genetic diversity for lemur species currently considered less endangered or of high genetic diversity for more endangered species may impact conservation priorities and practicalities.

Aye-ayes, considered only Near Threatened by the IUCN, have the lowest estimated genetic diversity of any species in our study. Recently, lemur conservation scientists have recommended that the status of aye-ayes be elevated to Endangered (Mittermeier et al. 2010). We would support this notion based on the combination of the genetic diversity results reported here and our still-limited knowledge of aye-aye behavior. Specifically, while aye-ayes have a broad species distribution across Madagascar, they are largely solitary, with huge individual ranges and low population densities (Ancrenaz et al. 1994)—a potentially ominous demographic profile in the face of continued forest fragmentation and already low genetic diversity.

In contrast, two of the most endangered species, the black and white ruffed lemur and Coquerel's sifaka, have the highest genetic diversity estimates of any primate—3.1 and 5.7 times that of humans, respectively. The Critically Endangered black and white ruffed lemur has experienced rapid population declines in the last quarter century due to habitat disturbance, their ecological reliance on primary forest, and extensive human hunting pressure (International Union for Conservation of Nature 2010). They have a predominantly frugivorous diet and, as major seed dispersers, could be considered critical to the long-term viability of some of Madagascar's forests. Relatively high genetic diversity should benefit black and white ruffed lemur conservation and reintroduction efforts.

Previous Section Next Section

Conclusion

With the advent and continued development of new sequencing technologies and assembly methods, we are able to easily characterize natural genetic and regulatory variation in a wide range of species. We are no longer limited to working on species with publicly available, sequenced genomes, which are mostly model organisms relevant to human disease studies. We, therefore, expect large and broad comparative genomic studies to become common. Such studies will increase our understanding of adaptation by allowing us to reconstruct events that occurred on ancestral lineages at unprecedented resolution. This framework also provides an opportunity to truly harness genomic studies in the service of conservation efforts (Allendorf et al. 2010; Frankham 2010).

Previous Section Next Section

Methods

Overview

We isolated total RNA from liver tissues harvested within 4 h of death and then stored at −80°C to preserve RNA quality. Following mRNA isolation with oligo-dT magnetic beads (Invitrogen), RNA libraries were prepared and sequenced on an Illumina Genome Analyzer IIx for 76 bp from both ends of each sequence fragment (paired-end; 2 × 76 bp), using one flowcell lane per sample. Since no sequenced genome was available for most of the species in our study, we developed a de Bruijn graph-based approach (Pevzner et al. 2001) for de novo assembly of the transcriptome of each species and simultaneous matching of gene orthologs (described in detail in Supplemental Methods). We generated multispecies alignments of the assembled gene sequences to study the evolution of gene coding sequences (Yang 2007). We also aligned the RNA-seq reads from each individual to the assembled gene sequences of each respective species for SNP analysis and estimation and evolutionary analysis of gene expression levels.

Estimating gene expression levels

To estimate the expression level of each gene, for each sample we first aligned the sequenced reads against a reference containing the sequences of the set of assembled genes for the appropriate species using BWA (Li and Durbin 2009) with default parameters, considering only uniquely mapped reads. For this analysis, we analyzed separately the two reads of each pair. To account for alternative splicing, individual reads not aligned in the first step were evaluated and scored using a gapped alignment approach (Pickrell et al. 2010), described in detail in Supplemental Methods.

For our evolutionary analysis of gene expression levels, we chose to consider orthologous gene regions across species rather than the fully assembled gene sequence from each species. That is, if the full gene sequence was not assembled for every species, then we restricted our analysis to the specific region of the gene that was commonly assembled across species. This approach makes it less likely that our inter-species comparison of gene expression levels would be affected by sequencing biases or the inclusion of alternatively spliced exons in some species only. To do so, we performed a multispecies alignment (Bradley et al. 2009) and identified the maximum orthologous region that was fully aligned across all species. Reads contributing to a gene's expression level were restricted to those falling in the maximum orthologous region, which was itself constrained to exclude noncoding regions (i.e., UTRs were not included in the gene expression analysis). We used the total number of reads mapping to the identified orthologous region of a transcript as a measure of its expression level. The data were then normalized and adjusted for GC content using procedures described in full in Supplemental Methods.

SNP identification

We aligned all reads from each individual to the database of consensus sequence transcripts that was assembled for the relevant species, using the default parameters of BWA (Li and Durbin 2009). In the final preparation step of the RNA-seq libraries, there is a PCR amplification step that uses the ligated adapter sequences as primer sites for consistent amplification. To help limit any bias from PCR amplification in the SNP identification process, we performed a filtering step to consider only one read pair from each uniquely aligned starting position and strand. Specifically, if two paired reads each had the same start position for read 1 but different start positions from read 2, then these reads were considered to have originated independently and were both kept in the analysis. When more than one paired read had identical aligned start positions (at both ends), we kept one read at random and excluded the remaining reads from further analysis. For this filtering decision, we ignored the alignment quality score, as single nucleotide differences from the consensus sequence due to true SNPs could have subtle effects on that score. That said, we did not consider any base call with a phred-scaled quality score lower than 30.

To establish SNP identification criteria, we systematically assessed genotyping accuracy as a function of multiple different per-strand coverage requirements and “SNP call definitions” based on the proportion of the most common nucleotide at each site. By “SNP call definition,” we mean the threshold at which a heterozygous site would be called, when the proportion of reads with the most common nucleotide at a given position was at or below that threshold (for reads aligning to both strands). By requiring the SNP definition to be met by reads mapped to each strand, we limited the effects of potential strand-specific sequencing biases (Nakamura et al. 2011). Examples of SNP call definitions that we considered were ≤0.6, ≤0.65, ≤0.7, ≤0.75, etc.

To determine the coverage requirement and SNP call definition thresholds, we compared SNP genotypes from the 1M-Duo Illumina SNP array platform data collected for each of the four human samples in the study to the variants inferred from the RNA-seq data using our method (Supplemental Fig. S8). Based on this analysis, we chose to assess all sites covered by a minimum of 15 sequence reads per strand (minimum of 30 total reads), and, of such sites, we classified as heterozygous those for which the proportion of the most common nucleotide was ≤0.7 on each strand. This approach for SNP calling is generally similar to that which we previously used with genomic DNA sequencing data and found to result in highly accurate SNP identification (Perry et al. 2010).

Finally, we performed a subsampling analysis with the reads from each individual. For this analysis, reads were randomly distributed into two subsets. SNPs were identified from each subset of the data using the coverage and SNP call definition threshold criteria described above. We then determined the consistency of SNP inferences in the subsampled data within each individual. We removed three samples—one chimpanzee and two aye-ayes—from further SNP analysis due to relatively low concordance in heterozygous site identification in the subsample analysis (Supplemental Table S4).

For each species, we estimated genotypes for all sites with sufficient coverage for SNP identification in all individuals (n = 2 for armadillo and aye-aye, n = 3 for chimpanzee, n = 4 for all other species). We classified all heterozygous positions as well as any sites with homozygous differences between individuals as SNPs. Species-level estimates of genetic diversity π (average pairwise genetic distance) and θ (sample-size corrected proportion of segregating sites) were computed for all genes with at least 100 sites with sufficient coverage for SNP identification in each individual of that species and are provided in Supplemental Table S1.

Previous Section Next Section

Data access

Paired end 76 × 76-bp sequencing data obtained in this study have been submitted to the NCBI Sequence Read Archive (SRA) (http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi) under accession number SRA046085. The transcript assembly code used in this study is available at http://pritch.bsd.uchicago.edu/software.html. The full database of assembled gene sequences, full gene multispecies alignments, orthologous coding region multispecies alignments, lineage-specific d_N/d_S results, normalized gene expression estimates, log likelihood ratios for lineage-specific expression level changes, and the identified SNPs and genotype data for each species are available as a Supplemental Database file on the Genome Research website and at http://giladlab.uchicago.edu/data.html.

Previous Section Next Section

Acknowledgments

We thank the Duke Lemur Center, National Disease Research Interchange, Yerkes National Primate Research Center, Southwest Foundation for Biomedical Research, Alpha Genesis, David Fitzpatrick, Julie Heiner, Matt Dean, Michael Nachman, and Richard Truman for providing the samples used in this study. The lemur, loris, and bushbaby photographs in the phylogeny figures were provided by David Haring, Duke Lemur Center. Marmoset, treeshrew, mouse, armadillo, and opossum photographs are from Wikimedia Commons. We thank Z. Gauhar, P. Gagneux, E. Louis, and O. Ryder for useful discussions and/or comments on the manuscript. This work was funded by the Howard Hughes Medical Institute to J.K.P., and by NIH grant GM077959 to Y.G. G.H.P. was supported by N.I.H. fellowship F32GM085998.

Previous Section Next Section

Footnotes

↵9 Corresponding authors.

E-mail ghp3{at}psu.edu.

E-mail pritch{at}uchicago.edu.

E-mail gilad{at}uchicago.edu.
[Supplemental material is available for this article.]
Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.130468.111.

Received August 10, 2011.
Accepted December 2, 2011.

Copyright © 2012 by Cold Spring Harbor Laboratory Press

Freely available online through the Genome Research Open Access option.

Previous Section

References

↵
1. Abzhanov A,
2. Kuo WP,
3. Hartmann C,
4. Grant BR,
5. Grant PR,
6. Tabin CJ
Abzhanov A, Kuo WP, Hartmann C, Grant BR, Grant PR, Tabin CJ. 2006. The calmodulin pathway and evolution of elongated beak morphology in Darwin's finches. Nature 442: 563–567.
CrossRef Medline Google Scholar
↵
1. Alexander RP,
2. Fang G,
3. Rozowsky J,
4. Snyder M,
5. Gerstein MB
Alexander RP, Fang G, Rozowsky J, Snyder M, Gerstein MB. 2010. Annotating noncoding regions of the genome. Nat Rev Genet 11: 559–571.
CrossRef Medline Google Scholar
↵
1. Allendorf FW,
2. Hohenlohe PA,
3. Luikart G
Allendorf FW, Hohenlohe PA, Luikart G. 2010. Genomics and the future of conservation genetics. Nat Rev Genet 11: 697–709.
CrossRef Medline Google Scholar
↵
1. Ancrenaz M,
2. Lackman-Ancrenaz I,
3. Mundy N
Ancrenaz M, Lackman-Ancrenaz I, Mundy N. 1994. Field observations of aye-ayes (Daubentonia madagascariensis) in Madagascar. Folia Primatol (Basel) 62: 22–36.
CrossRef Medline Google Scholar
↵
1. Baines JF,
2. Harr B
Baines JF, Harr B. 2007. Reduced X-linked diversity in derived populations of house mice. Genetics 175: 1911–1921.
CrossRef Medline Google Scholar
↵
1. Bedford T,
2. Hartl DL
Bedford T, Hartl DL. 2009. Optimization of gene expression by natural selection. Proc Natl Acad Sci 106: 1133–1138.
Abstract/FREE Full Text
↵
1. Bejerano G,
2. Pheasant M,
3. Makunin I,
4. Stephen S,
5. Kent WJ,
6. Mattick JS,
7. Haussler D
Bejerano G, Pheasant M, Makunin I, Stephen S, Kent WJ, Mattick JS, Haussler D. 2004. Ultraconserved elements in the human genome. Science 304: 1321–1325.
Abstract/FREE Full Text
↵
1. Blekhman R,
2. Oshlack A,
3. Chabot AE,
4. Smyth GK,
5. Gilad Y
Blekhman R, Oshlack A, Chabot AE, Smyth GK, Gilad Y. 2008. Gene regulation in primates evolves under tissue-specific selection pressures. PLoS Genet 4: e1000271. doi: 10.1371/journal.pgen.1000271.
CrossRef Medline Google Scholar
↵
1. Bradley RK,
2. Roberts A,
3. Smoot M,
4. Juvekar S,
5. Do J,
6. Dewey C,
7. Holmes I,
8. Pachter L
Bradley RK, Roberts A, Smoot M, Juvekar S, Do J, Dewey C, Holmes I, Pachter L. 2009. Fast statistical alignment. PLoS Comput Biol 5: e1000392. doi: 10.1371/journal.pcbi.1000392.
CrossRef Medline Google Scholar
↵
1. Brooks TM,
2. Mittermeier RA,
3. Mittermeier CG,
4. da Fonseca GAB,
5. Rylands AB,
6. Konstant WR,
7. Flick P,
8. Pilgrim J,
9. Oldfield S,
10. Magin G
Brooks TM, Mittermeier RA, Mittermeier CG, da Fonseca GAB, Rylands AB, Konstant WR, Flick P, Pilgrim J, Oldfield S, Magin G, et al. 2002. Habitat loss and extinction in the hotspots of biodiversity. Conserv Biol 16: 909–923.
CrossRef Google Scholar
↵
1. Caceres M,
2. Lachuer J,
3. Zapala MA,
4. Redmond JC,
5. Kudo L,
6. Geschwind DH,
7. Lockhart DJ,
8. Preuss TM,
9. Barlow C
Caceres M, Lachuer J, Zapala MA, Redmond JC, Kudo L, Geschwind DH, Lockhart DJ, Preuss TM, Barlow C. 2003. Elevated gene expression levels distinguish human from nonhuman primate brains. Proc Natl Acad Sci 100: 13030–13035.
Abstract/FREE Full Text
↵
1. Cannarozzi G,
2. Schneider A,
3. Gonnet G
Cannarozzi G, Schneider A, Gonnet G. 2007. A phylogenomic study of human, dog, and mouse. PLoS Comput Biol 3: e2. doi: 10.1371/journal.pcbi.0030002.
CrossRef Medline Google Scholar
↵

The Chimpanzee Sequencing and Analysis Consortium. 2005. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437: 69–87.

CrossRef Medline Google Scholar
↵

Drosophila 12 Genomes Consortium. 2007. Evolution of genes and genomes on the Drosophila phylogeny. Nature 450: 203–218.

CrossRef Medline Google Scholar
↵
1. Fischer A,
2. Wiebe V,
3. Paabo S,
4. Przeworski M
Fischer A, Wiebe V, Paabo S, Przeworski M. 2004. Evidence for a complex demographic history of chimpanzees. Mol Biol Evol 21: 799–808.
Abstract/FREE Full Text
↵
1. Foxworthy PS,
2. White SL,
3. Hoover DM,
4. Eacho PI
Foxworthy PS, White SL, Hoover DM, Eacho PI. 1990. Effect of ciprofibrate, bezafibrate, and LY171883 on peroxisomal β-oxidation in cultured rat, dog, and rhesus monkey hepatocytes. Toxicol Appl Pharmacol 104: 386–394.
CrossRef Medline Google Scholar
↵
1. Frankham R
Frankham R. 2005. Genetics and extinction. Biol Conserv 126: 131–140.
CrossRef Google Scholar
↵
1. Frankham R
Frankham R. 2010. Challenges and opportunities of genetic approaches to biological conservation. Biol Conserv 143: 1919–1927.
CrossRef Google Scholar
↵
1. Fredsted T,
2. Pertoldi C,
3. Schierup MH,
4. Kappeler PM
Fredsted T, Pertoldi C, Schierup MH, Kappeler PM. 2005. Microsatellite analyses reveal fine-scale genetic structure in grey mouse lemurs (Microcebus murinus). Mol Ecol 14: 2363–2372.
Medline Google Scholar
↵
1. Gilad Y,
2. Oshlack A,
3. Smyth GK,
4. Speed TP,
5. White KP
Gilad Y, Oshlack A, Smyth GK, Speed TP, White KP. 2006. Expression profiling in primates reveals a rapid evolution of human transcription factors. Nature 440: 242–245.
CrossRef Medline Google Scholar
↵
1. Gnirke A,
2. Melnikov A,
3. Maguire J,
4. Rogov P,
5. LeProust EM,
6. Brockman W,
7. Fennell T,
8. Giannoukos G,
9. Fisher S,
10. Russ C
Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust EM, Brockman W, Fennell T, Giannoukos G, Fisher S, Russ C, et al. 2009. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol 27: 182–189.
CrossRef Medline Google Scholar
↵
1. Grabherr MG,
2. Haas BJ,
3. Yassour M,
4. Levin JZ,
5. Thompson DA,
6. Amit I,
7. Adiconis X,
8. Fan L,
9. Raychowdhury R,
10. Zeng Q
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, et al. 2011. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644–652.
CrossRef Medline Google Scholar
↵
1. Green GM,
2. Sussman RW
Green GM, Sussman RW. 1990. Deforestation history of the eastern rain forests of Madagascar from satellite images. Science 248: 212–215.
Abstract/FREE Full Text
↵
1. Harper GJ,
2. Steininger MK,
3. Tucker CJ,
4. Juhn D,
5. Hawkins F
Harper GJ, Steininger MK, Tucker CJ, Juhn D, Hawkins F. 2007. Fifty years of deforestation and forest fragmentation in Madagascar. Environ Conserv 34: 325–333.
Google Scholar
↵
1. Hernandez RD,
2. Hubisz MJ,
3. Wheeler DA,
4. Smith DG,
5. Ferguson B,
6. Rogers J,
7. Nazareth L,
8. Indap A,
9. Bourquin T,
10. McPherson J
Hernandez RD, Hubisz MJ, Wheeler DA, Smith DG, Ferguson B, Rogers J, Nazareth L, Indap A, Bourquin T, McPherson J, et al. 2007. Demographic histories and patterns of linkage disequilibrium in Chinese and Indian rhesus macaques. Science 316: 240–243.
Abstract/FREE Full Text
↵
1. Hoivik DJ,
2. Qualls CW Jr.,
3. Mirabile RC,
4. Cariello NF,
5. Kimbrough CL,
6. Colton HM,
7. Anderson SP,
8. Santostefano MJ,
9. Morgan RJ,
10. Dahl RR
Hoivik DJ, Qualls CW Jr., Mirabile RC, Cariello NF, Kimbrough CL, Colton HM, Anderson SP, Santostefano MJ, Morgan RJ, Dahl RR, et al. 2004. Fibrates induce hepatic peroxisome and mitochondrial proliferation without overt evidence of cellular proliferation and oxidative stress in cynomolgus monkeys. Carcinogenesis 25: 1757–1769.
Abstract/FREE Full Text
↵
1. Horvath JE,
2. Willard HF
Horvath JE, Willard HF. 2007. Primate comparative genomics: Lemur biology and evolution. Trends Genet 23: 173–182.
CrossRef Medline Google Scholar
↵

International Union for Conservation of Nature. 2010. Red List of Threatened Species Version 2010.4. http://www.iucnredlist.org.
1. Islinger M,
2. Cardoso MJ,
3. Schrader M
Islinger M, Cardoso MJ, Schrader M. 2010. Be different–the diversity of peroxisomes in the animal kingdom. Biochim Biophys Acta 1803: 881–897.
CrossRef Medline Google Scholar
↵
1. Jiang Z,
2. Tang H,
3. Ventura M,
4. Cardone MF,
5. Marques-Bonet T,
6. She X,
7. Pevzner PA,
8. Eichler EE
Jiang Z, Tang H, Ventura M, Cardone MF, Marques-Bonet T, She X, Pevzner PA, Eichler EE. 2007. Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution. Nat Genet 39: 1361–1368.
CrossRef Medline Google Scholar
↵
1. Keebaugh AC,
2. Thomas JW
Keebaugh AC, Thomas JW. 2010. The evolutionary fate of the genes encoding the purine catabolic enzymes in hominoids, birds, and reptiles. Mol Biol Evol 27: 1359–1369.
Abstract/FREE Full Text
↵
1. Khaitovich P,
2. Hellmann I,
3. Enard W,
4. Nowick K,
5. Leinweber M,
6. Franz H,
7. Weiss G,
8. Lachmann M,
9. Paabo S
Khaitovich P, Hellmann I, Enard W, Nowick K, Leinweber M, Franz H, Weiss G, Lachmann M, Paabo S. 2005. Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science 309: 1850–1854.
Abstract/FREE Full Text
↵
1. Kimura M,
2. Maruyama T,
3. Crow JF
Kimura M, Maruyama T, Crow JF. 1963. The mutation load in small populations. Genetics 48: 1303–1312.
Medline Google Scholar
↵
1. Lawler RR
Lawler RR. 2008. Testing for a historical population bottleneck in wild Verreaux's sifaka (Propithecus verreauxi verreauxi) using microsatellite data. Am J Primatol 70: 990–994.
CrossRef Medline Google Scholar
↵
1. Li H,
2. Durbin R
Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760.
Abstract/FREE Full Text
↵
1. Locke DP,
2. Hillier LW,
3. Warren WC,
4. Worley KC,
5. Nazareth LV,
6. Muzny DM,
7. Yang SP,
8. Wang Z,
9. Chinwalla AT,
10. Minx P
Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang SP, Wang Z, Chinwalla AT, Minx P, et al. 2011. Comparative and demographic analysis of orang-utan genomes. Nature 469: 529–533.
CrossRef Medline Google Scholar
↵
1. Louis EE,
2. Ratsimbazafy JH,
3. Razakamaharauo VR,
4. Pierson DJ,
5. Barber RC,
6. Brenneman RA
Louis EE, Ratsimbazafy JH, Razakamaharauo VR, Pierson DJ, Barber RC, Brenneman RA. 2005. Conservation genetics of black and white ruffed lemurs, Varecia variegata, from Southeastern Madagascar. Anim Conserv 8: 105–111.
CrossRef Google Scholar
↵
1. Mittermeier RA,
2. Ganzhorn JU,
3. Konstant WR,
4. Glander K,
5. Tattersall I,
6. Groves CP,
7. Rylands AB,
8. Hapke A,
9. Ratsimbazafy J,
10. Mayor MI
Mittermeier RA, Ganzhorn JU, Konstant WR, Glander K, Tattersall I, Groves CP, Rylands AB, Hapke A, Ratsimbazafy J, Mayor MI, et al. 2008. Lemur diversity in Madagascar. Int J Primatol 29: 1607–1656.
CrossRef Google Scholar
↵
1. Mittermeier RA,
2. Louis EE,
3. Richardson M,
4. Schwitzer C,
5. Langrand O,
6. Rylands AB,
7. Hawkins F,
8. Rajaobelina S,
9. Ratsimbazafy J,
10. Rasoloarison R
Mittermeier RA, Louis EE, Richardson M, Schwitzer C, Langrand O, Rylands AB, Hawkins F, Rajaobelina S, Ratsimbazafy J, Rasoloarison R, et al. 2010. Lemurs of Madagascar. Conservation International, Arlington, VA.
Google Scholar
↵
1. Nakamura K,
2. Oshima T,
3. Morimoto T,
4. Ikeda S,
5. Yoshikawa H,
6. Shiwa Y,
7. Ishikawa S,
8. Linak MC,
9. Hirai A,
10. Takahashi H
Nakamura K, Oshima T, Morimoto T, Ikeda S, Yoshikawa H, Shiwa Y, Ishikawa S, Linak MC, Hirai A, Takahashi H, et al. 2011. Sequence-specific error profile of Illumina sequencers. Nucleic Acids Res 39: e90. doi: 10.1093/nar/gkr344.
Abstract/FREE Full Text
↵
1. O'Brien SJ,
2. Johnson WE
O'Brien SJ, Johnson WE. 2005. Big cat genomics. Annu Rev Genomics Hum Genet 6: 407–429.
CrossRef Medline Google Scholar
↵
1. O'Brien SJ,
2. Wildt DE,
3. Goldman D,
4. Merril CR,
5. Bush M
O'Brien SJ, Wildt DE, Goldman D, Merril CR, Bush M. 1983. The cheetah is depauperate in genetic variation. Science 221: 459–462.
Abstract/FREE Full Text
↵
1. O'Brien SJ,
2. Roelke ME,
3. Marker L,
4. Newman A,
5. Winkler CA,
6. Meltzer D,
7. Colly L,
8. Evermann JF,
9. Bush M,
10. Wildt DE
O'Brien SJ, Roelke ME, Marker L, Newman A, Winkler CA, Meltzer D, Colly L, Evermann JF, Bush M, Wildt DE. 1985. Genetic basis for species vulnerability in the cheetah. Science 227: 1428–1434.
Abstract/FREE Full Text
↵
1. Oleksiak MF,
2. Churchill GA,
3. Crawford DL
Oleksiak MF, Churchill GA, Crawford DL. 2002. Variation in gene expression within and among natural populations. Nat Genet 32: 261–266.
CrossRef Medline Google Scholar
↵
1. Palstra FP,
2. Ruzzante DE
Palstra FP, Ruzzante DE. 2008. Genetic estimates of contemporary effective population size: What can they tell us about the importance of genetic stochasticity for wild population persistence? Mol Ecol 17: 3428–3447.
CrossRef Medline Google Scholar
↵
1. Pastorini J,
2. Zaramody A,
3. Curtis DJ,
4. Nievergelt CM,
5. Mundy NI
Pastorini J, Zaramody A, Curtis DJ, Nievergelt CM, Mundy NI. 2009. Genetic analysis of hybridization and introgression between wild mongoose and brown lemurs. BMC Evol Biol 9: 32. doi: 10.1186/1471-2148-9-32.
CrossRef Medline Google Scholar
↵
1. Perelman P,
2. Johnson WE,
3. Roos C,
4. Seuanez HN,
5. Horvath JE,
6. Moreira MA,
7. Kessing B,
8. Pontius J,
9. Roelke M,
10. Rumpler Y
Perelman P, Johnson WE, Roos C, Seuanez HN, Horvath JE, Moreira MA, Kessing B, Pontius J, Roelke M, Rumpler Y, et al. 2011. A molecular phylogeny of living primates. PLoS Genet 7: e1001342. doi: 10.1371/journal.pgen.1001342.
CrossRef Medline Google Scholar
↵
1. Perry GH,
2. Martin RD,
3. Verrelli BC
Perry GH, Martin RD, Verrelli BC. 2007. Signatures of functional constraint at aye-aye opsin genes: The potential of adaptive color vision in a nocturnal primate. Mol Biol Evol 24: 1963–1970.
Abstract/FREE Full Text
↵
1. Perry GH,
2. Marioni JC,
3. Melsted P,
4. Gilad Y
Perry GH, Marioni JC, Melsted P, Gilad Y. 2010. Genomic-scale capture and sequencing of endogenous DNA from feces. Mol Ecol 19: 5332–5344.
CrossRef Medline Google Scholar
↵
1. Pevzner PA,
2. Tang H,
3. Waterman MS
Pevzner PA, Tang H, Waterman MS. 2001. An Eulerian path approach to DNA fragment assembly. Proc Natl Acad Sci 98: 9748–9753.
Abstract/FREE Full Text
↵
1. Pickrell JK,
2. Pai AA,
3. Gilad Y,
4. Pritchard JK
Pickrell JK, Pai AA, Gilad Y, Pritchard JK. 2010. Noisy splicing drives mRNA isoform diversity in human cells. PLoS Genet 6: e1001236. doi: 10.1371/journal.pgen.1001236.
CrossRef Medline Google Scholar
↵
1. Quemere E,
2. Crouau-Roy B,
3. Rabarivola C,
4. Louis EE Jr.,
5. Chikhi L
Quemere E, Crouau-Roy B, Rabarivola C, Louis EE Jr., Chikhi L. 2010. Landscape genetics of an endangered lemur (Propithecus tattersalli) within its entire fragmented range. Mol Ecol 19: 1606–1621.
Medline Google Scholar
↵
1. Razakamaharavo VR,
2. McGuire SM,
3. Vasey N,
4. Louis EE Jr.,
5. Brenneman RA
Razakamaharavo VR, McGuire SM, Vasey N, Louis EE Jr., Brenneman RA. 2010. Genetic architecture of two red ruffed lemur (Varecia rubra) populations of Masoala national park. Primates 51: 53–61.
CrossRef Medline Google Scholar
↵

Rhesus Macaque Genome Sequencing and Analysis Consortium. 2007. Evolutionary and biomedical insights from the rhesus macaque genome. Science 316: 222–234.

Abstract/FREE Full Text
↵
1. Robertson G,
2. Schein J,
3. Chiu R,
4. Corbett R,
5. Field M,
6. Jackman SD,
7. Mungall K,
8. Lee S,
9. Okada HM,
10. Qian JQ
Robertson G, Schein J, Chiu R, Corbett R, Field M, Jackman SD, Mungall K, Lee S, Okada HM, Qian JQ, et al. 2010. De novo assembly and analysis of RNA-seq data. Nat Methods 7: 909–912.
CrossRef Medline Google Scholar
↵
1. Simpson JT,
2. Wong K,
3. Jackman SD,
4. Schein JE,
5. Jones SJ,
6. Birol I
Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I. 2009. ABySS: A parallel assembler for short read sequence data. Genome Res 19: 1117–1123.
Abstract/FREE Full Text
↵
1. Somel M,
2. Creely H,
3. Franz H,
4. Mueller U,
5. Lachmann M,
6. Khaitovich P,
7. Paabo S
Somel M, Creely H, Franz H, Mueller U, Lachmann M, Khaitovich P, Paabo S. 2008. Human and chimpanzee gene expression differences replicated in mice fed different diets. PLoS One 3: e1504. doi: 10.1371/journal.pone.0001504.
CrossRef Medline Google Scholar
↵
1. Tavare S,
2. Marshall CR,
3. Will O,
4. Soligo C,
5. Martin RD
Tavare S, Marshall CR, Will O, Soligo C, Martin RD. 2002. Using the fossil record to estimate the age of the last common ancestor of extant primates. Nature 416: 726–729.
CrossRef Medline Google Scholar
↵
1. Voight BF,
2. Adams AM,
3. Frisse LA,
4. Qian Y,
5. Hudson RR,
6. Di Rienzo A
Voight BF, Adams AM, Frisse LA, Qian Y, Hudson RR, Di Rienzo A. 2005. Interrogating multiple aspects of variation in a full resequencing data set to infer human population size changes. Proc Natl Acad Sci 102: 18508–18513.
Abstract/FREE Full Text
↵
1. Wall JD,
2. Cox MP,
3. Mendez FL,
4. Woerner A,
5. Severson T,
6. Hammer MF
Wall JD, Cox MP, Mendez FL, Woerner A, Severson T, Hammer MF. 2008. A novel DNA sequence database for analyzing human demographic history. Genome Res 18: 1354–1361.
Abstract/FREE Full Text
↵
1. Watkins PA,
2. Moser AB,
3. Toomer CB,
4. Steinberg SJ,
5. Moser HW,
6. Karaman MW,
7. Ramaswamy K,
8. Siegmund KD,
9. Lee DR,
10. Ely JJ
Watkins PA, Moser AB, Toomer CB, Steinberg SJ, Moser HW, Karaman MW, Ramaswamy K, Siegmund KD, Lee DR, Ely JJ, et al. 2010. Identification of differences in human and great ape phytanic acid metabolism that could influence gene expression profiles and physiological functions. BMC Physiol 10: 19. doi: 10.1186/1472-6793-10-19.
CrossRef Medline Google Scholar
↵
1. Yang Z
Yang Z. 2007. PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol 24: 1586–1591.
Abstract/FREE Full Text
↵
1. Yao R,
2. Schneider E,
3. Ryan TJ,
4. Galivan J
Yao R, Schneider E, Ryan TJ, Galivan J. 1996. Human gamma-glutamyl hydrolase: Cloning and characterization of the enzyme expressed in vitro. Proc Natl Acad Sci 93: 10134–10138.
Abstract/FREE Full Text
↵
1. Yu N,
2. Jensen-Seaman MI,
3. Chemnick L,
4. Kidd JR,
5. Deinard AS,
6. Ryder O,
7. Kidd KK,
8. Li WH
Yu N, Jensen-Seaman MI, Chemnick L, Kidd JR, Deinard AS, Ryder O, Kidd KK, Li WH. 2003. Low nucleotide diversity in chimpanzees and bonobos. Genetics 164: 1511–1518.
Medline Google Scholar
↵
1. Zerbino DR,
2. Birney E
Zerbino DR, Birney E. 2008. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18: 821–829.
Abstract/FREE Full Text

[1] ↵

Abzhanov A,

Kuo WP,

Hartmann C,

Grant BR,

Grant PR,

Tabin CJ

Abzhanov A, Kuo WP, Hartmann C, Grant BR, Grant PR, Tabin CJ. 2006. The calmodulin pathway and evolution of elongated beak morphology in Darwin's finches. Nature 442: 563–567.

CrossRef Medline Google Scholar

[2] Abzhanov A,

[3] Kuo WP,

[4] Hartmann C,

[5] Grant BR,

[6] Grant PR,

[7] Tabin CJ

[8] ↵

Alexander RP,

Fang G,

Rozowsky J,

Snyder M,

Gerstein MB

Alexander RP, Fang G, Rozowsky J, Snyder M, Gerstein MB. 2010. Annotating noncoding regions of the genome. Nat Rev Genet 11: 559–571.

CrossRef Medline Google Scholar

[9] Alexander RP,

[10] Fang G,

[11] Rozowsky J,

[12] Snyder M,

[13] Gerstein MB

[14] ↵

Allendorf FW,

Hohenlohe PA,

Luikart G

Allendorf FW, Hohenlohe PA, Luikart G. 2010. Genomics and the future of conservation genetics. Nat Rev Genet 11: 697–709.

CrossRef Medline Google Scholar

[15] Allendorf FW,

[16] Hohenlohe PA,

[17] Luikart G

[18] ↵

Ancrenaz M,

Lackman-Ancrenaz I,

Mundy N

Ancrenaz M, Lackman-Ancrenaz I, Mundy N. 1994. Field observations of aye-ayes (Daubentonia madagascariensis) in Madagascar. Folia Primatol (Basel) 62: 22–36.

CrossRef Medline Google Scholar

[19] Ancrenaz M,

[20] Lackman-Ancrenaz I,

[21] Mundy N

[22] ↵

Baines JF,

Harr B

Baines JF, Harr B. 2007. Reduced X-linked diversity in derived populations of house mice. Genetics 175: 1911–1921.

CrossRef Medline Google Scholar

[23] Baines JF,

[24] Harr B

[25] ↵

Bedford T,

Hartl DL

Bedford T, Hartl DL. 2009. Optimization of gene expression by natural selection. Proc Natl Acad Sci 106: 1133–1138.

Abstract/FREE Full Text

[26] Bedford T,

[27] Hartl DL

[28] ↵

Bejerano G,

Pheasant M,

Makunin I,

Stephen S,

Kent WJ,

Mattick JS,

Haussler D

Bejerano G, Pheasant M, Makunin I, Stephen S, Kent WJ, Mattick JS, Haussler D. 2004. Ultraconserved elements in the human genome. Science 304: 1321–1325.

Abstract/FREE Full Text

[29] Bejerano G,

[30] Pheasant M,

[31] Makunin I,

[32] Stephen S,

[33] Kent WJ,

[34] Mattick JS,

[35] Haussler D

[36] ↵

Blekhman R,

Oshlack A,

Chabot AE,

Smyth GK,

Gilad Y

Blekhman R, Oshlack A, Chabot AE, Smyth GK, Gilad Y. 2008. Gene regulation in primates evolves under tissue-specific selection pressures. PLoS Genet 4: e1000271. doi: 10.1371/journal.pgen.1000271.

CrossRef Medline Google Scholar

[37] Blekhman R,

[38] Oshlack A,

[39] Chabot AE,

[40] Smyth GK,

[41] Gilad Y

[42] ↵

Bradley RK,

Roberts A,

Smoot M,

Juvekar S,

Do J,

Dewey C,

Holmes I,

Pachter L

Bradley RK, Roberts A, Smoot M, Juvekar S, Do J, Dewey C, Holmes I, Pachter L. 2009. Fast statistical alignment. PLoS Comput Biol 5: e1000392. doi: 10.1371/journal.pcbi.1000392.

CrossRef Medline Google Scholar

[43] Bradley RK,

[44] Roberts A,

[45] Smoot M,

[46] Juvekar S,

[47] Do J,

[48] Dewey C,

[49] Holmes I,

[50] Pachter L

[51] ↵

Brooks TM,

Mittermeier RA,

Mittermeier CG,

da Fonseca GAB,

Rylands AB,

Konstant WR,

Flick P,

Pilgrim J,

Oldfield S,

Magin G

Brooks TM, Mittermeier RA, Mittermeier CG, da Fonseca GAB, Rylands AB, Konstant WR, Flick P, Pilgrim J, Oldfield S, Magin G, et al. 2002. Habitat loss and extinction in the hotspots of biodiversity. Conserv Biol 16: 909–923.

CrossRef Google Scholar

[52] Brooks TM,

[53] Mittermeier RA,

[54] Mittermeier CG,

[55] da Fonseca GAB,

[56] Rylands AB,

[57] Konstant WR,

[58] Flick P,

[59] Pilgrim J,

[60] Oldfield S,

[61] Magin G

[62] ↵

Caceres M,

Lachuer J,

Zapala MA,

Redmond JC,

Kudo L,

Geschwind DH,

Lockhart DJ,

Preuss TM,

Barlow C

Caceres M, Lachuer J, Zapala MA, Redmond JC, Kudo L, Geschwind DH, Lockhart DJ, Preuss TM, Barlow C. 2003. Elevated gene expression levels distinguish human from nonhuman primate brains. Proc Natl Acad Sci 100: 13030–13035.

Abstract/FREE Full Text

[63] Caceres M,

[64] Lachuer J,

[65] Zapala MA,

[66] Redmond JC,

[67] Kudo L,

[68] Geschwind DH,

[69] Lockhart DJ,

[70] Preuss TM,

[71] Barlow C

[72] ↵

Cannarozzi G,

Schneider A,

Gonnet G

Cannarozzi G, Schneider A, Gonnet G. 2007. A phylogenomic study of human, dog, and mouse. PLoS Comput Biol 3: e2. doi: 10.1371/journal.pcbi.0030002.

CrossRef Medline Google Scholar

[73] Cannarozzi G,

[74] Schneider A,

[75] Gonnet G

[76] ↵

The Chimpanzee Sequencing and Analysis Consortium. 2005. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437: 69–87.

CrossRef Medline Google Scholar

[77] ↵

Drosophila 12 Genomes Consortium. 2007. Evolution of genes and genomes on the Drosophila phylogeny. Nature 450: 203–218.

CrossRef Medline Google Scholar

[78] ↵

Fischer A,

Wiebe V,

Paabo S,

Przeworski M

Fischer A, Wiebe V, Paabo S, Przeworski M. 2004. Evidence for a complex demographic history of chimpanzees. Mol Biol Evol 21: 799–808.

Abstract/FREE Full Text

[79] Fischer A,

[80] Wiebe V,

[81] Paabo S,

[82] Przeworski M

[83] ↵

Foxworthy PS,

White SL,

Hoover DM,

Eacho PI

Foxworthy PS, White SL, Hoover DM, Eacho PI. 1990. Effect of ciprofibrate, bezafibrate, and LY171883 on peroxisomal β-oxidation in cultured rat, dog, and rhesus monkey hepatocytes. Toxicol Appl Pharmacol 104: 386–394.

CrossRef Medline Google Scholar

[84] Foxworthy PS,

[85] White SL,

[86] Hoover DM,

[87] Eacho PI

[88] ↵

Frankham R

Frankham R. 2005. Genetics and extinction. Biol Conserv 126: 131–140.

CrossRef Google Scholar

[89] Frankham R

[90] ↵

Frankham R

Frankham R. 2010. Challenges and opportunities of genetic approaches to biological conservation. Biol Conserv 143: 1919–1927.

CrossRef Google Scholar

[91] Frankham R

[92] ↵

Fredsted T,

Pertoldi C,

Schierup MH,

Kappeler PM

Fredsted T, Pertoldi C, Schierup MH, Kappeler PM. 2005. Microsatellite analyses reveal fine-scale genetic structure in grey mouse lemurs (Microcebus murinus). Mol Ecol 14: 2363–2372.

Medline Google Scholar

[93] Fredsted T,

[94] Pertoldi C,

[95] Schierup MH,

[96] Kappeler PM

[97] ↵

Gilad Y,

Oshlack A,

Smyth GK,

Speed TP,

White KP

Gilad Y, Oshlack A, Smyth GK, Speed TP, White KP. 2006. Expression profiling in primates reveals a rapid evolution of human transcription factors. Nature 440: 242–245.

CrossRef Medline Google Scholar

[98] Gilad Y,

[99] Oshlack A,

[100] Smyth GK,

[101] Speed TP,

[102] White KP

[103] ↵

Gnirke A,

Melnikov A,

Maguire J,

Rogov P,

LeProust EM,

Brockman W,

Fennell T,

Giannoukos G,

Fisher S,

Russ C

Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust EM, Brockman W, Fennell T, Giannoukos G, Fisher S, Russ C, et al. 2009. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol 27: 182–189.

CrossRef Medline Google Scholar

[104] Gnirke A,

[105] Melnikov A,

[106] Maguire J,

[107] Rogov P,

[108] LeProust EM,

[109] Brockman W,

[110] Fennell T,

[111] Giannoukos G,

[112] Fisher S,

[113] Russ C

[114] ↵

Grabherr MG,

Haas BJ,

Yassour M,

Levin JZ,

Thompson DA,

Amit I,

Adiconis X,

Fan L,

Raychowdhury R,

Zeng Q

Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, et al. 2011. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644–652.

CrossRef Medline Google Scholar

[115] Grabherr MG,

[116] Haas BJ,

[117] Yassour M,

[118] Levin JZ,

[119] Thompson DA,

[120] Amit I,

[121] Adiconis X,

[122] Fan L,

[123] Raychowdhury R,

[124] Zeng Q

[125] ↵

Green GM,

Sussman RW

Green GM, Sussman RW. 1990. Deforestation history of the eastern rain forests of Madagascar from satellite images. Science 248: 212–215.

Abstract/FREE Full Text

[126] Green GM,

[127] Sussman RW

[128] ↵

Harper GJ,

Steininger MK,

Tucker CJ,

Juhn D,

Hawkins F

Harper GJ, Steininger MK, Tucker CJ, Juhn D, Hawkins F. 2007. Fifty years of deforestation and forest fragmentation in Madagascar. Environ Conserv 34: 325–333.

Google Scholar

[129] Harper GJ,

[130] Steininger MK,

[131] Tucker CJ,

[132] Juhn D,

[133] Hawkins F

[134] ↵

Hernandez RD,

Hubisz MJ,

Wheeler DA,

Smith DG,

Ferguson B,

Rogers J,

Nazareth L,

Indap A,

Bourquin T,

McPherson J

Hernandez RD, Hubisz MJ, Wheeler DA, Smith DG, Ferguson B, Rogers J, Nazareth L, Indap A, Bourquin T, McPherson J, et al. 2007. Demographic histories and patterns of linkage disequilibrium in Chinese and Indian rhesus macaques. Science 316: 240–243.

Abstract/FREE Full Text

[135] Hernandez RD,

[136] Hubisz MJ,

[137] Wheeler DA,

[138] Smith DG,

[139] Ferguson B,

[140] Rogers J,

[141] Nazareth L,

[142] Indap A,

[143] Bourquin T,

[144] McPherson J

[145] ↵

Hoivik DJ,

Qualls CW Jr.,

Mirabile RC,

Cariello NF,

Kimbrough CL,

Colton HM,

Anderson SP,

Santostefano MJ,

Morgan RJ,

Dahl RR

Hoivik DJ, Qualls CW Jr., Mirabile RC, Cariello NF, Kimbrough CL, Colton HM, Anderson SP, Santostefano MJ, Morgan RJ, Dahl RR, et al. 2004. Fibrates induce hepatic peroxisome and mitochondrial proliferation without overt evidence of cellular proliferation and oxidative stress in cynomolgus monkeys. Carcinogenesis 25: 1757–1769.

Abstract/FREE Full Text

[146] Hoivik DJ,

[147] Qualls CW Jr.,

[148] Mirabile RC,

[149] Cariello NF,

[150] Kimbrough CL,

[151] Colton HM,

[152] Anderson SP,

[153] Santostefano MJ,

[154] Morgan RJ,

[155] Dahl RR

[156] ↵

Horvath JE,

Willard HF

Horvath JE, Willard HF. 2007. Primate comparative genomics: Lemur biology and evolution. Trends Genet 23: 173–182.

CrossRef Medline Google Scholar

[157] Horvath JE,

[158] Willard HF

[159] ↵

International Union for Conservation of Nature. 2010. Red List of Threatened Species Version 2010.4. http://www.iucnredlist.org.

[160] Islinger M,

Cardoso MJ,

Schrader M

Islinger M, Cardoso MJ, Schrader M. 2010. Be different–the diversity of peroxisomes in the animal kingdom. Biochim Biophys Acta 1803: 881–897.

CrossRef Medline Google Scholar

[161] Islinger M,

[162] Cardoso MJ,

[163] Schrader M

[164] ↵

Jiang Z,

Tang H,

Ventura M,

Cardone MF,

Marques-Bonet T,

She X,

Pevzner PA,

Eichler EE

Jiang Z, Tang H, Ventura M, Cardone MF, Marques-Bonet T, She X, Pevzner PA, Eichler EE. 2007. Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution. Nat Genet 39: 1361–1368.

CrossRef Medline Google Scholar

[165] Jiang Z,

[166] Tang H,

[167] Ventura M,

[168] Cardone MF,

[169] Marques-Bonet T,

[170] She X,

[171] Pevzner PA,

[172] Eichler EE

[173] ↵

Keebaugh AC,

Thomas JW

Keebaugh AC, Thomas JW. 2010. The evolutionary fate of the genes encoding the purine catabolic enzymes in hominoids, birds, and reptiles. Mol Biol Evol 27: 1359–1369.

Abstract/FREE Full Text

[174] Keebaugh AC,

[175] Thomas JW

[176] ↵

Khaitovich P,

Hellmann I,

Enard W,

Nowick K,

Leinweber M,

Franz H,

Weiss G,

Lachmann M,

Paabo S

Khaitovich P, Hellmann I, Enard W, Nowick K, Leinweber M, Franz H, Weiss G, Lachmann M, Paabo S. 2005. Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science 309: 1850–1854.

Abstract/FREE Full Text

[177] Khaitovich P,

[178] Hellmann I,

[179] Enard W,

[180] Nowick K,

[181] Leinweber M,

[182] Franz H,

[183] Weiss G,

[184] Lachmann M,

[185] Paabo S

[186] ↵

Kimura M,

Maruyama T,

Crow JF

Kimura M, Maruyama T, Crow JF. 1963. The mutation load in small populations. Genetics 48: 1303–1312.

Medline Google Scholar

[187] Kimura M,

[188] Maruyama T,

[189] Crow JF

[190] ↵

Lawler RR

Lawler RR. 2008. Testing for a historical population bottleneck in wild Verreaux's sifaka (Propithecus verreauxi verreauxi) using microsatellite data. Am J Primatol 70: 990–994.

CrossRef Medline Google Scholar

[191] Lawler RR

[192] ↵

Li H,

Durbin R

Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25: 1754–1760.

Abstract/FREE Full Text

[193] Li H,

[194] Durbin R

[195] ↵

Locke DP,

Hillier LW,

Warren WC,

Worley KC,

Nazareth LV,

Muzny DM,

Yang SP,

Wang Z,

Chinwalla AT,

Minx P

Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang SP, Wang Z, Chinwalla AT, Minx P, et al. 2011. Comparative and demographic analysis of orang-utan genomes. Nature 469: 529–533.

CrossRef Medline Google Scholar

[196] Locke DP,

[197] Hillier LW,

[198] Warren WC,

[199] Worley KC,

[200] Nazareth LV,

[201] Muzny DM,

[202] Yang SP,

[203] Wang Z,

[204] Chinwalla AT,

[205] Minx P

[206] ↵

Louis EE,

Ratsimbazafy JH,

Razakamaharauo VR,

Pierson DJ,

Barber RC,

Brenneman RA

Louis EE, Ratsimbazafy JH, Razakamaharauo VR, Pierson DJ, Barber RC, Brenneman RA. 2005. Conservation genetics of black and white ruffed lemurs, Varecia variegata, from Southeastern Madagascar. Anim Conserv 8: 105–111.

CrossRef Google Scholar

[207] Louis EE,

[208] Ratsimbazafy JH,

[209] Razakamaharauo VR,

[210] Pierson DJ,

[211] Barber RC,

[212] Brenneman RA

[213] ↵

Mittermeier RA,

Ganzhorn JU,

Konstant WR,

Glander K,

Tattersall I,

Groves CP,

Rylands AB,

Hapke A,

Ratsimbazafy J,

Mayor MI

Mittermeier RA, Ganzhorn JU, Konstant WR, Glander K, Tattersall I, Groves CP, Rylands AB, Hapke A, Ratsimbazafy J, Mayor MI, et al. 2008. Lemur diversity in Madagascar. Int J Primatol 29: 1607–1656.

CrossRef Google Scholar

[214] Mittermeier RA,

[215] Ganzhorn JU,

[216] Konstant WR,

[217] Glander K,

[218] Tattersall I,

[219] Groves CP,

[220] Rylands AB,

[221] Hapke A,

[222] Ratsimbazafy J,

[223] Mayor MI

[224] ↵

Mittermeier RA,

Louis EE,

Richardson M,

Schwitzer C,

Langrand O,

Rylands AB,

Hawkins F,

Rajaobelina S,

Ratsimbazafy J,

Rasoloarison R

Mittermeier RA, Louis EE, Richardson M, Schwitzer C, Langrand O, Rylands AB, Hawkins F, Rajaobelina S, Ratsimbazafy J, Rasoloarison R, et al. 2010. Lemurs of Madagascar. Conservation International, Arlington, VA.

Google Scholar

[225] Mittermeier RA,

[226] Louis EE,

[227] Richardson M,

[228] Schwitzer C,

[229] Langrand O,

[230] Rylands AB,

[231] Hawkins F,

[232] Rajaobelina S,

[233] Ratsimbazafy J,

[234] Rasoloarison R

[235] ↵

Nakamura K,

Oshima T,

Morimoto T,

Ikeda S,

Yoshikawa H,

Shiwa Y,

Ishikawa S,

Linak MC,

Hirai A,

Takahashi H

Nakamura K, Oshima T, Morimoto T, Ikeda S, Yoshikawa H, Shiwa Y, Ishikawa S, Linak MC, Hirai A, Takahashi H, et al. 2011. Sequence-specific error profile of Illumina sequencers. Nucleic Acids Res 39: e90. doi: 10.1093/nar/gkr344.

Abstract/FREE Full Text

[236] Nakamura K,

[237] Oshima T,

[238] Morimoto T,

[239] Ikeda S,

[240] Yoshikawa H,

[241] Shiwa Y,

[242] Ishikawa S,

[243] Linak MC,

[244] Hirai A,

[245] Takahashi H

[246] ↵

O'Brien SJ,

Johnson WE

O'Brien SJ, Johnson WE. 2005. Big cat genomics. Annu Rev Genomics Hum Genet 6: 407–429.

CrossRef Medline Google Scholar

[247] O'Brien SJ,

[248] Johnson WE

[249] ↵

O'Brien SJ,

Wildt DE,

Goldman D,

Merril CR,

Bush M

O'Brien SJ, Wildt DE, Goldman D, Merril CR, Bush M. 1983. The cheetah is depauperate in genetic variation. Science 221: 459–462.

Abstract/FREE Full Text

[250] O'Brien SJ,

[251] Wildt DE,

[252] Goldman D,

[253] Merril CR,

[254] Bush M

[255] ↵

O'Brien SJ,

Roelke ME,

Marker L,

Newman A,

Winkler CA,

Meltzer D,

Colly L,

Evermann JF,

Bush M,

Wildt DE

O'Brien SJ, Roelke ME, Marker L, Newman A, Winkler CA, Meltzer D, Colly L, Evermann JF, Bush M, Wildt DE. 1985. Genetic basis for species vulnerability in the cheetah. Science 227: 1428–1434.

Abstract/FREE Full Text

[256] O'Brien SJ,

[257] Roelke ME,

[258] Marker L,

[259] Newman A,

[260] Winkler CA,

[261] Meltzer D,

[262] Colly L,

[263] Evermann JF,

[264] Bush M,

[265] Wildt DE

[266] ↵

Oleksiak MF,

Churchill GA,

Crawford DL

Oleksiak MF, Churchill GA, Crawford DL. 2002. Variation in gene expression within and among natural populations. Nat Genet 32: 261–266.

CrossRef Medline Google Scholar

[267] Oleksiak MF,

[268] Churchill GA,

[269] Crawford DL

[270] ↵

Palstra FP,

Ruzzante DE

Palstra FP, Ruzzante DE. 2008. Genetic estimates of contemporary effective population size: What can they tell us about the importance of genetic stochasticity for wild population persistence? Mol Ecol 17: 3428–3447.

CrossRef Medline Google Scholar

[271] Palstra FP,

[272] Ruzzante DE

[273] ↵

Pastorini J,

Zaramody A,

Curtis DJ,

Nievergelt CM,

Mundy NI

Pastorini J, Zaramody A, Curtis DJ, Nievergelt CM, Mundy NI. 2009. Genetic analysis of hybridization and introgression between wild mongoose and brown lemurs. BMC Evol Biol 9: 32. doi: 10.1186/1471-2148-9-32.

CrossRef Medline Google Scholar

[274] Pastorini J,

[275] Zaramody A,

[276] Curtis DJ,

[277] Nievergelt CM,

[278] Mundy NI

[279] ↵

Perelman P,

Johnson WE,

Roos C,

Seuanez HN,

Horvath JE,

Moreira MA,

Kessing B,

Pontius J,

Roelke M,

Rumpler Y

Perelman P, Johnson WE, Roos C, Seuanez HN, Horvath JE, Moreira MA, Kessing B, Pontius J, Roelke M, Rumpler Y, et al. 2011. A molecular phylogeny of living primates. PLoS Genet 7: e1001342. doi: 10.1371/journal.pgen.1001342.

CrossRef Medline Google Scholar

[280] Perelman P,

[281] Johnson WE,

[282] Roos C,

[283] Seuanez HN,

[284] Horvath JE,

[285] Moreira MA,

[286] Kessing B,

[287] Pontius J,

[288] Roelke M,

[289] Rumpler Y

[290] ↵

Perry GH,

Martin RD,

Verrelli BC

Perry GH, Martin RD, Verrelli BC. 2007. Signatures of functional constraint at aye-aye opsin genes: The potential of adaptive color vision in a nocturnal primate. Mol Biol Evol 24: 1963–1970.

Abstract/FREE Full Text

[291] Perry GH,

[292] Martin RD,

[293] Verrelli BC

[294] ↵

Perry GH,

Marioni JC,

Melsted P,

Gilad Y

Perry GH, Marioni JC, Melsted P, Gilad Y. 2010. Genomic-scale capture and sequencing of endogenous DNA from feces. Mol Ecol 19: 5332–5344.

CrossRef Medline Google Scholar

[295] Perry GH,

[296] Marioni JC,

[297] Melsted P,

[298] Gilad Y

[299] ↵

Pevzner PA,

Tang H,

Waterman MS

Pevzner PA, Tang H, Waterman MS. 2001. An Eulerian path approach to DNA fragment assembly. Proc Natl Acad Sci 98: 9748–9753.

Abstract/FREE Full Text

[300] Pevzner PA,

[301] Tang H,

[302] Waterman MS

[303] ↵

Pickrell JK,

Pai AA,

Gilad Y,

Pritchard JK

Pickrell JK, Pai AA, Gilad Y, Pritchard JK. 2010. Noisy splicing drives mRNA isoform diversity in human cells. PLoS Genet 6: e1001236. doi: 10.1371/journal.pgen.1001236.

CrossRef Medline Google Scholar

[304] Pickrell JK,

[305] Pai AA,

[306] Gilad Y,

[307] Pritchard JK

[308] ↵

Quemere E,

Crouau-Roy B,

Rabarivola C,

Louis EE Jr.,

Chikhi L

Quemere E, Crouau-Roy B, Rabarivola C, Louis EE Jr., Chikhi L. 2010. Landscape genetics of an endangered lemur (Propithecus tattersalli) within its entire fragmented range. Mol Ecol 19: 1606–1621.

Medline Google Scholar

[309] Quemere E,

[310] Crouau-Roy B,

[311] Rabarivola C,

[312] Louis EE Jr.,

[313] Chikhi L

[314] ↵

Razakamaharavo VR,

McGuire SM,

Vasey N,

Louis EE Jr.,

Brenneman RA

Razakamaharavo VR, McGuire SM, Vasey N, Louis EE Jr., Brenneman RA. 2010. Genetic architecture of two red ruffed lemur (Varecia rubra) populations of Masoala national park. Primates 51: 53–61.

CrossRef Medline Google Scholar

[315] Razakamaharavo VR,

[316] McGuire SM,

[317] Vasey N,

[318] Louis EE Jr.,

[319] Brenneman RA

[320] ↵

Rhesus Macaque Genome Sequencing and Analysis Consortium. 2007. Evolutionary and biomedical insights from the rhesus macaque genome. Science 316: 222–234.

Abstract/FREE Full Text

[321] ↵

Robertson G,

Schein J,

Chiu R,

Corbett R,

Field M,

Jackman SD,

Mungall K,

Lee S,

Okada HM,

Qian JQ

Robertson G, Schein J, Chiu R, Corbett R, Field M, Jackman SD, Mungall K, Lee S, Okada HM, Qian JQ, et al. 2010. De novo assembly and analysis of RNA-seq data. Nat Methods 7: 909–912.

CrossRef Medline Google Scholar

[322] Robertson G,

[323] Schein J,

[324] Chiu R,

[325] Corbett R,

[326] Field M,

[327] Jackman SD,

[328] Mungall K,

[329] Lee S,

[330] Okada HM,

[331] Qian JQ

[332] ↵

Simpson JT,

Wong K,

Jackman SD,

Schein JE,

Jones SJ,

Birol I

Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I. 2009. ABySS: A parallel assembler for short read sequence data. Genome Res 19: 1117–1123.

Abstract/FREE Full Text

[333] Simpson JT,

[334] Wong K,

[335] Jackman SD,

[336] Schein JE,

[337] Jones SJ,

[338] Birol I

[339] ↵

Somel M,

Creely H,

Franz H,

Mueller U,

Lachmann M,

Khaitovich P,

Paabo S

Somel M, Creely H, Franz H, Mueller U, Lachmann M, Khaitovich P, Paabo S. 2008. Human and chimpanzee gene expression differences replicated in mice fed different diets. PLoS One 3: e1504. doi: 10.1371/journal.pone.0001504.

CrossRef Medline Google Scholar

[340] Somel M,

[341] Creely H,

[342] Franz H,

[343] Mueller U,

[344] Lachmann M,

[345] Khaitovich P,

[346] Paabo S

[347] ↵

Tavare S,

Marshall CR,

Will O,

Soligo C,

Martin RD

Tavare S, Marshall CR, Will O, Soligo C, Martin RD. 2002. Using the fossil record to estimate the age of the last common ancestor of extant primates. Nature 416: 726–729.

CrossRef Medline Google Scholar

[348] Tavare S,

[349] Marshall CR,

[350] Will O,

[351] Soligo C,

[352] Martin RD

[353] ↵

Voight BF,

Adams AM,

Frisse LA,

Qian Y,

Hudson RR,

Di Rienzo A

Voight BF, Adams AM, Frisse LA, Qian Y, Hudson RR, Di Rienzo A. 2005. Interrogating multiple aspects of variation in a full resequencing data set to infer human population size changes. Proc Natl Acad Sci 102: 18508–18513.

Abstract/FREE Full Text

[354] Voight BF,

[355] Adams AM,

[356] Frisse LA,

[357] Qian Y,

[358] Hudson RR,

[359] Di Rienzo A

[360] ↵

Wall JD,

Cox MP,

Mendez FL,

Woerner A,

Severson T,

Hammer MF

Wall JD, Cox MP, Mendez FL, Woerner A, Severson T, Hammer MF. 2008. A novel DNA sequence database for analyzing human demographic history. Genome Res 18: 1354–1361.

Abstract/FREE Full Text

[361] Wall JD,

[362] Cox MP,

[363] Mendez FL,

[364] Woerner A,

[365] Severson T,

[366] Hammer MF

[367] ↵

Watkins PA,

Moser AB,

Toomer CB,

Steinberg SJ,

Moser HW,

Karaman MW,

Ramaswamy K,

Siegmund KD,

Lee DR,

Ely JJ

Watkins PA, Moser AB, Toomer CB, Steinberg SJ, Moser HW, Karaman MW, Ramaswamy K, Siegmund KD, Lee DR, Ely JJ, et al. 2010. Identification of differences in human and great ape phytanic acid metabolism that could influence gene expression profiles and physiological functions. BMC Physiol 10: 19. doi: 10.1186/1472-6793-10-19.

CrossRef Medline Google Scholar

[368] Watkins PA,

[369] Moser AB,

[370] Toomer CB,

[371] Steinberg SJ,

[372] Moser HW,

[373] Karaman MW,

[374] Ramaswamy K,

[375] Siegmund KD,

[376] Lee DR,

[377] Ely JJ

[378] ↵

Yang Z

Yang Z. 2007. PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol 24: 1586–1591.

Abstract/FREE Full Text

[379] Yang Z

[380] ↵

Yao R,

Schneider E,

Ryan TJ,

Galivan J

Yao R, Schneider E, Ryan TJ, Galivan J. 1996. Human gamma-glutamyl hydrolase: Cloning and characterization of the enzyme expressed in vitro. Proc Natl Acad Sci 93: 10134–10138.

Abstract/FREE Full Text

[381] Yao R,

[382] Schneider E,

[383] Ryan TJ,

[384] Galivan J

[385] ↵

Yu N,

Jensen-Seaman MI,

Chemnick L,

Kidd JR,

Deinard AS,

Ryder O,

Kidd KK,

Li WH

Yu N, Jensen-Seaman MI, Chemnick L, Kidd JR, Deinard AS, Ryder O, Kidd KK, Li WH. 2003. Low nucleotide diversity in chimpanzees and bonobos. Genetics 164: 1511–1518.

Medline Google Scholar

[386] Yu N,

[387] Jensen-Seaman MI,

[388] Chemnick L,

[389] Kidd JR,

[390] Deinard AS,

[391] Ryder O,

[392] Kidd KK,

[393] Li WH

[394] ↵

Zerbino DR,

Birney E

Zerbino DR, Birney E. 2008. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18: 821–829.

Abstract/FREE Full Text

[395] Zerbino DR,

[396] Birney E

Comparative RNA sequencing reveals substantial genetic variation in endangered primates

Abstract

Results

De novo transcript assembly

Genetic diversity and endangered status

Gene structure evolution

Natural selection at the gene regulatory and sequence levels

Discussion

Comparative primate genomics

Lemur genetic diversity

Conclusion

Methods

Overview

Estimating gene expression levels

SNP identification

Data access

Acknowledgments

Footnotes

References

Articles citing this article

This Article

Article Category

Services

Citing Articles

Google Scholar

PubMed/NCBI

Share

Preprint Server

Navigate This Article

Current Issue

In This Issue