Reconstructing double-stranded DNA fragments on a single-molecule level reveals patterns of degradation in ancient samples

(Downloading may take up to 30 seconds. If the slide opens in your browser, select File -> Save As to save it.)

Click on image to view larger version.

Figure 1.
Figure 1.

Identification of sequences from DNA strands originating from the same double-stranded DNA molecules. (A) Schematic overview of MatchSeq: DNA fragments isolated from ancient or recent biological material are diluted to a relatively small number, denatured by heat, converted into a single-stranded library, amplified, and deeply sequenced. Double-stranded fragments are then reconstructed in silico by identifying overlapping sequences mapping to the reference genome in reverse complementary orientation or in close proximity on the same strand. Sequences <35 bp cannot be confidently mapped to the reference and are discarded. The same analysis is performed for an equal number of control sequences obtained from shallow sequencing of a vast amount of artificially fragmented modern human DNA, in which sequences are expected to be distributed randomly across the reference genome. (B) Distribution of distances between the 5′ end of each sequence to the nearest 3′ end on the opposing strand obtained from Neanderthal DNA (Vindija 33.19; gray bars) and the control (red line). (C) Distribution of distances between sequence alignments on the same strand. Molecular structures putatively underlying the observed signals are indicated by schematic drawings. Filled DNA strands indicate sequences that were recovered; unfilled strands are hypothesized to be present but were not sequenced.

This Article

  1. Genome Res. 30: 1449-1457

Preprint Server