Somatic retrotransposition in human cancer revealed by whole-genome and exome sequencing
- Elena Helman1,
- Michael L. Lawrence2,
- Chip Stewart2,
- Carrie Sougnez2,
- Gad Getz2 and
- Matthew Meyerson3,4
- 1 Harvard-MIT Division of Health Sciences & Technology;
- 2 Broad Institute;
- 3 Dana-Farber Cancer Institute
- ↵* Corresponding author; email: matthew_meyerson{at}dfci.harvard.edu
Abstract
Retrotransposons comprise over 40% of the human genome and constitute a major source of genetic variation. Somatic retrotransposon insertions have been reported in cancer. Here we present a comprehensive analysis of somatic retrotransposon insertions across multiple cancers. We applied TranspoSeq, a computational framework that identifies somatic retrotransposon insertions from massively parallel sequencing data, to whole-genome sequences from 200 tumor/normal pairs across 11 cancer types as part of the Pan-Cancer Project in The Cancer Genome Atlas (TCGA). In addition to novel germline polymorphisms, we find 810 somatic retrotransposon insertions primarily in lung squamous cell, head and neck, colorectal and endometrial carcinomas. Many somatic retrotransposon insertions fall into genic regions of known and putative cancer genes, as well as into exons. High somatic retrotransposition rates in tumors are associated with high rates of genomic rearrangement and somatic mutation. In addition, we developed TranspoSeq-Exome and interrogated TCGA hybrid-capture exome data in 767 tumor/normal pairs for retrotransposon movement, thereby discovering 35 additional somatic retrotransposon insertions into exonic regions, including a novel insertion into an exon of the PTEN tumor suppressor gene. These results suggest that somatic retrotransposon insertions may represent an important class of structural variation in cancer.
- Received July 17, 2013.
- Accepted April 3, 2014.
- Published by Cold Spring Harbor Laboratory Press
This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/3.0/.











