TIGRA: A targeted iterative graph routing assembler for breakpoint assembly

  1. George Weinstock3,5
  1. 1Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA;
  2. 2Department of Computer Science, Rice University, Houston, Texas 77005, USA;
  3. 3The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA;
  4. 4Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, USA;
  5. 5Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA

    Abstract

    Recent progress in next-generation sequencing has greatly facilitated our study of genomic structural variation. Unlike single nucleotide variants and small indels, many structural variants have not been completely characterized at nucleotide resolution. Deriving the complete sequences underlying such breakpoints is crucial for not only accurate discovery, but also for the functional characterization of altered alleles. However, our current ability to determine such breakpoint sequences is limited because of challenges in aligning and assembling short reads. To address this issue, we developed a targeted iterative graph routing assembler, TIGRA, which implements a set of novel data analysis routines to achieve effective breakpoint assembly from next-generation sequencing data. In our assessment using data from the 1000 Genomes Project, TIGRA was able to accurately assemble the majority of deletion and mobile element insertion breakpoints, with a substantively better success rate and accuracy than other algorithms. TIGRA has been applied in the 1000 Genomes Project and other projects and is freely available for academic use.

    Footnotes

    • 6 These authors contributed equally to this work.

    • 7 Corresponding author

      E-mail kchen3{at}mdanderson.org

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.162883.113.

    • Received July 1, 2013.
    • Accepted December 3, 2013.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/3.0/.

    | Table of Contents

    Preprint Server