RazerS—fast read mapping with sensitivity control

  1. David Weese (1,3),
  2. Anne-Katrin Emde (1),
  3. Tobias Rausch (2),
  4. Andreas Döring (1) and
  5. Knut Reinert (1)
  1. Department of Computer Science, Free University of Berlin, 14195 Berlin, Germany;
  2. International Max Planck Research School for Computational Biology and Scientific Computing, 14195 Berlin, Germany

    Abstract

    Second-generation sequencing technologies deliver DNA sequence data at unprecedentedly high throughput. Common to most biological applications is the mapping of the reads to an almost identical or highly similar reference genome. Because of the large amounts of data, efficient algorithms and implementations are crucial for this task. We present an efficient read mapping tool called RazerS. It allows the user to align sequencing reads of arbitrary length using either the Hamming distance or the edit distance. Our tool can run either losslessly or, at higher speed, with a user-defined loss rate. For a given loss rate, we present an approach that guarantees that no more reads are lost than specified. This enables the user to adapt to the problem at hand and provides a seamless tradeoff between sensitivity and running time.
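
The two distance measures named in the abstract can be illustrated with a small sketch. This is a naive brute-force illustration only, not the filtering algorithm RazerS uses; the function names (`hamming_distance`, `edit_distance`, `map_read_hamming`) and the `max_errors` parameter are hypothetical and chosen for this example.

```python
def hamming_distance(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires equal-length strings")
    return sum(x != y for x, y in zip(a, b))


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of substitutions, insertions
    and deletions needed to turn a into b (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        curr = [i]
        for j, y in enumerate(b, 1):
            curr.append(min(prev[j] + 1,              # delete x
                            curr[j - 1] + 1,          # insert y
                            prev[j - 1] + (x != y)))  # match or substitute
        prev = curr
    return prev[-1]


def map_read_hamming(read: str, reference: str, max_errors: int):
    """Hypothetical helper: scan every reference position and report
    (position, distance) pairs with at most max_errors mismatches."""
    n = len(read)
    hits = []
    for pos in range(len(reference) - n + 1):
        d = hamming_distance(read, reference[pos:pos + n])
        if d <= max_errors:
            hits.append((pos, d))
    return hits
```

Under the Hamming distance only mismatches are counted, so a read is compared against reference windows of the same length; the edit distance additionally allows insertions and deletions. A scan like `map_read_hamming` is lossless but costs time proportional to the product of read length and reference length per read, which is the kind of cost an efficient read mapper has to avoid.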


    1. Genome Res. Copyright © 2009 by Cold Spring Harbor Laboratory Press
