Methods

Estimating the Repeat Structure and Length of DNA Sequences Using ℓ-Tuples

    • 1 Department of Mathematics, University of Southern California, Los Angeles, California 90089, USA
    • 2 Celera Genomics, Rockville, Maryland 20850, USA
Published August 5, 2003. Vol 13 Issue 8, pp. 1916-1922. https://doi.org/10.1101/gr.1251803

Abstract

In shotgun sequencing projects, the length of the genome or BAC is not always known. We estimate genome length by first estimating the repeat structure of the genome or BAC, which is sometimes of interest in its own right, from a set of random reads produced by a genome project. Moreover, we can find the consensus sequence for repeat families before assembly. Our methods are based on the ℓ-tuple content of the reads.
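
To make the idea of "ℓ-tuple content" concrete, the following is a minimal sketch and not the estimator developed in the paper. It counts every length-ℓ substring across a set of reads, then applies a naive coverage argument: if a typical ℓ-tuple from a unique region is seen about d times, the total number of ℓ-tuple positions in the reads divided by d roughly approximates the sequence length. The function names `count_tuples` and `naive_length_estimate`, the use of the median count as a proxy for d, and the simplifications (single strand only, no sequencing errors, no correction for repeats) are all assumptions made for illustration.

```python
from collections import Counter
from statistics import median


def count_tuples(reads, ell):
    """Count occurrences of every length-ell substring across all reads."""
    counts = Counter()
    for read in reads:
        # A read of length n contains n - ell + 1 overlapping ell-tuples.
        for i in range(len(read) - ell + 1):
            counts[read[i:i + ell]] += 1
    return counts


def naive_length_estimate(reads, ell):
    """Crude length estimate: total tuple positions / typical tuple count.

    Illustrative assumption only: the median count over distinct tuples
    stands in for the per-copy sampling depth. Tuples from repeats have
    inflated counts, which is the signal a repeat-aware method would use.
    """
    counts = count_tuples(reads, ell)
    total_positions = sum(max(len(r) - ell + 1, 0) for r in reads)
    typical_depth = median(counts.values())
    return total_positions / typical_depth


if __name__ == "__main__":
    toy_reads = ["ACGTAC", "CGTACG"]  # toy data, not from the paper
    print(count_tuples(toy_reads, 3))
    print(naive_length_estimate(toy_reads, 3))
```

In this sketch, ℓ-tuples whose counts are well above the typical depth are candidates for membership in repeat families, which is why repeat structure has to be accounted for before a length estimate of this kind can be trusted.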
