TY - JOUR A1 - Li, Xiaoman A1 - Waterman, Michael S. T1 - Estimating the Repeat Structure and Length of DNA Sequences Using ℓ-Tuples Y1 - 2003/08/01 JF - Genome Research JO - Genome Research SP - 1916 EP - 1922 DO - 10.1101/gr.1251803 VL - 13 IS - 8 UR - http://genome.cshlp.org/content/13/8/1916.abstract N2 - In shotgun sequencing projects, the genome or BAC length is not always known. We approach estimating genome length by first estimating the repeat structure of the genome or BAC, sometimes of interest in its own right, on the basis of a set of random reads from a genome project. Moreover, we can find the consensus for repeat families before assembly. Our methods are based on the ℓ-tuple content of the reads. ER -