RT Journal
A1 Li, Xiaoman
A1 Waterman, Michael S.
T1 Estimating the Repeat Structure and Length of DNA Sequences Using ℓ-Tuples
JF Genome Research
JO Genome Research
YR 2003
FD August 01
VO 13
IS 8
SP 1916
OP 1922
DO 10.1101/gr.1251803
UL http://genome.cshlp.org/content/13/8/1916.abstract
AB In shotgun sequencing projects, the genome or BAC length is not always known. We approach estimating genome length by first estimating the repeat structure of the genome or BAC, sometimes of interest in its own right, on the basis of a set of random reads from a genome project. Moreover, we can find the consensus for repeat families before assembly. Our methods are based on the ℓ-tuple content of the reads.