RT Journal A1 Pevzner, Paul A. A1 Tang, Haixu A1 Tesler, Glenn T1 De Novo Repeat Classification and Fragment Assembly JF Genome Research JO Genome Research YR 2004 FD September 01 VO 14 IS 9 SP 1786 OP 1796 DO 10.1101/gr.2395204 UL http://genome.cshlp.org/content/14/9/1786.abstract AB Repetitive sequences make up a significant fraction of almost any genome, and an important and still open question in bioinformatics is how to represent all repeats in DNA sequences. We propose a new approach to repeat classification that represents all repeats in a genome as a mosaic of sub-repeats. Our key algorithmic idea also leads to new approaches to multiple alignment and fragment assembly. In particular, we show that our FragmentGluer assembler improves on Phrap and ARACHNE in assembly of BACs and bacterial genomes.