
Flow diagram of the Drosophila repeat discovery protocol. Sixteen hundred and fifty-six of 14,226 predicted Drosophilaproteins were found to contain repeats (see Methods), of which 523 represented previously unknown repeats. Following a clustering step, these were partitioned into 224 groups and 455 orphans that were subjected individually to manual analyses. This resulted in the identification of 41 families of repeats and domains whose multiple alignments have been deposited in the SMART database (http://www.smart.embl-heidelberg.de).











