Table 2.

Features of Repetitive DNA Families

Repetitive family Initial copy Accession no. for consensus Element size (bp) Minimum no. of copies in genome[i] Potential TSD (bp) Terminal 10-bp sequences(s)
IR-1F18E3 U86946 3791625‘-CTCGGCATTC-3‘
IR-2C35B1 U86947 781474,5,6,95‘-CACTGCAACT-3‘
IR-3F32D1 U86948 57814025‘-AAGGTGGTGT-3‘
IR-4ZK6 U86949 22712None5‘-TATTACCGGT-3‘
IR-5F23F1 U86950 19814None5‘-TATTACGGGA-3‘
TR-1F49F1 U86951 16723None5‘-ATTTACTTCT-3‘
5‘-AATACTACAC-3‘

[i] Copies found as of November 27, 1996 with BLASTP <10−25 using the consensus element to search GenBank and the C. elegans Consortium database, at which time ∼80% of the genome sequence was available. The following features are indicated: name of the family, the cosmid in which the initial element was identified, the GenBank accession number for the consensus element sequence derived from several element copies, the size of the full-length element, the minimal number of copies per genome, target site duplication (if any), and the sequence at the termini. In each case, the 5‘ end of the sequence represents the terminus of the element.