Table 2.

Distribution of Hammerhead and Hammerhead-like Motifs in the Different Sections of the GenBank: Mutants of the Single Stranded Regions

prirodmamvrtinvplnbctrnavrlphgsynunaestpatstsgsshtgTotalTotal over expectedExpected (A = C = G = T)
HH-I5121014176119130037350186454171089160.561635
HH-I-3182721245997560553215916607021251011948640460.577081
HH-I-42702903236522822610693813585543327219630.553576
HH-I-5310701497506400309277107164160139238229910.466488
HH-I-6197601737516312399310758170093116728828210.496973
HH-I-817144112279040745510138115052186106142031620.466883
HH-I-924756122950826045401079735369996534427450.387314
HH-I-12946224411037546134540180138218561374826379464360.946865
HH-I-1320976164142830846259310110739143126622628450.426775
HH-I-143179430364043774877104530573710287732731470.506362
HH-I-17652501820114416205041102255341912211030.l542031
HH-II83331517512614118011343530489112180.542246
HH-II-31807882999464571532041425231022102118250246370.667063
HH-II-41335671138726127811001256584373918821770.514277
HH-II-52448216375734773727141104491070127136233920.398788
HH-II-6234812317522348977381537798567485931137240.497674
HH-II-820964112292138551916130310412655376545241360.547710
HH-II-9255588244573195406140911588482105128131400.397979
HH-II-1212902536160879716534120914183249112556273102780100.978285
HH-II-132149145264213265347162823310317175322832500.447404
HH-II-14281811537414524493614272410388796725634630.438069
HH-II-1710035172201521731971145043671610814630.532786
HH-III42602936765021120965738645270.55952
HH-III-396328186253052450746303107257429121640.643397
HH-III-429512121192128980363502046332322913130.761725
HH-III-514438163530120523103422038876113221917340.483612
HH-III-61342951829018319795823042666103318016430.463540
HH-III-8922646528203260072212029415442925525800.683774
HH-III-9109220122391431960873404626373816415490.493918
HH-III-1265897403236931821709836010711352012147736621.033540
HH-III-13119298142261762463536203457062814214730.413576
HH-III-1413333101124218419516522104088653914615810.453504
HH-III-1749603123818602322015467411746850.491402
GenBank  relative  size0.140.030.010.010.070.070.07<0.010.03<0.01<0.01<0.010.370.020.010.060.101.00

[i] The motifs are named as explained in Methods and in Figure 1. The “expected” number of occurrences was obtained by searching the different motifs in a database of 1000 random sequences of 100,000 nucleotides (equal representations of A, C, G, and T) and correcting the frequency to the relative size of the GenBank. The “total over expected” shows the ratio of occurrences obtained in the GenBank versus the expected ones according to the search performed in a random database with the size of the GenBank.

[ii] (pri) Primate sequence entries (from the two GenBank files); (rod) rodent sequence entries; (mam) other mammalian sequence entries; (vrt) other vertebrate sequence entries; (inv) invertebrate sequence entries; (pln) plant sequence entries (including fungi and algae); (bct) bacterial sequence entries; (rna) structural RNA sequence entries; (vrl) viral sequence entries; (phg) phage sequence entries; (syn) synthetic and chimeric sequence entries; (una) unannotated sequence entries; (est) EST (expressed sequence tag) (from 23 GenBank files); (pat) patent sequence entries; (sts) STS (sequence tagged site) sequence entries; (gss) GSS (genome survery sequence) sequence entries; (htg) HTGS (high throughput genomic sequencing) sequence entries.