Table 4.

Known and Potential Restriction Sites (RS) of Size 4 and 6 in Bacterial Genomes

Species %G + C Length 4 Length 6
RS Potential RS RS Potential RS
aepe56.5?NA?NA
aqae43.5? TCGA, GGCC, CTAG, CCGG, GCGC ? CGTACG, AGTACT, CTCGAG, GATATC, GAGCTC
arfu48.6? GATC, CTAG, CGCG, GCGC, GGCC ? CTCGAG, GAGCTC, GTCGAC, AAGCTT, CAGCTG
basu43.5 GGCC, CCGG, CGCG, TCGA AATT, CATG, GATC, TGCA, ACGT ATCGAT, CTGCAG, GGATCC, TCCGGA AAATTT, ATATAT, GAGCTC, CAGCTG, AATATT
bobu28.6? TATA, AATT, GTAC, CATG, ACGT ?NA
caje30.5? GCGC, ACGT, GATC, CATG, GGCC ? GAATTC, GATATC, GAGCTC, ATGCAT, GTATAC
chpn40.5? CTAG, ATAT, CCGG, GTAC, TGCA ?NA
chtr41.3? TATA, CTAG, AATT, ATAT, GTAC ?NA
esco50.8 GATC GGCC, CTAG, CGCG, TATA, CATG AAGCTT, AGCGCT, AGGCCT, AGTACT, ATGCAT, CACGTG, CCGCGG, CGGCCG, GAATTC, GAGCTC, GATATC, GCCGGC, GCGCGC, GGCGCC, GGGCCC, GGTACC, TACGTA CTGCAG, CTGCAG, TCCGGA, GCATGC, GTCGAC
hain38.2 CGCG, GCGC, CATG, CCGG GGCC, TATA, TTAA, AATT, TCGA AAGCTT GTTAAC, TAATTA, GAATTC, GATATC, GTGCAC
hepy38.9 GATC, TCGA, GGCC GCGC, ACGT, CGCG, AGCT, GTAC ? GCGCGC, GATATC, GAATTC, GCTAGC, TCTAGA
meja31.4 CTAG, GATC, GTAC GGCC, CATG, GCGC, ATAT, TATA ? GTTAAC, CATATG, GTATAC, GAGCTC, GATATC
meth49.5 GATC NA?NA
myge31.7? TATA, AGCT, CATG, TGCA, CTAG ? GAATTC, TAATTA, AAGCTT, GGATCC, GGTACC
myle57.8?NA?NA
mypn40.0?NA? CTTAAG, ATTAAT, AAGCTT, AAATTT, CTCGAG
mytu65.6?NA?NA
neme51.8 GATC GGCC, CCGG, CATG, AATT, TGCA CAGCTG GGCGCC, CCGCGG, CGCGCG, GCGCGC, CTGCAG
psae66.6? CTAG, CGCG, GTAC, GGCC, AGCT AGATCT, CAGCTG, CCCGGG, CCGCGG, CTCGAG, CTGCAG, GCATGC, GGATCC GAGCTC, CGATCG, CGGCCG, GTCGAC, AAGCTT
pyab44.7?NA? GTTAAC, GCTAGC, GTCGAC, CCTAGG, AGGCCT
pyho42.0?NA?NA
ripr29.0? TATA, AATT, CTAG, TTAA, TGCA ? TAATTA, GGTACC, GATATC, GGATCC, GAGCTC
sysp47.7 CCGG CGCG, GGCC, GCGC, TGCA, TATA ATCGAT, CCGCGG, TGATCA, TTCGAA GGCGCC, TGCGCA, GAATTC, CAGCTG, ACCGGT
thma46.2 CGCG CTAG, ATAT, TATA, AGCT, GATC ? CACGTG, GTCGAC, CGATCG, AAATTT, ACGCGT
trpa52.8 GATC TATA, CTAG, GGCC, ACGT, ATAT ? AAATTT, GCGCGC, CTGCAG, GAATTC, ACATGT
urur25.5? TATA, TTAA, CATG, AGCT, TCGA ?NA

[i] NA, genomes where palindromes are not avoided.

[ii] ?, unknown RS in the species.

[iii] See Methods for species abbreviations.