Table 2.

Full Size Repeat Insertions within the Rickettsia conorii Genes

Gene name Function Phylogenetic distribution[i] Size[ii] (bp) Location of repeat insertion[iii] Structure data[iv]
RPE-1
coxB(RC0555)Cytochrome c oxidase polypeptide II RPG--YOAE 94516..1591OCC/Bos taurus
era (RC0158)GTP-binding protein RPG-SYO-E 101710..1441EGA/Escherichia coli
gltX (RC0966)Glutamyl-tRNA synthetase RPGCSYOAE 15391006..11491GLN/Thermus thermophilus
gmk (RC1194)Guanylate kinase RPGC-YO-E 68719..1261GKY/Saccharomyces  cerevisiae
hemC (RC0706)Porphobilinogen deaminase RPGC-YOAE 1053772..9181PDA/Escherichia coli
kdtA(RC0118)3-deoxy-D-manno-octulosonic-acid  transferase RP-C--O-- 1392147..290
mesJ(RC0067)Cell cycle protein MesJ RPGCSYO-- 1434625..768
mviN(RC0898)Virulence factor MviN RP-CSYO-- 16656..149
pcnB(RC0015)Poly(A)polymerase RPGCSYO-E 1308120..263
rlpA(RC0537)Rare lipoprotein A precursor RP--SYO-- 96033..175
ssrA tmRNA precursor RPGCSYO-- 47478..223
truB(RC0665)tRNA pseudouridine 55 synthase RPGCSYOAE 1035784..927
ubiG(RC0965)3-demethylubiquinone-9  3-methyltransferase RP------E 867148..291
uhiH(RC0848)Ubiquinone biosynthesis protein RP---Y--E 129331..177
 RC1039Split gene of mannose-1-phosphate  guanylyltransferase -P---YOA- 6279..151
 RC0071Unknown function RP---YO-E 1218985..1128
 RC0127Unknown function --------- 22242..185
 RC0183Unknown function --------- 1158706..840
 RC0209Unknown function R-------- 27922..165
 RC0659Unknown function R-------- 58231..174
 RC0675Unknown function --------- 22518..162
 RC0809Unknown function RPGCSYO-- 735349..492
 RC1172Unknown function --------- 34518..161
 RC1201Unknown function --------- 240100..243
RPE-2
atpG(RC1236)ATP synthase γ chain RPG--YO-E 969622..726 1H8E/Bos taurus
ksgA (RC1022)Dimethyladenosine transferase RPGCSYOAE 945370..468 1YUB/Streptococcus  pneumoniae
nuoC (RC0483)NADH dehydrogenase I chain C RPG--YOAE 726199..303
 RC0698Unknown function R-------- 1002100..201
 RC0715Unknown function R-------- 753299..374
RPE-3
envZ(RC0592)Osmolarity sensor protein EnvZ RP------- 143734..149
lpxB(RC0440)Lipid-A-disaccharide synthase RP-C-YO-- 13381138..1253
murD(RC0560)UDP-N-acetylmuramoylalanineD-glutamate ligase RPGCSYO-- 1500796..917 1E0D/Escherichia coli
ptrB (RC0377)Protease II RPG------ 2187526..641 1QFM/Sus scrofa(pig)
RPE-4
 RC0521Unknown function --------- 33629..122
 RC0679Unknown function --------- 1806243..337
RPE-5
 RC0340Unknown function --------- 17145..165
rnpB RNA subunit (M1 RNA) of  ribonuclease P RPGCSYOAE 458118..228
RPE-6
 RC0614Unknown function --------- 387198..334
RPE-7
 RC1210Unknown function --------- 30330..129
RR-1
 RC1196Unknown function --------- 1809..35

[i] Abbreviations for the organism groups are as follows. R: Rickettsia (Rickettsia prowazekii); P: Proteobacteria (Escherichia coli K-12, Haemoplilus influenzae, Xylella fastidiosa, Vibrio cholerae, Pseudomonas aeruginosa, Buchnera sp., Neisseria meningitidis serogroup A and B, Helicobacter pylori 26695 and J99, Campylobacter jejuni); G: Gram positive bacteria (Bacillus subtilis, Bacillus halodurans, Mycoplasma genitalium, Mycoplasma pneumoniae, Ureaplasma urealyticum, Mycobacterium tuberculosis; C:Chlamydia (Chlamydia trachomatis, Chlamydia muridarum, Chlamydia pneumonia CWL029, AR39 and J138); S: Spirochete (Borrelia burgdorferi, Treponema pallidum); Y: Cyanobacteria (Synechocystis); O: Other bacteria (Deinococcus radiodurans, Aquifex aeolicus, Thermotoga maritima); A: Archaea (Methanococcus jannaschii, Methanobacterium thermoautotrophicum, Archaeoglobus fulgidus, Halobacterium sp., Thermoplasma acidophilum, Pyrococcus horikoshii; Pyrococcus abyssi, Aeropyrum pernix); E: Eukaryotes (Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster). When there is no homolog within an organism group, ‘-’ replaces the organism abbreviation.

[ii] Gene size without stop codon.

[iii] The repeat location is indicated by the base position within the gene.

[iv] Protein Data Bank identifiers and the organism names for the available structure data.