Table 2.

Percent G + C Mean and Standard Deviations Determined from All Predicted Protein Coding Regions for Complete Genomes of Pathogenic Bacteria (as of April 2001)

Organism Approximate host range— “Primary” disease Intracellular? Notes regarding clonality and evidence of horizontally transferred regions No. of protein-coding ORFs G + C for ORFs > 300 bp[i]
Mean S.D.
Neisseria meningitidis MC58 humans—meningitisextracellularNonclonal, demonstrated horizontal transfer with other species202552.46.9
Neisseria meningitidis Z2491 humans—meningitisextracellularNonclonal, demonstrated horizontal transfer with other species212152.66.5
Xylella fastidiosa 9a5c plants—citrus variegated chlorosisextracellularEvidence of phage-mediated horizontal gene transfer276653.45.4
Escherichia coli O157:H7 warm-blooded animals, including humans—diarrheafacultative intracellularCompared with E coli K12, has higher % G + C S.D. and more predicted horizontally transferred regions528351.05.3
Mycoplasma pneumoniae M129 humans— mycoplasmal pneumoniaextracellular67740.34.9
Vibrio cholerae N16961 chrom. 2 (of 2) humans, zooplankton, other aquatic life—choleraextracellularMore genes than chr. 1 that appear to have origins outside alpha-proteobacteria to which Vibrio belongs; proposed megaplasmid origin109246.94.3
Treponema pallidum Nichols humans—syphillisextracellular103151.44.2
Pseudomonas aeruginosa PAO1 humans, a range of other animals— variety of opportunistic mucosal infectionsextracellular556567.03.8
Ureaplasma urealyticum serovar3 humans—urethritisextracellular61129.33.8
Vibrio cholerae N16961 chr. 1 (of 2) humans, zooplankton, other aquatic life— choleraextracellular273648.13.7
Borrelia burgdorferi B31 humans, rodents, tick vector—Lyme diseasefacultative intracellular85028.73.6
Campylobacter jejuni NCTC11168 humans, fowl, cattle, sheep, dogs, cats—gastroenteritisextracellularNoted for lack of insertion sequences or phage-associated sequences163430.63.5
Mycoplasma genitalium G37 humans—urethritis (opportunistic)extracellular48031.43.5
Pasteurella multocida PM70 range of animals—fowl cholera, cattle septicemia, pig rhinitisextracellular201440.83.3
Helicobacter pylori 266695 humans—peptic ulcers and gastritisextracellularConserved relative to the other H. pylori genome155339.43.4
Haemophilus influenzae Rd-KW20 humans—upper respiratory infection and meningitisextracellularEvidence of horizontal transfer betweenNeisseria and Haemophilus, but not in this genome sequence170938.53.4
Helicobacter pylori J99 humans—peptic ulcers and gastritisextracellularConserved relative to the other H. pylori genome149139.33.3
Mycobacterium tuberculosis CSU93 humans—tuberculosisfacultative intracellular391865.63.3
Rickettsia prowazekii MadridE humans, other animals, lice vector—epidemic typhusobligate intracellularHighly clonal83430.13.3
Chlamydophila pneumoniae AR39 humans—chlamydial pneumoniaobligate intracellularHighly clonal99741.12.6
Chlamydophila pneumoniae CWL029 humans—chlamydial pneumoniaobligate intracellularHighly clonal105241.12.6
Chlamydophila pneumoniae J138 humans—chlamydia pneumoniaobligate intracellularHighly clonal107041.12.6
Chlamydia trachomatis D humans—chlamydiaobligate intracellularHighly clonal89441.52.3
Chlamydia muridarum MoPn humans—chlamydiaobligate intracellularHighly clonal81840.82.3

[i] The calculation appears to be more accurate when ORFs < 300 bp are omitted from the analysis, as evident from a comparison of two highly similar Chlamydia pneumoniae genomes that suggests there are increased errors in gene prediction for genes > 300 bp in length.

[ii] This table is sorted by percent G + C standard deviation (S.D.) for predicted coding regions (ORFs) > 300 bp. Although the sample size is small, this standard deviation appears to correlate with the clonality of the microbe (two-tailed P-value > 0.005 for a nonparametric/Spearman correlation when condition is ranked). A similar analysis that is continually updated and includes nonpathogens, including archaea, is available athttp://www.pathogenomics.bc.ca/IslandPath.html.