Table 1.

List of COGs and E. coli Gene Designations by Groups

| Group[i] | COG number[ii] | E. coli gene designation[ii] |
| --- | --- | --- |
| 1 | 0048, 0049, 0051, 0052, 0096, 0098, 0099, 0100[iii], 0103, 0184[iii], 0185[iii], 0186, 0199[iii], 0522, 0080[iii], 0081, 0087, 0088, 0089[iii], 0090[iii], 0091, 0093, 0094, 0097, 0102, 0197, 0198[iii], 0244, 0256[iii], [0050], [0231], 0361, [0480], 0532 | rpsL, rpsG, rpsJ, rpsB, rpsH, rpsE, rpsM, rpsK, rpsI, rpsO, rpsS, rpsQ, rpsN, rpsD, rplK, rplA, rplC, rplD, rplW, rplB, rplV, rplN, rplE, rplF, rplM, rplP, rplX, rplJ, rplR, [tufB, cysN, tufA, selB], [yeiP, efp], infA, [fusA, prfC], infB |
| 2 | 0006[iv], 0024, 0112[iv], 0201, 0541, 0552 | [pepP, pepQ, ec1788728], map, glyA, secY, ffh, ftsY |
| 3 | 0085, 0086, 0180, 0202, 0250, 0258, 0468[iv], 0592 | rpoB, rpoC, trpS, rpoA, [nusG, rfaH], [exo, polA_1], recA, dnaN |
| 4 | 0012, [0037] | ychF, [mesJ, ydaO] |
| 5 | [0008[vi]], 0013[vii], 0016[vi], 0018[vi], 0030[vii], 0060[vi], 0072[vi], 0092[v] [vii], 0101[v] [vii], 0124[vii], 0125[vii], 0143[vi], 0162[vi], 0172[vi], 0200[v] [vi], 0255[v] [vii], 0441[vi], 0442[vi], 0459[iv] [vi], 0470[vi], [0492[v] [vi]], 0495[vi], 0525[vii], 0533[vii], [0550[vii]], [0575[iv] [vii]], 0636[v] [vii], [1109[vi]] | [glnS, yadB, gltX], alaS, pheS, argS, ksgA, ileS, pheT_2, rpsC, truA, hisS, tmk, metG_1, tyrS, serS, rplO, rpmC, thrS, proS, groL, holB, [trxB, ahpF], leuS, valS, ygjD, [topA_1, topB], [cdsA, ec1787677], atpE, [cpsG, mrsA] |
| 6 | [0073], [0526] | [pheT_1, ygiH, metG_2], [trxA, yfiG, dsbD, yejO, ybbN, dsbA] |

[i] Group 1: ribosomal proteins and translation factors; 2: ribosome-associated proteins; 3: transcription and replication proteins; 4: proteins of unknown function; 5: proteins that do not exhibit three-domain phylogeny; and 6: protein families.

[ii] Square brackets are used to show COGs that contain more than one E. coli ORF.

[iii] COGs for ribosomal proteins in which the Archaea appear polyphyletic, while both the Bacteria and the Eucarya are strongly supported monophyletic groups.

[iv] These COGs are missing an ORF from a single bacterium with a highly reduced genome and are therefore included in this analysis.

[v] Additional non-three-domain COGs that are missing from a single genome analyzed.

[vi] Non-three-domain COGs with statistically supported lateral gene transfers.

[vii] Non-three-domain COGs with no statistical support for lateral gene transfers.