Table 1.

Assignment Statistics for Each Genome

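| Organism | celeg | sacc | aero | aful | mjan | mthe | pabyssi |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Total residues | 7687386 | 2973530 | 638684 | 662214 | 480086 | 526546 | 535767 |
| Total coverage | 780323 | 379345 | 110547 | 162470 | 104799 | 116705 | 126282 |
| Percentage covered | 10.15 | 12.76 | 17.31 | 24.53 | 21.83 | 22.16 | 23.57 |
| Remaining residues | 6907063 | 2594185 | 528137 | 499744 | 375287 | 409841 | 409485 |
| Potential extra domains | 69070.6 | 25941.9 | 5281.37 | 4997.44 | 3752.87 | 4098.41 | 4094.85 |
| Genes | 16315 | 6284 | 2694 | 2388 | 1704 | 1855 | 1764 |
| Domains assigned | 7438 | 3259 | 871 | 1388 | 875 | 986 | 1022 |
| No. of genes with assignment | 3960 | 1897 | 512 | 799 | 538 | 610 | 617 |
| Percent of genes assigned | 24.27 | 30.19 | 19.01 | 33.46 | 31.57 | 32.88 | 34.98 |

| Organism | pyro | aquae | bbur | bsub | cjej | cpneu | cpneuA |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Total residues | 568465 | 482512 | 282455 | 1216011 | 508329 | 361654 | 362202 |
| Total coverage | 118997 | 124311 | 61327 | 313853 | 120715 | 72247 | 72085 |
| Percentage covered | 20.93 | 25.76 | 21.71 | 25.81 | 23.75 | 19.98 | 19.90 |
| Remaining residues | 449468 | 358201 | 221128 | 902158 | 387614 | 289407 | 290117 |
| Potential extra domains | 4494.68 | 3582.01 | 2211.28 | 9021.58 | 3876.14 | 2894.07 | 2901.17 |
| Genes | 2062 | 1522 | 825 | 4072 | 1619 | 1051 | 1067 |
| Domains assigned | 961 | 1052 | 514 | 2660 | 993 | 597 | 595 |
| No. of genes with assignment | 574 | 606 | 290 | 1493 | 588 | 335 | 333 |
| Percent of genes assigned | 27.84 | 39.82 | 35.15 | 36.67 | 36.32 | 31.87 | 31.21 |

| Organism | ctra | dra1 | ecoli | hinf | hpyl | hpyl99 | mgen |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Total residues | 312177 | 777034 | 1358281 | 520535 | 495345 | 493679 | 174922 |
| Total coverage | 68776 | 181767 | 341229 | 147433 | 101455 | 101180 | 42767 |
| Percentage covered | 22.03 | 23.39 | 25.12 | 28.32 | 20.48 | 20.50 | 24.45 |
| Remaining residues | 243401 | 595267 | 1017052 | 373102 | 393890 | 392499 | 132155 |
| Potential extra domains | 2434.01 | 5952.67 | 10170.5 | 3731.02 | 3938.9 | 3924.99 | 1321.55 |
| Genes | 894 | 2577 | 4266 | 1694 | 1523 | 1482 | 479 |
| Domains assigned | 581 | 1518 | 2677 | 1200 | 803 | 801 | 353 |
| No. of genes with assignment | 322 | 890 | 1597 | 683 | 473 | 475 | 196 |
| Percent of genes assigned | 36.02 | 34.54 | 37.44 | 40.32 | 31.06 | 32.05 | 40.92 |

| Organism | mpneu | mtub | nmenA | paer | rpxx | synecho | tmar |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Total residues | 237564 | 1329160 | 584613 | 1859257 | 278955 | 1032549 | 580647 |
| Total coverage | 45848 | 307541 | 138067 | 458736 | 70406 | 222185 | 144015 |
| Percentage covered | 19.30 | 23.14 | 23.62 | 24.67 | 25.24 | 21.52 | 24.80 |
| Remaining residues | 191716 | 1021619 | 446546 | 1400521 | 208549 | 810364 | 436632 |
| Potential extra domains | 1917.16 | 10216.2 | 4465.46 | 14005.2 | 2085.49 | 8103.64 | 4366.32 |
| Genes | 674 | 3915 | 2026 | 5557 | 831 | 3151 | 1813 |
| Domains assigned | 369 | 2579 | 1144 | 3910 | 580 | 1893 | 1181 |
| No. of genes with assignment | 208 | 1440 | 673 | 2212 | 326 | 1112 | 675 |
| Percent of genes assigned | 30.86 | 36.78 | 33.22 | 39.81 | 39.23 | 35.29 | 37.23 |

| Organism | tpal | uure | vcho1 | xfas |
| --- | --- | --- | --- | --- |
| Total residues | 349767 | 227646 | 855150 | 738838 |
| Total coverage | 69848 | 38947 | 208440 | 164002 |
| Percentage covered | 19.97 | 17.11 | 24.37 | 22.20 |
| Remaining residues | 279919 | 188699 | 646710 | 574836 |
| Potential extra domains | 2799.19 | 1886.99 | 6467.1 | 5748.36 |
| Genes | 1007 | 609 | 2593 | 2669 |
| Domains assigned | 598 | 330 | 1756 | 1320 |
| No. of genes with assignment | 334 | 194 | 969 | 784 |
| Percent of genes assigned | 33.17 | 31.86 | 37.37 | 29.37 |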

[i] The first row gives the total number of residues in an organism's genes available for structural assignment. The next two rows give the number of residues that have a structural assignment and the percentage of residues covered. The number of residues left to annotate (remaining residues) provides a crude estimate of how many extra structural domains may be present; this was calculated by dividing the remaining residues by a typical domain length of 100 residues (Pearl et al. 2001). The next rows give the number of genes in the organism, the number of structural domains assigned, and the number of genes with one or more structural assignments. The final row summarizes this as the percentage of genes with one or more structural assignments.
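
The derived rows can be recomputed from the raw counts. The following is a minimal sketch, not the authors' code: the function name `derived_statistics` and the dictionary keys are illustrative assumptions, and the input values are the C. elegans figures from the table (total residues 7687386, total coverage 780323, genes 16315, genes with assignment 3960).

```python
# Typical domain length in residues, as stated in the footnote (Pearl et al. 2001).
TYPICAL_DOMAIN_LENGTH = 100


def derived_statistics(total_residues, covered_residues, genes, genes_assigned):
    """Recompute the derived rows of the table from the raw counts.

    Hypothetical helper for illustration only; the names are not from the paper.
    """
    remaining = total_residues - covered_residues
    return {
        "percentage_covered": round(100 * covered_residues / total_residues, 2),
        "remaining_residues": remaining,
        # Crude estimate: remaining residues divided by a typical domain length.
        "potential_extra_domains": remaining / TYPICAL_DOMAIN_LENGTH,
        "percent_genes_assigned": round(100 * genes_assigned / genes, 2),
    }


# C. elegans column of the table.
celeg = derived_statistics(7687386, 780323, 16315, 3960)
print(celeg)
```

For this column the sketch gives 10.15 percent of residues covered, 6907063 remaining residues, and 24.27 percent of genes assigned, in agreement with the table; the potential extra domains come out as 69070.63, which the table lists as 69070.6.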

[ii] celeg: Caenorhabditis elegans; sacc: Saccharomyces cerevisiae; aero: Aeropyrum pernix; aful: Archaeoglobus fulgidus; mjan: Methanococcus jannaschii; mthe: Methanobacterium thermoautotrophicum; pabyssi: Pyrococcus abyssi; pyro: Pyrococcus horikoshii; aquae: Aquifex aeolicus; bbur: Borrelia burgdorferi; bsub: Bacillus subtilis; cjej: Campylobacter jejuni; cpneu: Chlamydia pneumoniae; cpneuA: Chlamydophila pneumoniae; ctra: Chlamydia trachomatis; dra1: Deinococcus radiodurans; ecoli: Escherichia coli; hinf: Haemophilus influenzae; hpyl: Helicobacter pylori; hpyl99: Helicobacter pylori J99; mgen: Mycoplasma genitalium; mpneu: Mycoplasma pneumoniae; mtub: Mycobacterium tuberculosis; nmenA: Neisseria meningitidis; paer: Pseudomonas aeruginosa; rpxx: Rickettsia prowazekii; synecho: Synechocystis PCC6803; tmar: Thermotoga maritima; tpal: Treponema pallidum; uure: Ureaplasma urealyticum; vcho1: Vibrio cholerae; xfas: Xylella fastidiosa.