Abstract

Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information fromLactococcus to gram-negative enteric bacteria ofSalmonella-Escherichia group.

[The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.]


Lactic acid bacteria (LAB) are a heterogeneous group of microorganisms that convert carbohydrates into lactic acid. They comprise both pathogens (such as Streptococcus pneumoniae or Streptococcus pyogenes) and useful bacteria (such as Streptococcus thermophilus and Lactococcus lactis, which were used for millennia in milk fermentation). Determination and analysis of the genome sequence of a representative LAB is therefore of great interest, as it would provide information allowing us to combat the former and use the latter more efficiently. Until now, no complete and annotated genome sequence of either LAB class has been reported.

In nature, L. lactis occupies a niche related to plant or animal surfaces and the animal gastrointestinal tract. It is believed to be dormant on the plant surfaces and to multiply in the gastrointestinal tract after being swallowed by a ruminant. In contrast, “domesticated” species of L. lactis, used by dairy industry as starters in cheese fermentation, live in a different niche, which is defined by technological considerations, such as fast growth and rapid production of lactic acid in milk. The importance ofL. lactis for humankind can be appreciated from the estimate that close to 107 tons of cheese are made annually (Fox 1989), leading to human consumption of close to 1018 lactococci.

There are two subspecies of L. lactis, designated initially asStreptococcus lactis and Streptococcus cremoris and reclassified more recently as L. lactis ssp. lactisand L. lactis ssp. cremoris, respectively (Schleifer et al. 1985). The former is preferred for making of soft cheeses and the latter for the hard ones. The two subspecies have been intensely studied, mainly because of their industrial interest, and have became excellent models for research on metabolism, physiology, genetics, and molecular biology of LAB.

The questions addressed in research on useful bacteria are often antithetical to those involving pathogens, because one of the basic objectives is to improve rather than to limit bacterial growth. Efficient use of lactococci by dairy industry requires understanding of many aspects of bacterial physiology, such as use of sugars and proteins from milk for growth, conversion of sugars to lactate, and synthesis of substances involved in cheese flavor, and thus of the relationship between different types of fermentation. The potential for new applications of LAB, such as oral vaccines (Steidler et al. 2000) or production of foreign proteins and metabolites, leads to questions concerning the protein secretion system, biosynthesis of cofactors, and regulation of central metabolism. In addition to questions related to the industrial use of lactococci, fundamental biological questions, such as retrohoming of introns (Cousineau et al. 1998), are also being addressed in L. lactis.

A genetic map of a “laboratory workhorse” L. lactis ssp.lactis strain IL1403, based on a low-fidelity diagnostic genome sequencing, has been reported (Bolotin et al. 1999). Here we present the analysis of the accurate sequence of the IL1403 genome, which is the first such report for any lactic acid bacterium. We focus mainly on features related to the importance of L. lactis for humankind, which is its use in dairy fermentation. Also, several unexpected findings are reported, such as a putative chimerical structure of the genome, the possibility that L. lactis can respire, the existence of genes required for DNA transformation, and a discovery of a transfer of genetic information from lactococci to gram-negative enteric bacteria.

RESULTS AND DISCUSSION

Two-Step Sequencing Strategy

The first step of our strategy, designated diagnostic genome sequencing, was described before (Bolotin et al. 1999). Briefly, it implies cloning of relatively short (1–20 kb) genome fragments inEscherichia coli plasmid and phage vectors, and sequencing of a limited number of randomly chosen clones, to a redundancy of about one. A novel procedure, designated multiplex long accurate PCR (MLA PCR), developed and tested in the course of the Bacillus subtilis genome sequencing project (Sorokin et al. 1996; Kunst et al. 1997), is then applied for connecting the resulting contigs and synthesizing the missing genome regions, sequenced subsequently by standard methods. This approach allowed us to establish the entireL. lactis genome sequence and assemble it in a unique contig, with a sequencing redundancy of less than two (Bolotin et al. 1999). Three- to fourfold fewer sequencing reactions were required to reach this goal than if the fully random approach were used. For comparison, only 10,235 reactions were needed to assemble L. lactis genome sequence, whereas 40,020 were required for the genome of Neisseria meningitidis (Tettelin et al. 2000), which is of a similar size. Diagnostic sequence allowed us to identify all L. lactis genes that encode proteins sufficiently similar to those present in the databases. However, the elevated error rate, estimated to be ∼1%, did not allow us to predict the genes unique for L. lactis or the borders of coding region. To obtain a more complete and reliable description of the L. lactis genome, we carried out a second step of our strategy. It involved random sequencing of additional clones until the overall redundancy of ∼6.4 was reached and then primer walking on PCR-generated templates to ensure that each base was sequenced at least four times and at least once on each strand. We designated this step “shotgun polishing” and concluded that the strategy presented here can be a good alternative to the fully random strategy used in most cases (Fraser and Fleischmann 1997). Its advantages should increase even more when a greater number of completely sequenced and thoroughly annotated bacterial genomes becomes available. Carrying out the diagnostic step and polishing only a very little will then be sufficient to determine a reliable genome sequence of bacteria relatively close to the ones that were already sequenced and annotated.

Gene Content

The circular chromosome of L. lactis IL1403 has 2,365,589 bp and an average G+C content of 35.4%. We detected 2310 open reading frames (ORFs) in the sequence, with an average length of 879 bp. Protein-coding genes represent 86% of the genome, stable RNA 1.4%, and noncoding regions 12.6%. These values are similar to those observed for genomes of other bacteria. We have assigned a biochemical or biological role to 64.2% (1482 ORFs) of the genes and classified them into functional categories (Table 1). There are 20.1% of genes (465 ORFs) that match hypothetical coding sequences of unknown function, and the remaining 15.7% (363 ORFs) represent genes with no similarity to known proteins, which can be considered specific for lactococci.

Table 1.

Functional Classification of the Lactococcus lactis Protein-Coding Genes#

AMINO-ACID BIOSYNTHESIS
Aromatic amino-acid family
aroA 18023-phosphoshikimate

 1-carboxyvinyltransferase
aroB18143-dehydroquinate synthase
aroC1811chorismate synthase
aroD16903-dehydroquinate dehydratase
aroE1815shikimate 5-dehydrogenase
aroF120Tyr-sensitive

 phospho-2-dehydro-deoxyheptonate

 aldolase
aroH1281Trp-sensitive

 phospho-2-dehydro-deoxyheptonate

 aldolase
aroK 1801shikimate kinase
pheA 1801prephenate dehydratase
trpA 1494tryptophan synthase alpha chain
trpB 1495tryptophan synthase beta chain
trpC 1498indole-3-glycerol phosphate synthase
trpD 1499anthranilate phosphoribosyltransferase
trpE 1501anthranilate synthase component I
trpF 1497phosphorybosyl-anthranilate isomerase
trpG 1500anthranilate synthase component II
tyrA 1803prephenate dehydrogenase
Aspartate family
asnB357asparagine synthetase B
asnH2312asparagine synthetase
aspB1897aspartate aminotransferase
aspC163aspartate aminotransferase
ceo 1265N5-carboxyethyl-ornithine synthase
dapA1665dihydrodipicolinate synthase
dapB1605dihydrodipicolinate reductase
hom 1172homoserine dehydrogenase
lysA1314diaminopimelate decarboxylase
metA1997homoserine O-succinyltransferase
metB11996cystathionine gamma-synthase
metB2791cystathionine gamma-synthase
metE12845-methionine synthase
metF12825,10-methylenetetrahydrofolate reductase
thrA748aspartokinase
thrB 1173homoserine kinase
thrC2173threonine synthase
Branched chain family
ilvA 1251threonine deaminase
ilvB 1248acetolactate synthase large subunit
ilvC 1250ketol-acid reductoisomerase
ilvD 1247dihydroxy-acid dehydratase
ilvN 1249acetolactate synthase small subunit
leuA 12402-isopropylmalate synthase
leuB 12423-isopropylmalate dehydrogenase
leuC 12443-isopropylmalate dehydratase large subunit
leuD 12453-isopropylmalate dehydratase small subunit
Glutamate family
argB808acetylglutamate kinase
argC805N-acetyl-gamma-glutamyl-phosphate

 reductase
argD807acetylornithine aminotransferase
argE560acetylornithine deacetylase
argG127argininosuccinate synthase
argH129argininosuccinate lyase
argJ806ornithine acetyltransferase
glnA2283glutamine synthetase
gltA668citrate synthase
gltB1319glutamate synthase large subunit
gltD1316glutamate synthase small subunit
proA1651gamma-glutamyl phosphate reductase
proB1652glutamate 5-kinase
proC1953pyrroline-5-carboxylate reductase
Histidine family
hisA 1236phosphoribosylformimino-5-aminoimidazole

 carboxamide ribotide isomerase
hisB 1234imidazoleglycerol-phosphate dehydratase
hisC 1229histidinol-phosphate aminotransferase
hisD 1232histidinol dehydrogenase
hisF 1237cyclase HisF
hisG 1231ATP phosphoribosyltransferase
hisH 1235amidotransferase
hisI 1237phosphoribosyl-AMP cyclohydrolase
hisK1238histidinol phosphatase
hisZ 1230ATP phosphoribosyltransferase regulatory subunit
Serine family
cysD77O-acetylhomoserine sulfhydrylase
cysE1921serine acetyltransferase
cysK792cysteine synthase
cysM527cysteine synthase
glyA592serine hydroxymethyltransferase
serA595D-3-phosphoglycerate dehydrogenase
serB596phosphoserine phosphatase
serC594phosphoserine aminotransferase
BIOSYNTHESIS OF COFACTORS, PROSTHETIC GROUPS,

 AND CARRIERS
Folic acid
dfrA1163dihydropholate reductase
fhs961formyltetrahydrofolate synthetase
folB1166dihydroneopterin aldolase
folC1169folylpolyglutamate synthase
folD877tetrahydrofolate dehydrogenase/cyclohydrolase
folE1167GTP cyclohydrolase I
folP1168dihydropteroate synthase
pabA1349para-aminobenzoate synthase component II
pabB1348para-aminobenzoate synthase component I
Heme and porphyrin
hemH1609ferrohelatase
hemK589protoporphyrinogen oxidase
hemN1154oxygen-independent coproporphyrogen III

 oxidase
Menaquinone and ubiquinone
ispA881farnesyl diphosphate synthase
ispB1380heptaprenyl diphosphate synthase

 component II
menB735dihydroxynaphthonic acid synthase
menD7372-oxoglutarate decarboxylase
menE734O-succinylbenzoic acid-CoA ligase
menF739menaquinone-specific isochorismate

 synthase
menX736protein in menaquinone biosynthesis

 pathway
preA187prenyl transferase
ubiE1718menaquinone biosynthesis methylase
yhdB732racemase
Pantothenate
coaA1467pantothenate kinase
dfpA567pantothenate metabolism flavoprotein
dfpB568flavoprotein
panE1358ketopantoate reductase
Riboflavin and cobalamin
cobC1889alpha-ribazole-5'-phosphate phosphatase
cobQ1115cobyric acid synthase
ribA1024GTP cyclohydrolase II /

 3,4-dihydroxy-2-butanone 4-phosphate

 synthase
ribB1023ribiflavin synthase alpha chain
ribC1142riboflavin kinase
ribG1023riboflavin-specific deaminase
ribH1025ribiflavin synthase beta chain
Thioredoxin, glutaredoxin, and glutathione
gpo1402glutathione peroxidase
gshR864glutathione reductase
trxA1692thioredoxin
trxB1966thioredoxin reductase
trxB21695thioredoxin reductase
trxH396thioredoxin H-type
Thiamin
apbE1125thiamine biosynthesis lipoprotein
thiD11295phosphomethylpyrimidine kinase
thiD2485phosphomethylpyrimidine kinase
thiE1294thiamin-phosphate pyrophosphorylase
thiM1295hydroxyethylthiazole kinase
Pyridine nucleotides
nadE1110NAD-synthetase
yvdG2139pyridine nucleotide-disulfide oxidoreductase
CELL ENVELOPE
Membranes, lipoproteins, and porins
bmpA1462basic membrane protein A
cdsA2200phosphatidate cytidylyltransferase
clsA988cardiolipin synthase
clsB1188cardiolipin synthase
dgkA1095diacylglycerol kinase
lgt606prolipoprotein diacylglycerol transferase
pgsA2047CDP-diacylglycerol-phosphate

 phosphatidyltransferase
plpA318outer membrane lipoprotein precursor
plpB319outer membrane lipoprotein precursor
plpC320outer membrane lipoprotein precursor
plpD321outer membrane lipoprotein precursor
yfjC596acylphosphate phosphohydrolase
Murein sacculus and peptidoglycan
acmA 269N-acetylmuramidase
acmB1977N-acetylmuramidase
acmC1403N-acetylmuramidase
acmD528N-acetylmuramidase
asd1667aspartate-semialdehyde dehydrogenase
dacA2356D-alanyl-D-alanine carboxypeptidase
dacB976D-alanyl-D-alanine carboxypeptidase
dal862alanine racemase
ddl341D-alanine-D-alanine ligase
glmU1952UDP-N-acetylglucosamine

 pyrophosphorylase
mraY892phospho-N-acetylmuramoyl-pentapeptide

 transferase
mreC2316cell shape determining protein
mreD2315cell shape determining protein
murA11314UDP-N-acetylglucosamine

 1-carboxyvinyltransferase
murA2535UDP-N-acetylglucosamine

 1-carboxyvinyltransferase
murB1175UDP-N-acetylenolpyruvoylglucosamine

 reductase
murC2119UDP-N-acetylmuramate-alanine ligase
murD1634UDP-N-acetylmuramoylalanine D-glutamate

 ligase
murE1871UDP-MurNac-tripeptide synthetase
murF342D-Ala-D-Ala adding enzyme
murG1633peptidoglycan synthesis protein MurG
murI1313glutamate racemase
pbp1B393penicillin-binding protein 1B
pbp2A2178penicillin-binding protein 2a
pbp2B339penicillin-binding protein 2B
pbpX890penicillin-binding protein
ponA530penicillin-binding protein 1A
racD2310aspartate racemase
uppS2201undecaprenyl pyrophosphate synthetase
Surface polysaccharides, lipopolysaccharides and antigens
dltA1293D-alanine activating enzyme
dltB1291peptidoglycan biosynthesis protein
dltC1290D-alanyl carrier protein
dltD1290D-alanine transfer protein DltD
dltE145oxidoreductase
floL746flotillin-like protein
hasC1378UTP-glucose-1-phosphate uridylyltransferase
icaA681glycosyl transferase
icaB683intercellular adhesion protein IcaB
icaC684collagen adhesin
kdtB2239lipopolysaccharide core biosynthesis protein
mvaA1611hydroxymethylglutaryl-CoA reductase
mycA981myosin-crossreactive antigen
pspA2304glucosyltransferase-S
pspB2306glucosyltransferase-S
rgpA202rhamnosyltransferase
rgpB203rhamnosyltransferase
rgpE207glycosyltransferase
rgpF209polysaccharide biosynthesis protein
tagB953teichoic acid biosynthesis protein B
tagD1220glycerol-3-phosphate cytidiltransferase
tagD2951glycerol-3-phosphate cytidyltransferase
tagF952teichoic acid biosynthesis protein F
tagL936exopolysaccharide biosynthesis protein
tagX948teichoic acid biosynthesis protein
tagY945teichoic acid biosynthesis protein
tagZ943teichoic acid biosynthesis protein
ycbB212glycosyltransferase
ycbD213UDP-glucose 4-epimerase
ycbF215LPS biosynthesis protein
ycbG216LPS biosynthesis protein
ycbH217sugar transferase
ycbI218sugar transferase
ycbJ219LPS biosynthesis protein
ycbK214polysaccharide biosynthesis export protein
yijG899glycosyl transferase
yjeF949lipopolysaccharide biosynthesis protein
ymjE1297glycosyl transferase
ymjF1299UDP-N-acetylglucosamine 2-epimerase
yohH1478lipopolysaccharide biosynthesis protein
yohJ1479lipopolysaccharide biosynthesis protein
ysfC1853polysacharide biosynthesis protein
ywaF2206glycosyltransferase
ywaG2207lipopolysaccharide biosynthesis protein
CELLULAR PROCESSES
Cell division
ezrA2225cell division regulator
ftsA1940cell division protein FtsA
ftsE1000cell-division ATP-binding protein FtsE
ftsH 27cell division protein FtsH
ftsK1705cell division protein FtsK
ftsQ1632cell division protein FtsQ
ftsW1663cell division protein FtsW
ftsW2908cell division protein FtsW
ftsX1001cell division protein
ftsY825cell division protein FtsY
ftsZ1938cell division protein FtsZ
gidA1915glucose inhibited division protein GidA
gidB1381glucose-inhibited division protein GidB
gidC1257glucose inhibited division protein GidC
mesJ24cell cycle protein MesJ
parA99chromosome partitioning protein
rodA917rod-shape determining protein
smc812chromosome segregation SMC protein
Cell killing
hly498hemolysin like protein
Chaperones
dnaK 979DnaK protein
groEL 40060 KD chaperonin
groES 39910 KD chaperonin
sugE25SugE protein
Detoxification
ahpC336alkyl hydroperoxide reductase
ahpF337alkyl hydroperoxide reductase
sodA 413superoxide dismutase
Protein and peptide secretion
ffh1658signal recognition particle protein Ffh
lspA1026lipoprotein signal peptidase
secA118preprotein translocase SecA subunit
secE2175preprotein translocase SecE subunit
secG967protein-export protein SecG
secY2159preprotein translocase SecY subunit
sipL2351signal peptidase I
tig536trigger factor
Transformation
coiA1785competence protein CoiA
comC2104type 4 prepilin-like protein specific leader

 peptidase
comEA1833competence protein ComEA
comEC1832competence protein ComEC
comFA1098competence protein ComFA
comFC1097competence protein ComFC
comGA2189competence protein ComGA
comGB2188competence protein ComGB
comGC2187competence protein ComGC
comGD2187competence protein ComGD
comX2224competence regulator ComX
dprA1254DNA processing SMF protein
radA2150DNA repair protein RadA
recQ1874ATP-dependent DNA helicase RecQ
CENTRAL INTERMEDIARY METABOLISM
General
metK1971S-adenosylmethionine synthetase
pcaC2052gamma-carboxymuconolactone

 decarboxylase
Amino sugars
femD436phosphoglucosamine mutase
glmS1035glucosamine-fructose-6-phosphate

 aminotransferase
nagA1374N-acetylglucosamine-6-phosphate

 deacetylase
nagB1615glucosamine-6-P isomerase
ylfH1157N-acetylglucosamine catabolic protein
ypcD1524endo-beta-N-acetylglucosaminidase
Degradation of polysaccharides
agl1732alpha-glucosidase
amyL1278alpha-amylase
amyY1734alpha-amylase
apu703amylopullulanase
chiA2027chitinase
dexA1736oligo-1,6-glucosidase
dexB1526alpha 1-6-glucosidase
dexC1738neopullulanase
lnbA1527lacto-N-biosidase
xynD282endo-1,4-beta-xylanase D
yucG2028chitin binding protein
Phosphorus compounds
apl719alkaline phosphatase
Polyamine biosynthesis
yqfF1657spermidine acetyltransferase
Other
glgA699glycogen synthase
glgB1471,4-alpha-glucan branching enzyme
glgC697glucose-1-phosphate adenylyltransferase
glgD698glucose-1-phosphate adenylyltransferase
glgP701glycogen phosphorylase
mapA1730maltosephosphorylase
xylH5684-oxalocrotonate tautomerase
ENERGY METABOLISM
Aerobic
cbr144carbonyl reductase
noxA841NADH dehydrogenase
noxB842NADH dehydrogenase
noxC795NADH oxidase
noxD2195NADH oxidase
noxE397NADH oxidase
poxL2130pyruvate oxidase
yahI78short-chain type dehydrogenase
ybdE134oxidoreductase
ybiE186oxidoreductase
ycdG234oxidoreductase
ycgD264oxidoreductase
ycgG267oxidoreductase
yddB333oxidoreductase
ygcA620oxidoreductase
yhgA760oxidoreductase
yiaB802oxidoreductase
ymgK1268oxidoreductase
yneD1343oxidoreductase
ypaI1509oxidoreductase
ypgB1562oxidoreductase
yphA1571NADH dehydrogenase
yphC1574oxidoreductase
ypiA1580oxidoreductase
ypjA1591dehydrogenase
ypjF1595oxidoreductase
ypjH1599oxidoreductase
yrbA1711oxidoreductase
yrfB1751NADH-dependent oxidoreductase
yrjB1791oxidoreductase
yrjC1792iron-binding oxidase subunit
ysjB1892oxidoreductase
yteC1944oxidoreductase
yudI2036oxidoreductase
yugB2066oxidoreductase
yugC2068dehydrogenase
yxdE2338oxidoreductase
Amino acids and amines
ansB743L-asparaginase
araT 57aromatic amino acid specific

 aminotransferase
arcA2115arginine deiminase
arcB2114ornithine carbamoyltransferase
arcC12111carbamate kinase
arcC22110carbamate kinase
arcC31752carbamate kinase
arcT2109aminotransferase
argF809ornithine carbamoyltransferase
bcaT1322branched-chain amino acid

 aminotransferase
gadB 1325glutamate decarboxylase
hicD490L-2-hydroxyisocaproate dehydrogenase
ipd1340indole-3-pyruvate decarboxylase
otcA1757ornithine carbamoyltransferase
pdc2011phenolic acid decarboxylase
pfs19505'-methylthioadenosine/S-adenosylhomocysteine

 nucleosidase
sdaA845alpha-subuni L-serine dehydratase
sdaB844beta-subunit of L-serine dehydratase
yciA281amino acid amidohydrolase
yjiB983amino acid aminohydrolase
ytjE1995aminotransferase
ywjF22993-hydroxyisobutyrate dehydrogenase
Anaerobic
dhaK245dihydroxyacetone kinase
dhaL246dihydroxyacetone kinase
dhaM247dihydroxyacetone kinase
glpD1271glycerol-3-phosphate dehydrogenase
glpK1273glycerol kinase
gpdA1377glycerol-3-phosphate dehydrogenase
lctO1280L-lactate oxidase
ylbE1120oxidoreductase
ATP-proton motive force interconversion
atpA1826ATP synthase alpha subunit
atpB1829ATP synthase subunit a
atpD1824ATP synthase alpha subunit
atpE1823ATP synthase epsilon subunit
atpF1828ATP synthase subunit b
atpG1825ATP synthase gamma subunit
atpH1828ATP synthase delta subunit
Electron transport
cydA708cytochrome D ubiquinol oxidase subunit I
cydB710cytochrome D ubiquinol oxidase subunit II
fer1762ferredoxin
ndrH1006glutaredoxin-like protein NrdH
ndrI1005ribonucleotide reductase
nifJ431pyruvate-flavodoxin oxidoreductase
nifS1928pyridoxal-phosphate dependent

 aminotransferase NifS
nifU1848NifU protein
nifZ523pyridoxal-phosphate dependent

 aminotransferase
qor724quinone oxidoreductase
yfiJ588NADPH-flavin oxidoreductase
yfjE598flavodoxin
yviC2181FMN-binding protein
Entner-Doudoroff
kdgA16722-dehydro-3-deoxyphosphogluconate

 aldolase
kdgK16732-dehydro-3-deoxygluconokinase
Fermentation
ackA12091acetate kinase
ackA22089acetate kinase
adhA1873alcohol dehydrogenase
adhE 2231alcohol-acetaldehyde dehydrogenase
aldB 1253alpha-acetolactate decarboxylase
aldC1117alpha-acetolactate decarboxylase
als1201alpha-acetolactate synthase
butA919acetoin reductase
butB9182,3-butanediol dehydrogenase
frdC1139fumarate reductase flavoprotein subunit
mae1204malate oxidoreductase
mleS 923malolactic enzyme
pfl 659pyruvate-formate lyase
pflA1881pyruvate-formate lyase activating enzyme
pta1709phosphate acetyltransferase
yseE18462-nitropropane deoxygenase
Gluconeogenesis
fbp255fructose-1,6-bisphosphatase
Glycolysis
enoA634enolase
enoB2752-phosphoglycerate dehydratase
fbaA1980fructose-bisphosphate aldolase
gapA 554glyceraldehyde 3-phosphate dehydrogenase
gapB2333glyceraldehyde 3-phosphate dehydrogenase
ldh 1370L-lactate dehydrogenase
ldhB380L-lactate dehydrogenase
ldhX1143L-lactate dehydrogenase
pfk 13726-phosphofructokinase
pgiA2245glucose-6-phosphate isomerase A
pgk243phosphoglycerate kinase
pgmB 442beta-phosphoglucomutase
pmg335phosphoglycerate mutase
pycA665pyruvate carboxylase
pyk 1371pyruvate kinase
tpiA 1149triosephosphate isomerase
yjhF975phosphoglycerate mutase
yrjI1800phosphoglycerate mutase
Pentose phosphate pathway
dxsA15101-deoxyxylulose-5-phosphate synthase
dxsB17251-deoxyxylulose-5-phosphate synthase
gnd609decarboxylating 6-phosphogluconate

 dehydrogenase
ptk1540phosphoketolase
rpe2004ribulose-phosphate 3-epimerase
rpiA2317ribose 5-phosphate isomerase A
tkt1670transketolase
zwf2302glucose-6-phosphate 1-dehydrogenase
Pyruvate dehydrogenase
pdhA64PDH E1 component alpha subunit
pdhB63PDH E1 component beta subunit
pdhC61dihydrolipoamide acetyltransferase

 component of PDH complex
pdhD60lipoamide dehydrogenase component of

 PDH complex
Sugars
bglA423phospho-beta-glucosidase
bglH1490beta-glucosidase
bglS180beta-glucosidase A
galE2055UDP-glucose 4-epimerase
galK2061galactokinase
galM2062aldose 1-epimerase
galT2060galactose-1-phosphate uridylyltransferase
glk2101glucose kinase
gntK2269gluconate kinase
gntZ22716-phosphogluconate dehydrogenase
lacC985tagatose-6-phosphate kinase
lacZ2057beta-galactosidase
maa1735maltose O-acetyltransferase
malQ6954-alpha-glucanotransferase
mtlD34mannitol 1-phosphate 5-dehydrogenase
pmi780mannose-6-phosphate isomerase
rbsK1687ribokinase
scrK1518fructokinase
thgA2058thiogalactoside acetyltransferase
uxaC1674glucuronate isomerase
uxuA1678D-mannonate dehydratase
uxuB1679fructuronate reductase
xylA1550xylose isomerase
xylB1548xylulose kinase
xylM1547aldose 1-epimerase
xylX1543acetyltransferase hypothetical protein
xynB1544beta-1,4-xylosidase
yeeB443sugar hydrolase
ygjD6944-alpha-glucanotransferase
yidC834beta-glucosidase
yncA1321acetyltransferase
ypbG1519sugar kinase
ypcA1521beta-glucosidase
ypdB1532sugar hydrolase
ypdD1537sugar hydrolase
yrcA1722phospho-beta-glucosidase
TCA cycle
citB670aconitate hydratase
citC1207acetate-SH-citrate lyase ligase
citD1208citrate lyase acyl-carrier protein
citE1209citrate lyase beta chain
citF1210citrate lyase alpha chain
citG1211CitG protein
icd672isocitrate dehydrogenase
FATTY ACID AND PHOSPHOLIPID METABOLISM
General
accA790acetyl-CoA carboxylase carboxyl transferase

 subunit alpha
accB786biotin carboxyl carrier protein of acetyl-CoA

 carboxylase
accC788biotin carboxylase
accD789acetyl-CoA carboxylase carboxyl transferase

 subunit betta
acpA782acyl carrier protein
acpD116acyl carrier protein phosphodiesterase
acpS862acyl carrier protein synthase
cfa1972cyclopropane fatty acid synthase
fabD783malonyl CoA-acyl carrier protein

 transacylase
fabF7853-oxoacyl-acyl carrier protein synthase II
fabG17843-oxoacyl-acyl carrier protein reductase
fabG218453-oxoacyl-acyl carrier protein reductase
fabH7823-oxoacyl-acyl-carrier-protein synthase III
fabI562NADH-dependent enoyl-ACP reductase
fabZ1561hydroxymyristoyl-acyl carrier protein

 dehydratase
fabZ27873R-hydroxymyristoyl-acyl carrier protein

 dehydratase
fadA1843acetyl coenzyme A acetyltransferase
fadD655long-chain acyl-CoA synthetase
hmcM1614hydroxymethylglutaryl-CoA synthase
lplL65lipoate-protein ligase
plsX72fatty acid/phospholipid synthesis protein
thiL1613acetyl coenzyme A acetyltransferase
ydiD386acyl carrier protein phosphodiesterase
yeaG408mevalonate kinase
yeaH410diphosphomevalonate decarboxylase
yebA411mevalonate kinase
yscE1830lipase
PURINES, PYRIMIDINES, NUCLEOSIDES AND NUCLEOTIDES
2'-deoxyribonucleotide metabolism
dcdA1156dCMP deaminase
nrdE1004ribonucleoside-diphosphate reductase alpha

 chain
nrdF1002ribonucleoside-diphosphate reductase beta

 chain
Nucleotide and nucleoside interconversions
cmk1761cytidine monophosphate kinase
dukA494deoxynucleoside kinase
dukB1171deoxynucleoside kinase
nucA1101nucleotidase
pyrH2088UMP-kinase
ycjM301phosphatase
Purine ribonucleotide biosynthesis
guaA1517GMP synthase
guaB222IMP dehydrogenase
guaC1159GMP reductase
hprT1561hypoxanthine-guanine

 phosphoribosyltransferase
purA2029adenylosuccinate synthase
purB1689adenylosuccinate lyase
purC1578phosphoribosylaminoimidazole-succinocarboxamide synthetase
purD 1554phosphoribosylamine-glycine ligase
purE 1553phosphoribosylaminoimidazole carboxylase
purF1572phosphoribosylpyrophosphate

 amidotransferase
purH1560bifunctional purine biosynthesis protein

 PurH
purK 1552phosphoribosylaminoimidazole carboxylase
purL1575phosphoribosyl formylglycinamidine

 synthase II
purM1566phosphoribosyl-aminoimidazole synthetase
purN1565phosphoribosylglycinamide

 formyltransferase
purQ1577phosphoribosyl formylglycinamidine

 synttase I
Pyrimidine ribonucleotide biosynthesis
carA1645glutaminase of carbamoyl-phosphate

 synthase
carB 1400carbamoylphosphate synthetase
dut181deoxyuridine 5'-triphosphate

 nucleotidhydrolase
pydA 1593dihydroorotate dehydrogenase A
pydB 1383dihydroorotate dehydrogenase B
pyrB1646aspartate carbamoyltransferase
pyrC1082dihydroorotase
pyrE1081orotate phosphoribosyltransferase
pyrF 1382orotidine-phosphate decarboxylase
pyrZ1384dihydroorotate dehydrogenase electron

 transfer subunit
thyA 1583thymidylate synthase
yeaB404thymidylate kinase
Salvage of nucleosides and nucleotides
add288adenosine deaminase
adk2158adenylate kinase
apt623adenine phosphoribosyltransferase
cdd1463cytidine deaminase
deoB 956phosphopentomutase
deoC1464deoxyribose-phosphate aldolase
deoD 957purine nucleoside phosphorylase
gmk1967guanylate kinase
hpt 25hypoxantine-guanine

 phosphorybosyltransferase
nrdD272anaerobic ribonucleoside-triphosphate

 reductase
nrdG273anaerobic ribonucleoside-triphosphate

 reductase activating protein
pdp1465pyrimidine-nucleoside phosphorylase
prsA826ribose-phosphate pyrophosphokinase
prsB1926ribose-phosphate pyrophosphokinase
udk1710uridine kinase
udp855uridine phosphorylase
upp 1992uracil phosphoribosyltransferase
xpt1160xanthine phosphoribosyltransferase
yfiG585thymidine kinase
Sugar-nucleotide biosynthesis and interconversions
cpsM199dTDP-4-keto-6-deoxyglucose-3,5-epimerase
rmlA197glucose-1-phosphate thymidylyltransferase
rmlB200dTDP-glucose 4,6-dehydratase
rmlC201dTDP-L-rhamnose synthase
REGULATORY FUNCTIONS
General
ahrC883transcriptional regulator
aldR1253regulatory protein AldR
argR2118arginine catabolic regulator
birA11840bifunctional protein BirA
birA21973bifunctional protein BirA
codY164transcriptional regulator
codZ1865transcriptional regulator
copR845transcriptional regulator
fur1506ferric uptake regulator
gadR 1327positive regulator
glnB1636nitrogen regulatory protein P-II
glnR2284glutamine synthetase repressor
gntR2272transcription regulator
nadR2067transcriptional regulator
phoU1771phosphate transport system regulator
purR2351regulator of purine biosynthetic genes
pyrR1648pyrimidine operon regulator
rarA1649transcriptional regulator
rcfA2083transcriptional regulator
rcfB2318transcriptional regulator
relA108ppGpp synthetase I
rmeA1947transcriptional regulator
rmeB1508transcriptional regulator
rmeC237transcriptional regulator
rmeD2053transcriptional regulator
tagR936transcriptional regulator
tenA1839transcriptional regulator TenA
yabA11transcriptional regulator
yabB13transcriptional regulator
ybdA131transcription regulator
ybdG135transcriptional regulator
ybeD146transcriptional regulator
ycdF235transcriptional regulator
ycfA250transcriptional regulator
ydbF316transcriptional regulator
ydcG327transcriptional regulator
yebF418transcriptional regulator
yecA420transcriptional regulator
yecE427transcriptional regulator
yeeG446transcriptional regulator
yfbM518transcriptional regulator
yfeA548transcription regulator
yfjG600transcriptional regulator
ygfC654transcriptional regulator
yhgC763transcriptional regulator
yidA831transcription regulator
yjaD904transcriptional regulator
yjaJ910transcriptional regulator
yjfE955transcription regulator
yjjB992transcriptional regulator
ykhI1078transcriptional regulator
yleF1148transcription regulator
yliA1180positive transcriptional regulator
yljC1193transcriptional regulator
ymcE1223transcriptional regulator
ymiA1285metalloregulator
ynaB1303transcriptional regulator
yogL1469transcriptional regulator
yohC1472transcriptional regulator
ypfD1555transcriptional regulator
ypgC1563transcription regulator
yqbH1618transcriptional regulator
yrbI1717transcriptional regulator
yrfA1750transcription regulator
yrfE1760transcription regulator
ysfD1854regulatory protein
ysfG1857transcriptional regulator
ysgA1866transcriptional regulator
yugA2065transcription regulator
ywdE2237transcription regulator
ywiI228transcriptional regulator
ywjD2297transcription regulator
yxcB2326transcriptional regulator
yxdD2337transcriptional regulator
Two-component systems
kinA 1638sensor protein kinase
kinB 1460sensor protein kinase
kinC 402sensor protein kinase
kinD 912sensor protein kinase
kinE 1032sensor protein kinase
kinF1726sensor protein kinase
kinG1804sensor protein kinase
llrA1639two-component system regulator
llrB1458two-component system regulator
llrC403two-component system regulator
llrD913two-component system regulator
llrE1031two-component system regulator
llrF1727two-component system regulator
llrG1805two-component system regulator
llrH1758two-component system regulator
LacI-family regulators
ccpA 1696catabolite control protein A
rbsR1688ribose operon repressor
rliA1728transcriptional regulator
rliB1536transcriptional regulator
rliC731transcriptional regulator
rliDA2215transcriptional regulator
rliDB2218transcriptional regulator
LysR-family regulators
fhuR331fhu operon transcriptional regulator
mleR 896malolactic fermentation system

 transcriptional activator
mtlR32transcriptional regulator
rlrA1264transcriptional regulator
rlrB1946transcriptional regulator
rlrC1341transcriptional regulator
rlrD381transcriptional regulator
rlrE1598transcriptional regulator
rlrG378transcriptional regulator
AraC-family regulators
adaA519methylphosphotriester-DNA alkyltransferase
xylR1551xylose operon regulator
yneE1344transcriptional regulator
GntR-family regulators
busR1476transcriptional regulator
kdgR1680transcriptional regulator
rgrA437transcriptional regulator
rgrB1461transcriptional regulator
DeoR-family regulators
citR1206citrate lyase regulator
lacR 984lactose transport regulator
rdrA797transcriptional regulator
rdrB1332transcriptional regulator
MarR-family regulators
rmaA750transcriptional regulator
rmaB715transcriptional regulator
rmaC1503transcriptional regulator
rmaD115transcriptional regulator
rmaE1511transcriptional regulator
rmaF1341transcriptional regulator
rmaG781transcriptional regulator
rmaH932transcriptional regulator
rmaI1583transcriptional regulator
rmaJ584transcriptional regulator
zitR2185zinc transport transcriptional regulator
BglG-family regulators
bglR 1493beta-glucoside operon antiterminator
GTP-binding proteins
eraL355GTP-binding protein Era
hflX225GTP-binding protein HflX
obgL1630GTP-binding protein Obg
thdF2328GTP-binding protein ThdF
typA2094GTP-binding protein TypA/BipA
ylqL1330GTP-binding protein
yphL762GTP-binding protein
yqeL224GTP-binding protein
ysxL1165GTP-binding protein
yyaL12GTP-binding protein
REPLICATION
Degradation of DNA
exoA799exodeoxyribonuclease A
nth1084endonuclease III
recJ622single-stranded DNA specific exonuclease
rexA 8subunit A of ATP-dependent exonuclease
rexB 5subunit B of ATP-dependent exonuclease
sbcC1354ATP-dependent dsDNA exonuclease
sbcD1357exonuclease SbcD
uvrA1887excinuclease ABC subunit A
uvrB557excinuclease ABC subunit B
uvrC857excinuclease ABC subunit C
xseA878exonuclease VII large subunit
xseB879exonuclease VII small subunit
DNA replication, restriction, modification, recombination,

 and repair
cshA100chromosome segregation helicase
dinG1900ATP-dependent helicase DinG
dnaA1replication initiation protein DnaA
dnaB758replication protein DnaB
dnaC754replicative DNA helicase
dnaD1083DNA replication protein DnaD
dnaE496DNA polymerase III, alpha chain 2
dnaG545DNA primase
dnaH2279DNA polymerase III, subunits beta and tau
dnaI759primosomal protein DnaI
dnaJ 2308DnaJ protein
dnaN2DNA polymerase III, beta chain
dnaQ1010DNA polymerase III, epsilon chain
gyrA1123DNA gyrase subunit A
gyrB929DNA gyrase subunit B
hexA2294mismatch repair protein MutS
hexB2291DNA mismatch repair protein MutL
holB405DNA polymerase III, delta' subunit
hsdM 645type I restriction enzyme M protein
hsdR 642type I restriction enzyme R protein
hsdS 646type I restriction enzyme specificity protein
hslA502HU like DNA-binding protein
hslB903HU-like DNA-binding protein
ligA425DNA ligase
mutM 358formamidopyrimidine-DNA glycosylase
mutS1693DNA mismatch repair protein MutS
mutX1136mutator protein MutT
mutY859A/G-specific adenine glycosylase
ogt5196-O-methylguanine-DNA methyltransferase
parC1012topoisomerase IV subunit B
parE1008topoisomerase IV subunit B
pcrA1135ATP-dependent helicase PcrA
polA2212DNA polymerase I
polC2192DNA polymerase III, alpha chain
priA1965primosomal protein N'
radC1036DNA repair protein RadC
recA 359RecA protein
recD1798exodeoxyribonuclease V alpha chain
recF2052RecF protein
recG2331ATP-dependent DNA helicase RecG
recM340RecM protein
recN884DNA repair protein RecN
ruvA2290DNA helicase RuvA
ruvB2289DNA helicase RuvB
snf2122SWI/SNF family helicase
ssbA398single-strand binding protein
ssbB2274single-strand binding protein
tag1137DNA-3-methyladenine glycosidase I
topA1256DNA topoisomerase I
umuC 581SOS responce UmuC protein
ung233uracil-DNA glycosilase
xerD635integrase-recombinase
ybaH101acetyl transferase
yffD556diadenosine 5',5”'-P1,P4-tetraphosphate hydrolase
ymgA1260integrase-recombinase
TRANSCRIPTION
Degradation of RNA
pnpA1923polyribonucleotide nucleotidyltransferase
rnc810ribonuclease III
rnhA2350ribonuclease HII
rnhB1329ribonuclease HII
vacB1968ribonuclease
vacB21227ribonuclease
RNA synthesis, modification, and DNA transcription
greA626transcription elongation factor GreA
mfd19transcription-repair coupling factor
nusA774transcription termination protein NusA
nusB693transcription termination protein NusB
nusG2174transcription antitermination protein
papL1603poly(A) polymerase
queA1617S-adenosylmethionine tRNA

 ribosyltransferase
rluA2182pseudouridine synthase
rluB1308pseudouridine synthase
rluC1390pseudouridine synthase
rluD1027pseudouridine synthase
rluE368pseudouridine synthase
rpoA2153DNA-directed RNA polymerase alpha chain
rpoB1863DNA-directed RNA polymerase beta chain
rpoC1859DNA-directed RNA polymerase beta' chain
rpoD 547major RNA polymerase sigma factor
rpoE624DNA-directed RNA polymerase delta chain
rrmA1365rRNA methyltransferase
rsuA2327ribosomal small subunit pseudouridine

 synthase A
sigX2243RNA polymerase ECF sigma factor
smpB1777tmRNA-binding protein SmpB
sunL1958rRNA methylase
trmD1607tRNA methyltransferase
yfjD597tRNA/rRNA methyltransferase
RNA processing
rheA354ATP-dependent RNA helicase
rheB416ATP-dependent RNA helicase
rimM160716S rRNA processing protein
TRANSLATION
Amino acyl tRNA synthetases
alaS1780alanyl-tRNA synthetase
argS2117arginyl-tRNA synthetase
asnS1896asparaginyl-tRNA synthetase
aspS2041aspartyl-tRNA synthetase
cysS1919cysteinyl-tRNA synthetase
gltX2141glutamyl-tRNA synthetase
glyS1102glycyl-tRNA synthetase alpha chain
glyT1104glycyl-tRNA synthetase beta chain
hisS2043histidyl-tRNA synthetase
ileS1933isoleucyl-tRNA synthetase
leuS829leucyl-tRNA synthetase
lysS377lysyl-tRNA synthetase
metS800methyonyl-tRNA synthetase
pheS2010phenylalanil-tRNA synthetase alpha chain
pheT2008phenylalanyl-tRNA synthetase beta chain
proS2197prolyl-tRNA synthetase
serS1768seryl-tRNA synthetase
thrS1988theronyl-tRNA synthetase
trpS68tryptophanyl-tRNA synthetase
tyrS391tyrosyl-tRNA synthetase 1
valS2250valyl-tRNA synthetase
Degradation of proteins, peptides, and glycopeptides
gcp294O-sialoglycoprotein endopeptidase
htrA2205exported serine protease
pepA 394glutamyl aminopeptidase
pepC 1948aminopeptidase C
pepDA249dipeptidase
pepDB1601dipeptidase
pepF 1784oligoendopeptidase F
pepM601mathionine aminopeptidase
pepN 304aminopeptidase N
pepO 1867neutral endopeptidase
pepP 691aminopeptidase P
pepQ1698proline dipeptidase
pepT 1878tripeptidase
pepV 861dipeptidase
pepXP 2136X-prolyl dipeptidyl aminopeptidase
yueE2049protease
yueF2050protease
yugD2069protease
yuhB2071protease
Protein modification
def555polypeptide deformylase
pknB1956serine/threonine protein kinase
pmpA1782protein maturation protein
pmsR2085peptide methionine sulfoxide reductase
pmsX1594peptide methionine sulfoxide reductase
ppiA369peptidyl-prolyl cis-trans isomerase
ppiB914peptidyl-prolyl cis-trans isomerase
pppL1957protein serine/threonine phosphatase
ptpL2284protein-tyrosine phosphatase
ytaD1905protein-tyrosine phosphatase
Ribosomal proteins: synthesis and modification
prmA105methyltransferase
rplA207950S ribosomal protein L1
rplB216850S ribosomal protein L2
rplC217050S ribosomal protein L3
rplD216950S ribosomal protein L4
rplE216450S ribosomal protein L5
rplF216250S ribosomal protein L6
rplI75350S ribosomal protein L9
rplJ130250S ribosomal protein L10
rplK208050S ribosomal protein L11
rplL130150S ribosomal protein L7/L12
rplM234750S ribosomal protein L13
rplN216550S ribosomal protein L14
rplO216050S ribosomal protein L15
rplP216650S ribosomal protein L16
rplQ215250S ribosomal protein L17
rplR216150S ribosomal protein L18
rplS89850S ribosomal protein L19
rplT191150S ribosomal protein L20
rplU109150S ribosomal protein L21
rplV216750S ribosomal protein L22
rplW216950S ribosomal protein L23
rplX216550S ribosomal protein L24
rpmA109150S ribosomal protein L27
rpmB19650S ribosomal protein L28
rpmC216650S ribosomal protein L29
rpmD216050S ribosomal protein L30
rpmE164050S ribosomal protein L31
rpmF9650S ribosomal protein L32
rpmGA66250S ribosomal protein L33
rpmGB9650S ribosomal protein L33
rpmGC217550S ribosomal protein L33
rpmH13450S ribosomal protein L34
rpmI191250S ribosomal protein L35
rpmJ215450S ribosomal protein L36
rpsA85430S ribosomal protein S1
rpsB222830S ribosomal protein S2
rpsC216630S ribosomal protein S3
rpsD28430S ribosomal protein S4
rpsE216130S ribosomal protein S5
rpsF227530S ribosomal protein S6
rpsG235530S ribosomal protein S7
rpsH216230S ribosomal protein S8
rpsI234730S ribosomal protein S9
rpsJ217030S ribosomal protein S10
rpsK215330S ribosomal protein S11
rpsL235530S ribosomal protein S12
rpsM215430S ribosomal protein S13
rpsN216430S ribosomal protein S14
rpsN291130S ribosomal protein S14
rpsO195530S ribosomal protein S15
rpsP161130S ribosomal protein S16
rpsQ216530S ribosomal protein S17
rpsR227430S ribosomal protein S18
rpsS216730S ribosomal protein S19
rpsT179730S ribosomal protein S20
rpsU23730S ribosomal protein S21
ycjC293acetyltransferase
ycjD293acetyltransferase
yhdC740acetyl transferase
yhjG798acetyl transferase
ylxQ776probable ribosomal protein
fmt1962methyonyl-tRNA formyltransferase
gatA166Glu-tRNA amidotransferase subunit A
gatB168Glu-tRNA amidotransferase subunit B
gatC165Glu-tRNA amidotransferase subunit C
ksgA690kasugamycin dimethyltransferase
miaA615tRNA isopentenyltransferase
pth17peptidyl-tRNA hydrolase
rnpA132ribonuclease P protein component
tgt156queuine tRNA-ribosyltransferase
trmH1942tRNA-guanosine methyltransferase
trmU853tRNA-methyltransferase
truA485tRNA pseudouridine synthase A
truB1141tRNA pseudouridine synthase B
Translation factors
efp692elongation factor P
frr2087ribosome recycling factor
fusA2353elongation factor G
infA2154translation initiation factor IF-1
infB777translation initiation factor IF-2
infC1912translation initiation factor IF-3
lepA1118GTP-binding protein LepA
prfA586peptide chain release factor 1
prfB999peptide chain release factor 2
prfC352peptide chain release factor 3
rbfA779ribosome-binding factor A
tsf2227elongation factor Ts
tuf1930elongation factor Tu
TRANSPORT AND BINDING PROTEINS
General
ecsA2075ABC transporter ATP binding protein
ecsB2074ABC transporter permease protein
mscL2171large conductance mechanosensitive

 channel protein
yabE16ABC transporter ATP-binding protein
yahG74ABC transporter ATP binding protein
yaiE87transporter
yajA90transporter
ybaB102ABC transporter ATP binding protein
ycfB251ABC transporter ATP binding protein
ycfC252ABC transporter permease protein
ycfI260ABC transporter ATP binding protein
ycgA261ABC transporter ATP binding protein
ycgB262ABC transporter ATP binding protein
ychD276ABC transporter ATP-binding protein
ychE277ABC transporter ATP-binding protein
ychF278ABC transporter permease protein
ydaG310ABC transporter ATP binding and permease

 protein
ydbA312ABC transporter ATP binding and permease

 protein
ydcE325ABC transporter ATP binding protein
ydcF326ABC transporter permease protein
ydiA382permease
yfcA520ABC transporter ATP binding protein
yfcB521ABC transporter permease protein
yfgE563ABC transporter ATP binding protein
yfgF564ABC transporter permease protein
ygfA652ABC transporter ATP-binding protein
ygfB653ABC transporter permease protein
yhcA721ABC transporter ATP-binding and permease

 protein
yiiF886transporter
yijC894ABC transporter permease protein
yijD895ABC transporter ATP binding protein
yjcA921ABC transporter ATP binding protein
yjjC993ABC transporter ATP-binding protein
yjjD994ABC transporter permease protein
yjjF996transporter
ykhF1074ABC transporter ATP binding protein
ylbA1111ABC transporter ATP-binding protein
ylbB1113ABC transporter permease protein
yljI1199permease
ymeB1245ABC transporter ATP binding protein
ynaC1304ABC transporter ABC binding and permease

 protein
ynaD1306ABC transporter ABC binding and permease

 protein
yngB1364fibronectin-binding protein
ypgD1564ABC transporter ATP binding and permease

 protein
ypjG1597ABC transporter ATP binding protein
yrjE1794transport permease
ysaB1808ABC transporter permease and substrate

 binding protein
ysaC1809ABC transporter ATP-binding protein
ysdA1834ABC transporter permease protein
ysdB1835ABC transporter ATP binding protein
ysfB1852ABC transporter ATP-binding protein
ysiA1882transport protein
ysiB1883permease
ytaB1902transport protein
yteD1945transmembrane efflux protein
yudA2031transport protein
yujD2097ABC transporter ATP binding protein
ywiG2285ABC transporter ATP binding protein
ywiH2286ABC transporter permease protein
yxaA2300permease
yxdG2340transporter
yxeB2349ABC transporter ATP-binding protein
yxfA2358transporter
Amino acids, peptides and amines
arcD12112arginine/ornitine antiporter
arcD22107arginine/ornitine antiporter
brnQ685branched chain amino acid permease
busAA 1475betaine ABC transporter ATP binding

 protein
busAB 1474betaine ABC transporter permease and

 substrate binding protein
choQ865choline ABC transporter ATP binding protein
choS867choline ABC transporter permease and

 substrate binding protein
ctrA113cationic amino acid transporter
dtpT 705di-/tripeptide transporter
gadC 1326glutamate-gamma-aminobutyrate antiporter
glnP1818glutamine ABC transporter permease and

 substrate binding protein
glnQ1819glutamine ABC transporter ATP-binding

 protein
gltP1856glutamate ABC transporter permease protein
gltQ1855glutamate ABC transporter ATP-binding

 protein
gltS559glutamate or arginine ABC transporter

 substrate binding protein
lysP2277lysine specific permease
lysQ370lysine specific permease
oppA 1906oligopeptide ABC trasporter substrate

 binding protein
oppB 1908oligopeptide ABC trasporter permease

 protein
oppC 1907oligopeptide ABC trasporter permease

 protein
oppD 1910oligopeptide ABC trasporter ATP binding

 protein
oppF 1909oligopeptide ABC trasporter ATP binding protein
optA346oligopeptide ABC transporter substrate

 binding protein
optB347oligopeptide ABC trasporter permease

 protein
optC348oligopeptide ABC trasporter permease

 protein
optD349oligopeptide ABC trasporter ATP binding

 protein
optF350oligopeptide ABC trasporter ATP binding

 protein
optS344oligopeptide ABC transporter substrate

 binding protein
potA1176spermidine/putrescine ABC transporter

 ATP-binding protein
potB1177spermidine/putrescine ABC transporter

 permease protein
potC1178spermidine/putrescine ABC transporter

 permease protein
potD1179spermidine/putrescine ABC transporter

 substrate binding protein
yagE70amino acid permease
ydcB322amino acid ABC transporter ATP binding

 protein
ydcC323amino acid ABC trasporter permease protein
ydgB361amino acid permease
ydgC362amino acid permease
yfcG525peptide-binding protein
yibG819amino acid permease
yjgC963amino acid ABC transporter substrate

 binding protein
yjgD964amino acid ABC transporter permease

 protein
yjgE965amino acid ABC transporter ATP binding

 protein
ylcA1121amino acid permease
yqfD1655amino acid permease
yrfD1756amino acid antiporter
yshA1876amino acid permease
ysjA1891amino acid permease
yvdF2138amino acid ABC transporter substrate

 binding protein
Anions
phnA2332alkylphosphonate uptake protein
phnB299phosphonate ABC transporter permease

 protein
phnC298phosphonate ABC transporter ATP-binding

 protein
phnE299phosphonate ABC transporter permease

 protein
pstA1772phosphate ABC transporter ATP binding

 protein
pstB1772phosphate ABC transporter ATP binding

 protein
pstC1773phosphate ABC transporter permease

 protein
pstD1774phosphate ABC transporter permease

 protein
pstE1775phosphate ABC transporter substrate

 binding protein
pstF1776phosphate ABC transporter substrate

 binding protein
yafB52sulfate transporter
Carbohydrates, organic alcohols and acids
glpF1248glycerol uptake facilitator
glpF21270glycerol uptake facilitator
glpT549glycerol-3-phosphatase transporter
gntP2266gluconate permease
lacS2063lactose permease
maeP1205malate permease
malE1740maltose ABC transporter substrate binding

 protein
malF1741maltose ABC transporter permease protein
malG1742maltose ABC transporter permease protein
mleP 924malate transporter
msmK428multiple sugar ABC transporter ATP-binding

 protein
rbsA1685ribose ABC transporter ATP binding protein
rbsB1683ribose ABC transporter substrate binding

 protein
rbsC1684ribose ABC transporter permease protein
rbsD1686ribose ABC transporter permease protein
rgpC204polysaccharide ABC transporter permease

 protein
rgpD205polysaccharide ABC transporter ATP-binding

 protein
tagG939teichoic acid ABC transporter permease

 protein
tagH938teichoic acid ABC transporter ATP binding

 protein
uxuT1676Na-galactoside symporter
xylT1542D-xylose proton-symporter
xynT1546xyloside transporter
yngE1366sugar ABC transporter ATP binding protein
yngF1368sugar ABC transporter permease protein
yngG1369sugar ABC transporter permease protein
ypbD1515sugar transport symporter
ypcG1528sugar ABC transporter substrate binding

 protein
ypcH1529sugar ABC transporter permease protein
ypdA1530sugar ABC transporter substrate binding

 protein
yqgE1668transporter
yvdD2134transporter
Cations
amtB1636ammonium transporter
cadA97cadmium efflux ATPase
copA847copper/potassium-transporting ATPase
copB872copper-potassium transporting ATPase B
feoA192ferrous ion transport protein A
feoB191ferrous ion transport protein B
fhuB328ferrichrome ABC transporter permease

 protein
fhuC327ferrichrome ABC transporter ATB binding

 protein
fhuD330ferrichrome ABC transporter substrate

 binding protein
fhuG329ferrichrome ABC transporter permease

 protein
kupA610potassium uptake protein
kupB613potassium uptake protein
mgtA1287cation-transporting P-ATPase
mtsA1350manganese ABC transporter substrate

 binding protein
mtsB1351manganese ABC transporter ATP binding

 protein
mtsC1351manganese ABC transporter permease

 protein
nah1994Na+/H+ antiporter
pacL677cation-transporting ATPase
ydaE308cation transporter
yddA332transporter
ydiF388Na+/H+ antiporter
yfgQ570cation-transporting ATPase
ygfE657divalent cation transport-related protein
yieF846mercuric reductase
yjdJ937potassium channel protein
yliI1190cation-transporting ATPase
yndG1337metal ABC transporter substrate binding

 protein
yoaB1404cation-transporting ATPase
yogJ1468cation transporter
ypbB1512cationic transporter
yqeI1650cation transport protein
yqgG1664cation transport ATPase
ysdE1838cation transporter
ytjB1990manganese transporter
yuiA2081metal transporting ATPase
yxdC2336cation-transporting ATPase
zitP2183zinc ABC transporter permease protein
zitQ2183zinc ABC transporter ATP binding protein
zitS2184zinc ABC transporter substrate binding

 protein
Nucleosides, purines and pyrimidines
pbuX1161xanthine permease
pnuC1856nicotinamide mononucleotide transporter
pnuC2901nicotinamide mononucleotide transporter
pyrP1647uracil permease
PTS system
celB178cellobiose-specific PTS system IIC

 component
fruA986fructose-specific PTS system enzyme IIBC

 component
mtlA29mannitol-specific PTS system IIBC

 component
mtlF33mannitol-specific PTS system IIA component
ptbA1492beta-glucoside-specific PTS system IIABC

 component
ptcA419cellobiose-specific PTS system IIA

 component
ptcB419cellobiose-specific PTS system IIB

 component
ptcC421cellobiose-specific PTS system IIC

 component
ptnAB1763mannose-specific PTS system component

 IIAB
ptnC1764mannose-specific PTS system component IIC
ptnD1765mannose-specific PTS system component IID
ptsH120phosphocarrier protein Hpr
ptsI122phosphoenolpyruvate-protein

 phosphotransferase
ptsK605Hpr(Ser) kinase
yedF439beta-glucoside-specific PTS system IIABC

 component
yidB832cellobiose-specific PTS system IIC

 component
yleD1146sucrose-specific PTS system IIBC component
yleE1147beta-glucoside-specific PTS system IIABC

 component
Multidrug resistance
blt126multidrug efflux transporter
cydC711cytochrome D ABC transporter ATP binding

 and permease protein
cydD713cytochrome D ABC transporter ATP binding

 and permease protein
lcnC 84lactococcin A ABC transporter ATP binding

 and permease protein
lcnD 85lactococcin A ABC transporter permease

 protein
lmrA 717multidrug resistance ABC transporter ATP

 binding and permease protein
lmrP2242integral membrane protein LmrP
napC306multidrug-efflux transporter
pmrA661multidrug resistance efflux pump
pmrB130multidrug resistance efflux pump
ybfD158transporter
ycdH236transporter
ydiC385efflux pump antibiotic resistance protein
yfjF599membrane-bound transport protein
yjdE933multidrug resistance protein
yniG1386drug-export protein
ypfE1557transport protein
ypiB1582transporter
yqiA1682multidrug transporter
yweA2240membrane protein
yxbD2319transporter
OTHER CATEGORIES
Adaptations and atypical conditions
arsC1412arsenate reductase
clpB1568ClpB protein
clpC631ATP-dependent protease ATP-binding

 subunit
clpE552ATP-dependent protease ATP-binding

 subunit
clpP673ATP-dependent Clp protease proteolytic

 subunit
clpX1164ATP dependent Clp protease
cpo835non-heme chloride peroxidase
cspD517cold shock protein D
cspE173cold shock protein E
cstA414carbon starvation protein
ctsR630transctiptional regulator CtsR
dinF172damage-inducible protein DinF
dinP2105DNA-damage-inducible protein P
dpsA2102non-heme iron-binding ferritin
grpE978stress responce protein GrpE
hrcA977heat-inducible transcription repressor HrcA
osmC69osmotically inducible protein
phoL1094phosphate starvation inducible protein
tpx302thiol peroxidase
ybjA193reductase
yjbE915general stress protein GSP13
Drug and analog sensitivity
bacA2276undecaprenol kinase
bar1837acyltransferase
pacA1150penicillin acylase
pacB1904penicillin acylase
ymdC1234kanamycin kinase
Phage related functions and prophages
pi101448prophage pi1 protein 01, integrase
pi102449prophage pi1 protein 02
pi103 450prophage pi1 protein 03, transcriptional

 regulator
pi104450prophage pi1 protein 04, transcriptional

 regulator
pi105451prophage pi1 protein 05
pi106451prophage pi1 protein 06
pi107451prophage pi1 protein 07
pi108452prophage pi1 protein 08
pi109452prophage pi1 protein 09
pi110453prophage pi1 protein 10, transcriptional

 regulator
pi111453prophage pi1 protein 11, recombinase
pi112454prophage pi1 protein 12
pi113455prophage pi1 protein 13, replisome

 organiser
pi114456prophage pi1 protein 14, DNA replication

 protein
pi115457prophage pi1 protein 15
pi116457prophage pi1 protein 16
pi117458prophage pi1 protein 17
pi118458prophage pi1 protein 18
pi119459prophage pi1 protein 19
pi120459prophage pi1 protein 16, deoxyuridine

 5'-triphosphate nucleotidohydrolase
pi121459prophage pi1 protein 21
pi122460prophage pi1 protein 22
pi123460prophage pi1 protein 23
pi124460prophage pi1 protein 24
pi125461prophage pi1 protein 25
pi126461prophage pi1 protein 26
pi127461prophage pi1 protein 27
pi128462prophage pi1 protein 28
pi129462prophage pi1 protein 29
pi130462prophage pi1 protein 30
pi131463prophage pi1 protein 31
pi132464prophage pi1 protein 32
pi133464prophage pi1 protein 33, terminase small

 subunit
pi134465prophage pi1 protein 34, terminase large

 subunit
pi135467prophage pi1 protein 35
pi136468prophage pi1 protein 36, prohead protease
pi137469prophage pi1 protein 37, capsid protein
pi138470prophage pi1 protein 38
pi139470prophage pi1 protein 39
pi140470prophage pi1 protein 40, tail component
pi141471prophage pi1 protein 41, tail component
pi142471prophage pi1 protein 42, small major

 structural protein
pi143472prophage pi1 protein 43
pi144474prophage pi1 protein 44, tail component
pi145477prophage pi1 protein 45, tail component
pi146480prophage pi1 protein 46, tail component
pi147482prophage pi1 protein 47
pi148482prophage pi1 protein 48, holin
pi149483prophage pi1 protein 49, muramidase
pi2011037prophage pi2 protein 01, integrase
pi2021038prophage pi2 protein 02
pi2031039prophage pi2 protein 03
pi2041039prophage pi2 protein 04 hypothetical

 protein
pi2051040prophage pi2 protein 05
pi2061040prophage pi2 protein 06
pi2071041prophage pi2 protein 07
pi2081041prophage pi2 protein 08
pi2091042prophage pi2 protein 09
pi2101042prophage pi2 protein 10
pi2111043prophage pi2 protein 11, topoisomerase
pi2121043prophage pi2 protein 12, single strand

 binding protein
pi2131044prophage pi2 protein 13, replisome

 organiser
pi2141045prophage pi2 protein 14
pi2151045prophage pi2 protein 15
pi2161046prophage pi2 protein 16
pi2171046prophage pi2 protein 17
pi2181047prophage pi2 protein 18
pi2191048prophage pi2 protein 19
pi2201048prophage pi2 protein 20 hypothetical

 protein
pi2211049prophage pi2 protein 21, deoxyuridine

 5'-triphosphate nucleotidohydrolase
pi2221049prophage pi2 protein 22
pi2231049prophage pi2 protein 23
pi2241050prophage pi2 protein 24
pi2251051prophage pi2 protein 25
pi2261051prophage pi2 protein 26
pi2271052prophage pi2 protein 27
pi2281052prophage pi2 protein 28
pi2291053prophage pi2 protein 29
pi2301054prophage pi2 protein 30, terminase
pi2311055prophage pi2 protein 31
pi2321056prophage pi2 protein 32
pi2331057prophage pi2 protein 33, capsid protein
pi2341058prophage pi2 protein 34
pi2351059prophage pi2 protein 35
pi2361059prophage pi2 protein 36
pi2371059prophage pi2 protein 37
pi2381060prophage pi2 protein 38
pi2391060prophage pi2 protein 39
pi2401061prophage pi2 protein 40
pi2411061prophage pi2 protein 41
pi2421062prophage pi2 protein 42
pi2431064prophage pi2 protein 43
pi2441065prophage pi2 protein 44
pi2451068prophage pi2 protein 45
pi2461069prophage pi2 protein 46
pi2471069prophage pi2 protein 47
pi2481070prophage pi2 protein 48
pi2491070prophage pi2 protein 49
pi2501070prophage pi2 protein 50
pi2511071prophage pi2 protein 51, holin
pi2521071prophage pi2 protein 52, muramidase
pi3011414prophage pi3 protein 01
pi3021415prophage pi3 protein 02
pi3031415prophage pi3 protein 03
pi3041416prophage pi3 protein 04
pi3051416prophage pi3 protein 05, muramidase
pi3061417prophage pi3 protein 06, holin
pi3071418prophage pi3 protein 07
pi3081419prophage pi3 protein 08
pi3091420prophage pi3 protein 09
pi3101421prophage pi3 protein 10
pi3111422prophage pi3 protein 11
pi3121424prophage pi3 protein 12
pi3131425prophage pi3 protein 13, tail component
pi3141428prophage pi3 protein 14
pi3151431prophage pi3 protein 15
pi3161431prophage pi3 protein 16, tail component
pi3171432prophage pi3 protein 17, major tail protein
pi3181433prophage pi3 protein 18, tail component
pi3191433prophage pi3 protein 19, tail component
pi3201433prophage pi3 protein 20, head-tail joining

 protein
pi3211434prophage pi3 protein 21
pi3221435prophage pi3 protein 22, major head

 protein precursor
pi3231436prophage pi3 protein 23, ATP dependent

 Clp protease
pi3241436prophage pi3 protein 24
pi3251437prophage pi3 protein 25, head-tail joining

 protein
pi3261438prophage pi3 protein 26, terminase large

 subunit
pi3271439prophage pi3 protein 27, terminase small

 subunit
pi3281440prophage pi3 protein 28
pi3291440prophage pi3 protein 29
pi3301441prophage pi3 protein 30
pi3311441prophage pi3 protein 31
pi3321442prophage pi3 protein 32
pi3331443prophage pi3 protein 33
pi3341443prophage pi3 protein 34
pi3351443prophage pi3 protein 35, deoxyuridine

 5'-triphosphate nucleotidohydrolase
pi3361444prophage pi3 protein 36
pi3371444prophage pi3 protein 37
pi3381445prophage pi3 protein 38
pi3391445prophage pi3 protein 39
pi3401446prophage pi3 protein 40
pi3411446prophage pi3 protein 41
pi3421446prophage pi3 protein 42
pi3431447prophage pi3 protein 43
pi3441447prophage pi3 protein 44
pi3451447prophage pi3 protein 45
pi3461448prophage pi3 protein 46, DNA replication

 protein
pi3471449prophage pi3 protein 47, replisome

 organiser
pi3481450prophage pi3 protein 48, single strand

 binding helix destabilising protein
pi3491450prophage pi3 protein 49
pi3501451prophage pi3 protein 50
pi3511451prophage pi3 protein 51
pi3521452prophage pi3 protein 52
pi3531452prophage pi3 protein 53
pi3541452prophage pi3 protein 54
pi3551453prophage pi3 protein 55, antirepressor
pi3561453prophage pi3 protein 56, cro-like repressor
pi3571454prophage pi3 protein 57, cI-like repressor
pi3581455prophage pi3 protein 58
pi3591455prophage pi3 protein 59
pi3601456prophage pi3 protein 60, integrase
pip 1720phage infection protein
ps10136prophage ps1 protein 01, hypothetical

 regulator
ps10236prophage ps1 protein 02
ps10337prophage ps1 protein 03, terminase subunit
ps10437prophage ps1 protein 04
ps10538prophage ps1 protein 05, DNA primase
ps10640prophage ps1 protein 06
ps10740prophage ps1 protein 07
ps10841prophage ps1 protein 08
ps10941prophage ps1 protein 09
ps11041prophage ps1 protein 10
ps11142prophage ps1 protein 11, transcriptional

 regulator
ps11242prophage ps1 protein 12
ps11342prophage ps1 protein 13
ps11443prophage ps1 protein 14
ps11544prophage ps1 protein 15, transcriptional

 regulator
ps11644prophage ps1 protein 16
ps11745prophage ps1 protein 17
ps11845prophage ps1 protein 18
ps11945prophage ps1 protein 19
ps12046prophage ps1 protein 20
ps12147prophage ps1 protein 21
ps12248prophage ps1 protein 22
ps12349prophage ps1 protein 23, integrase
ps201503prophage ps2 protein 01, integrase
ps202504prophage ps2 protein 02
ps203505prophage ps2 protein 03
ps204505prophage ps2 protein 04
ps205506prophage ps2 protein 05, transcriptional

 repressor
ps206506prophage ps2 protein 06
ps207506prophage ps2 protein 07, excisionase
ps208507prophage ps2 protein 08
ps209507prophage ps2 protein 09
ps210507prophage ps2 protein 10
ps211508prophage ps2 protein 11
ps212508prophage ps2 protein 12
ps213508prophage ps2 protein 13
ps214509prophage ps2 protein 14
ps215510prophage ps2 protein 15
ps216511prophage ps2 protein 16
ps218512prophage ps2 protein 18
ps219512prophage ps2 protein 19
ps220513prophage ps2 protein 20
ps221514prophage ps2 protein 21
ps3012014prophage ps3 protein 01
ps3022015prophage ps3 protein 02
ps3032015prophage ps3 protein 03
ps3042016prophage ps3 protein 04
ps3052016prophage ps3 protein 05
ps3062018prophage ps3 protein 06
ps3072019prophage ps3 protein 07
ps3082019prophage ps3 protein 08
ps3092019prophage ps3 protein 09
ps3102020prophage ps3 protein 10
ps3112021prophage ps3 protein 11
ps3122022prophage ps3 protein 12
ps3132023prophage ps3 protein 13
ps3142023prophage ps3 protein 14,

 transcriptional regulator
ps3152024prophage ps3 protein 15
ps3162025prophage ps3 protein 16, integrase
Transposon related functions
tra1077A 53transposase of IS1077A
tra1077B 140transposase of IS1077B
tra1077C 375transposase of IS1077C
tra1077D 628transposase of IS1077D
tra1077E 838transposase of IS1077E
tra1077F 2156transposase of IS1077F
tra1077G 2217transposase of IS1077G
tra904A 54transposase of IS904A
tra904B 138transposase of IS904B
tra904C 40transposase of IS904C
tra904D 374transposase of IS904D
tra904E 627transposase of IS904E
tra904F 836transposase of IS904F
tra904G 839transposase of IS904G
tra904H 2155transposase of IS904H
tra904I 2215transposase of IS904I
tra905 1225transposase of IS905
tra981A 92transposase of IS981A
tra981B 93transposase of IS981B
tra981C 651transposase of IS981C
tra981D 729transposase of IS981D
tra981E 1217transposase of IS981E
tra981F 1222transposase of IS981F
tra981G 1276transposase of IS981G
tra981H 1586transposase of IS981H
tra981I 1748transposase of IS981I
tra981J 2103transposase of IS981J
tra982 640transposase of IS982
tra983A682transposase of IS983A
tra983B707transposase of IS983B
tra983C958transposase of IS983C
tra983D1338transposase of IS983D
tra983E1396transposase of IS983E
tra983F1556transposase of IS983F
tra983G1755transposase of IS983G
tra983H1954transposase of IS983H
tra983I1978transposase of IS983I
tra983J2012transposase of IS983J
tra983K2017transposase of IS983K
tra983L2084transposase of IS983L
tra983M2148transposase of IS983M
tra983N2203transposase of IS983N
tra983O2268transposase of IS983O
yafG53hypothetical protein
yafI55hypothetical protein
yajE92transposase
yajG94transposase
ybdK138hypothetical protein
ybdL139hypothetical protein
ybeG141hypothetical protein
ydhD373hypothetical protein
ydhE375hypothetical protein
yfjB593transposon-related protein
ygcD628hypothetical protein
ygcE629hypothetical protein
ygfF651transposase
yhcJ729transposase
yidF837hypothetical protein
yidG838hypothetical protein
yidH839hypothetical protein
ymbA1212integrase
ymbI1217transposase
ymcD1222transposase
ymfD1259integrase-recombinase
ymhB1276transposase
ypiI1587transposase
yrdA1748transposase
yuiI2104transposase
yvfC2157hypothetical protein
yvfD2156hypothetical protein
yvjF2216hypothetical protein
ywbC2217hypothetical protein
Other
crtK574carotenoid biosynthetic protein CrtK
yebB412carotenoid biosynthetic protein

[i] #Gene symbols, coordinates in kb, and definitions are shown. Bold italic symbols correspond to gene functions experimentally confirmed in L. lactis.

Origin and Terminus of Replication

Approximate position of the replication origin and terminus of theL. lactis chromosome was determined previously, using the GC and AT skews (Fig. 1; Bolotin et al. 1999). It should be noted that the precision of the origin mapping is greater than that of the terminus, as there are conserved elements (dnaA and dnaN genes, DnaA boxes) in the vicinity of the former but not of the latter (rtp gene was not found). We choose as the coordinate 1 of the genome the middle of aHindIII site localized near the replication origin (Fig 1).

Figure 1.

Distribution of IS elements and prophages in the IL1403 chromosome. Outer circle shows the scale in basepairs. IS981, IS983, and IS1077 are shown by yellow, red, and blue squares, respectively (enlarged for clarity). Red and blue arcs show the areas of IS983 and IS1077 insertions. pi prophages are shown in violet, and ps in black. Green circle shows GC skews (C−G/C+G) distribution (window 20 kb, step 5 kb), which indicates the origin (near bp 1) and terminus (near bp 1,260,000) of replication.

5f1_C4TT

RNA, IS Elements, and Prophages

Location of six rRNA operons, 62 tRNA genes, the RNA component of RNAse P gene (rnpB), and the 10S RNA (ssrA) were determined earlier from the diagnostic sequence (Bolotin et al. 1999). There are six different IS elements in the IL1403 chromosome: IS981, IS982, IS983, IS904, IS905, and IS1077, present in 10, 1, 15, 9, 1, and 7 copies, respectively (Fig. 1) and totaling 42 kb. It is remarkable that one or two copies of IS904 always accompany IS1077 and that the relative orientation of the two is generally not the same. The former element might be a satellite of the latter. Another remarkable feature is that three of the IS elements are not randomly distributed over the chromosome (Fig. 1). Seven copies of IS1077 (and the associated IS904) occupy the region between 2150 and 840 kb, encompassing the replication origin, whereas 15 copies of IS983 occupy a different region, between 680 and 2270 kb. The two regions overlap by only ∼150 kb. As the 10 copies of IS981 are distributed over the whole genome, the uneven distribution of three other IS elements is not caused by a particular property of the L. lactis cell. We suggest that this distribution indicates a lateral transfer of a large portion of the genome from a lactococcus donor, carrying one type of IS, to a recipient, carrying the other type. Two lines of evidence lend support to this hypothesis. First, IS1076, which corresponds to the association of IS1077 and IS904 described above, is distributed over the whole genome of the strain L. lactis ssp. cremoris MG1363 (Le Bourgeois et al. 1995) rather than being restricted to one region of the genome, as is the case in IL1403. This transposon has, therefore, no particular hot region for insertion in the lactococcal genome. Second, the restriction map of another strain, L. lactis ssp. lactis DL11, coincides with that of IL1403 in the area between rrnF (550 kb) and rrnE (1980 kb), while it is divergent elsewhere (Le Bourgeois et al. 1992). We suggest that DL11 may be close to one of the putative parental strains of IL1403. Investigations of the distribution of IS1077 and IS983 among different lactococci might allow identification of both putative parents of the IL1403 strain.

Three potential prophages, designated pi1, pi2, and pi3, were detected at positions near 460, 1050, and 1460 kb (Fig. 1). They are large (35–44 kb), encode 49–60 proteins, and are related to known temperate phages of L. lactis. Another three prophages, designated ps1, ps2, and ps3, are localized near 42, 509, and 2020 kb (Fig. 1). They are small (11–15 kb), encode only 16–23 proteins and might be satellites of the other phages, as they lack most of the genes that code for phage structural elements. A copy of IS983 is present in ps3, which might, thus, be defective. The six prophages comprise a total of 175 kb of DNA and 221 protein coding genes. Recently, Chopin et al. (2001) characterized five phages, which can be found in the supernatant of IL1403 after mitomycin C treatment, and demonstrated the correspondence between the phage DNA extracted from the supernatant and the chromosome sequence. Phage bIL285 from the supernatant corresponds to pi2, bIL286 to pi3, bIL309 to pi1, bIL310 to ps1, and bIL312 to ps2. ps3, designated also as bIL311 (Chopin et al. 2001), cannot be induced, probably because of the IS983 element present in its genome. Detecting the circular forms of DNA of these phages allowed precise determination of the integration sites. About 9.2% of the L. lactis genome is thus formed by IS elements and prophages, suggesting that they may be important for horizontal gene transfer in these bacteria.

Paralogous Gene Families

We define here as a paralogous protein family a group of proteins within which each protein shares at least one homologous domain with another protein of the group. By this criterion, there are 370 paralogous families, comprising 1189 gene products, in the L. lactis genome. Among the smaller families (<10 members) there are 208 of two members, 80 of three, 36 of four, 13 of five, 13 of six, 8 of seven, 4 of eight, and 2 of nine. The larger families contain 10, 11, 15, 18, 26, and 60 members, the last corresponding to ATP-binding proteins of ABC transporters, as is the case in many bacteria. In the four smallest families, distribution of the number of proteins resembles that of B. subtilis (Kunst et al. 1997). It decreases, very approximately, twofold when the family member count increases by one (568:273:168:100 in B. subtilisand 416:240:144:65 in L. lactis for doublets, triplets, quadruplets, and quintuplets, respectively).

Information Processing and Gene Regulation

Information processing refers to the genes constituting replication, transcription, and translation machinery. In L. lactis, it is overall very similar to that of B. subtilis, the best characterized AT-rich gram-positive bacterium (Kunst et al. 1997). There are 67 genes involved in DNA metabolism in L. lactis. All the genes involved in DNA replication in B. subtilis are present in L. lactis, including counterparts of dnaB,dnaD, and dnaI, genes essential for initiation of replication in B. subtilis and absent in gram-negative bacteria. Two DNA-polymerase III α-chain genes, one corresponding topolC and another to dnaE of B. subtilis, were also detected in L. lactis. In contrast, E. colihas only the dnaE gene.

Transcription machinery in both L. lactis and B. subtilis comprises some 30 genes other than the ς-factors. However, the number of ς-factors differs greatly, as there are only three in L. lactis, while there are 18 in B. subtilis, pointing to a considerable difference in the mode of gene-expression regulation in the two organisms. Translation machinery comprises 119 genes in L. lactis and 131 genes in B. subtilis. There are no duplicated aminoacyl-tRNA synthetase genes in L. lactis, while there are three (for threonine, tyrosine, and histidine) in B. subtilis. Posttranslational protein modification genes mostly differ, as there are 27 such genes in B. subtilis and only 10 in L. lactis. A particular regulation of translation might also operate in L. lactis. As discussed more fully below, all the late competence genes of L. lactisseem to be controlled by a mechanism relaying on leaderless mRNAs and, thus, on a particular mode of translation. Recent evidence shows that the involvement of translation initiation factor 3, present in all bacteria, in start codon recognition is important for restriction of translation in such systems (Tedin et al. 1999). This provides a link between regulation of translation and competence in L. lactis. Such interaction has not been detected previously.

Analysis of homology allowed us to assign regulatory functions to 138 genes, half of which were classified further by their similarity to regulatory proteins of known families. The overall number of regulatory systems is about twofold lower in L. lactis than in B. subtilis, but the proportion of these genes is similar in the two organisms. Among the interesting differences is a much lower number of the two-component signal transducers in L. lactis than inB. subtilis (eight instead of 34) and of ς-factors (three instead of 18), both of which regulate complex responses to changing environmental conditions.

Energy Metabolism and Transporters

The most important industrial applications of L. lactis are based on its energy metabolism, which leads mainly to the production of high amounts of lactic acid (homolactic fermentation). Anaerobic glycolysis is the principal energy-generating process in L. lactis, and very little of the fermented sugar (∼5%) is used for synthetic reactions (Poolman 1993). All the genes required for the conversion of the glucose to pyruvate are present in the genome. The pyruvate is converted into lactic acid, thus allowing the oxidation of reduced NAD, and the lactate dehydrogenase gene ldh, essential for this process, was studied intensely (Griffin et al. 1992). Three other genes, highly similar to ldh, (ldhB, ldhX and hicD) are present in the genome, but their role is not known. The product of the last gene has a high similarity (42% identity) to hydroxyisocaproate dehydrogenase and may, therefore, be involved in the catabolism of branched-chain amino acids. Lactate is transported into the growth medium, causing the efflux of protons and, thus, providing transmembrane potential indispensable for growth and energy recycling (Ten Brink et al. 1985).

Genome analysis indicates that the full citric acid cycle, gluconeogenesis enzymes, and many anaplerotic reactions do not exist inL. lactis. Unexpectedly, the functions necessary for aerobic respiration are encoded in the genome. L. lactis hasmen and cytABCD operons, encoding proteins required for menaquinone synthesis and cytochrome d biogenesis. It also has three genes involved in the late steps of heme synthesis (hemH, hemK, and hemN, required for oxidation of porphyrinogen and attachment of iron to heme) but not the genes required for the early steps. L. lactis may thus be able to carry out oxidative phosphorylation if the protoporphyrinogen is provided. Indeed, improved growth properties in media containing hemin were observed for certain Streptococci (Sijpesteijn 1970; Mickelson 1972). The genome analysis thus suggests the existence of aerobic respiration in this bacterium, generally considered an exclusively fermentative microorganism.

Use of L. lactis in the food industry also exploits its ability to form fermentation products other than lactate (mixed acid fermentation). The balance of products depends on activities of enzymes that act on the key metabolite generated by glycolysis, the pyruvate. A number of genes encoding such enzymes (pyruvate dehydrogenase,pdhABCD; α-acetolactate synthase, als; pyruvate-formate lyase, pfl; and lactate dehydrogenase,ldh) have been identified previously in L. lactis and confirmed by genome analysis. We detected a novel gene, poxL, encoding pyruvate oxidase, which also acts on pyruvate and might, therefore, play a role in switching between different fermentation modes.

Besides gene activity, the availability of cofactors, such as NADH and FAD, also affects the balance of different fermentation products. Artificial changing of NADH/NAD ratio in L. lactis can redirect carbon flow from lactic acid to acetoin and diacetyl (Lopez de Felipe et al. 1998). There are more than five NADH dehydrogenase genes in the L. lactis genome, which may affect the type of fermentation products. Some NADH dehydrogenases generate hydrogen peroxide, which is toxic for the cells. L. lactis has no gene encoding catalase, which can remove the toxic H2O2. However, there is a gene encoding thiol peroxidase (tpx) and two genes (ahpC and ahpF) encoding alkyl hyperoxide reductases. These proteins could possibly act on H2O2. Active sodA, encoding superoxid dismutase, which converts oxygen radicals to H2O2, was shown to be important for the oxidative stress response (Sanders et al. 1995). Also, the gshR gene encoding glutathion reductase may be involved in response of L. lactis to the aerobic growth conditions.

The heterofermentative metabolism takes place in L. lactiswhen pentoso-phosphate pathway is active, as in this case, glycolysis generates not only a three-carbon compound that can be converted to lactate but also a two-carbon compound. We detected glucose-6P dehydrogenase (zwf), phosphogluconate dehydrogenase (gnd), and ribuloso-5P epimerase (rpe), which can lead to the formation of xyluloso-5P. Phosphoketolase, encoded byptk gene, can catalyze formation of glyceraldehyde-3P and acetyl-P, which enter the fermentation pathways that yield lactate and ethanol, respectively.

Understanding the molecular basis of the switch between different fermentation types is of interest not only for standard uses of L. lactis but also for the metabolic engineering in this organism, aiming to enhance synthesis of certain metabolites to industrially useful levels. We detected a correlation between the presence of the phosphoenolpyruvate dependent transport system (PTS) and the fermentation profile for a given carbon source. PTS systems for fructose, mannose, sucrose or trehalose, mannitol, and cellobiose are present in the genome, and the homolactic fermentation profiles were reported for growth on fructose, mannose, glucose (which uses mannose or mannitol PTS) and sucrose (Cocaign-Bousquet et al. 1996). In contrast, mixed acid or heterofermentation profiles were observed for growth on galactose, xylose, maltose, gluconate, ribose, and lactose, which are not imported by a PTS system. When L. lactis cells harbor a plasmid encoding lactose-specific PTS system, lactose fermentation becomes homolactic (Gasson 1983). Our genome analysis thus strengthens the proposal that sugar consumption rate, which is the highest when PTS system is available, determines the ability for efficient homolactic fermentation (Cocaign-Bousquet et al. 1996). The correlation of information derived from genome analysis with experimental data on fermentation product distribution indicates that critical parameters regulating the final product balance may be found by a thorough analysis of the carbon source use and transport systems.

Proteases and Amino Acid Catabolism Genes

Proteases and peptidases provide a selective advantage for bacteria growing in milk, as this medium is rich in caseins and relatively poor in free amino acids. Amino acid catabolism has an impact on fermentation regulation and on the flavor of dairy products.

Genome sequence revealed 19 protease-encoding genes (Table 1). These include the membrane protease HtrA, which is responsible for degradation of the precursors of foreign exported proteins (Pouquet et al. 2000). Some 16 peptidases from LAB were characterized previously, including the products of 13 genes detected in L. lactis(Christensen et al. 1999).

Catabolism of amino acids usually starts by deamination. Arginine catabolic genes, organized in an operon near 2110 kb, encode the enzymes for the deaminase pathway as well as the arginine tRNA synthetase, suggesting complex regulation. Another operon for arginine catabolism, near 1755 kb, contains genes arcC3 andotcA. It could have a regulatory function, as it also contains the genes llrH and yrfE, representing a signal transduction system of a new type. Aspartate aminotransferase (aspC) and asparaginase (ansB) are involved in aspartate and asparagine catabolism. No genes for aspartate decarboxylase or aspartase were detected, although such enzymatic activities were identified in Lactobacillus, another prominent group of LAB (Rollan et al. 1985). Recent studies on catabolism and biosynthesis of glutamate in L. lactis identified the existence of a pathway leading to the production of γ-aminobutyrate (GABA; Sanders et al. 1998). We identified gadRCB operon for GABA production, gltBD genes for glutamate synthase, and an operon involved in citric acid metabolism: pycA,gltA, citB, and icd. Under appropriate physiological conditions, products of some of these genes might carry out glutamate catabolism, rather than biosynthesis. Serine can be directly converted to pyruvate by serine dehydratase encoded by thesdaAB operon.

Genome sequence provides inventory of 12 aminotransferases, of which some can initiate degradation of aromatic, branched-chain, and sulfur-containing amino acids, important for cheese flavor. The specificity of seven aminotransferases (aspC, serC, argD, glmS, hisC, aspB, andarcT) can be predicted from sequence comparisons, whereas those of other five (araT, nifZ, yeiG, bcaT, and ytjE) are less obvious. It was recently shown that araT and bcaT are involved in the degradation of aromatic and branched-chain amino acids, respectively (Yvon et al. 2000). The product of ytjE might be specific for methionine, as the gene is cotranscribed with the relevant biosynthesis genes. Degradation of tryptophane seems to proceed via indole aldehyde because of indole pyruvate decarboxylase gene ipd. It is not clear which pathways L. lactis uses to catabolize phenylalanine and tyrosine. It is possible that phenyl pyruvate and p-OH-phenyl pyruvate are degraded further by decarboxylation. This would depend on the specificity of the phenolic acid decarboxylase encoded by pdc.

Amino Acid, Vitamin, and Nucleotide Biosynthesis

L. lactis requires certain metabolites in the growth medium, although it has a genetic potential to synthesize some of them. Synthetic medium for L. lactis should contain at least six amino acids (isoleucine, valine, leucine, histidine, methionine, and glutamic acid) and seven vitamins (biotin, pyridoxal, folic acid, riboflavin, nicotinamide, thiamine, and pantothenic acid; Jensen and Hammer 1993). L. lactis has the genes to synthesize the 20 standard amino acids and at least four cofactors (folic acid, menaquinone, riboflavin, and thioredoxin). One reason for the requirement of the compounds that can potentially be synthesized is that some of the existing genes are not functional, as was reported previously for amino acid biosynthesis genes (Godon et al. 1993). We carefully checked sequencing tracks for the genes that could contain a frameshift mutation and could not rule out the presence of a mutation in 30 of them. This relatively high level of pseudogenes in IL1403 could possibly be, at least in part, caused by the treatments used to cure the parental strain of its plasmids (Chopin et al. 1984).

Milk does not contain sufficient levels of purine compounds to support growth of L. lactis and, therefore, de novo biosynthesis is necessary (Dickely et al. 1995). We detected 57 genes involved in this metabolism. Therefore, physiological and genomic evidence shows thatL. lactis has sufficient and fairly active capacities for biosynthesis and also for salvage of nucleic acid compounds.

Cell Wall Metabolism

Many L. lactis properties that are important for applications, such as phage sensitivity, stress resistance, autolysis, and mucosal immunostimulation, depend on the structure of the cell wall. There are 29 genes encoding enzymes required for the synthesis of the main cell wall component, peptidoglycan. Among these, three encode amino acid racemases: dal for alanine, murI for glutamate, and racD for aspartate. D-alanine and D-glutamate are the components of linear peptide moiety of peptidoglycan, whereas D-aspartate forms cross-bridges. There are no genes for synthesis of modified peptidoglycan, containing D-lactate or D-serine instead of D-alanine, reported for several other LAB.

Cheese ripening can be accelerated by induction of enzymes that process peptidoglycan. There are six genes related to such processing inL. lactis: dacA and dacB, encoding alanine–alanine carboxypeptidase; and acmA, B, C, and D, encoding four lysozymes. Carboxypeptidases alone cannot cause the cell lysis, as their activity does not destabilize the wall. Modulation of the level of their production can, however, influence the action of lysozymes.acmA, responsible for separation of daughter cells, was used for artificial induction of autolysis (Buist et al. 1997).

Lipoteichoic acid is another main component of the L. lactiscell wall. Neither teichoic nor teichuronic acids were detected in this microorganism (Valyasevi et al. 1990). However, there is a cluster of seven tag genes near 950 kb. Only three genes from teichuronic acid biosynthesis pathway were found: ycbK, ycbF, andycbH, corresponding to tuaB, tuaC, andtuaG of B. subtilis. dlt operon, encoding D-alanylation of lipoteichoic acid, is of crucial importance for properties of the cell wall and whole-cell physiology. A knockout mutation in dltD causes filamentous growth and UV sensitivity and facilitates penetrability of the cells (Duwat et al. 1997).

Synthesis of extracellular polysaccharides is important for the industrial use of many LAB, as these polymers affect the texture of the fermented products. There are >20 genes involved in the biosynthesis of such molecules in the region near 200 kb. They encode functions providing activated sugars and other components involved in production of surface or extracellular polysacharide. A plasmid that carries an operon involved in the formation of the repeating unit, linking activated sugar to the lipid carrier, export, and polymerization, was recently identified (Van Kranenburg et al. 1997). Conjunction of plasmid-carried and chromosomal functions presumably determine the amounts and the structure of extracellular polysaccharides.

Protein Secretion

L. lactis has only eight genes identified as implicated in protein secretion. Contrary to B. subtilis and E. coli, this bacterium does not have secDF genes, known to improve the secretion efficiency (Pogliano and Beckwith 1994; Bolhuis et al. 1998). There is only one membrane protease, HtrA, involved in degradation of hybrid exported proteins (Pouquet et al. 2000). GenepmpA (protein maturation protein) encodes a homolog of PrsA from B. subtilis and might be involved in stabilization of secreted proteins by facilitating their folding. L. lactis was shown to secrete up to 20 mg/L of foreign protein with optimized gene constructs (Le Loir et al. 1998). This value could possibly be improved by manipulating the gene expression levels and supplying the missing components of the secretion machinery.

Competence to Genetic Transformation

Natural competence to DNA transformation was not demonstrated inL. lactis. We detected four operons (comE,comF, comC, and comG) containing genes similar to the late competence genes from B. subtilis andS. pneumoniae. In addition, we found a gene for ComX, which is similar to the S. pneumoniae ECF-type ς-factor required for transcription of the competence genes (Lee and Morrison 1999). The regions preceding the first ORF of the four operons resemble competence promoters from S. pneumoniae and might be transcribed by ComX. There are three common sequences in front of all competence operons, two of which, GTTACATT and TTTTCGTATA, are in the −35 and −10 domains of the promoter, while the third, AGTATG, includes the ATG start codon of the first gene in each operon. The relative position of the three conserved elements indicates that all mRNAs start at the ATG codon of the first gene and are, therefore, leaderless, lacking the canonical ribosome-binding site. Search for the consensus sequence over the whole genome, using PatScan (Dsouza et al. 1997), revealed six such promoters other than those of the late competence operons. The genes downstream of these promoters are radA, coiA,dprA, recQ, ssbA, and yqfG. Only the radA gene, encoding a DNA repair protein, has leaderless mRNA. Three of the genes, coiA, dprA, andrecQ, affect DNA transformation in S. pneumoniae,H. influenzae, and B. subtilis, respectively (Karudapuram et al. 1995; Fernandes et al. 1998; Pestova and Morrison 1998). ssbA encodes single-strand DNA-binding protein and could be involved in the processing of transforming DNA, which enter gram-positive bacteria in the single-stranded form. yqfGencodes a protein of unknown function. The existence of the competence-related genes in L. lactis indicates that this bacterium might be naturally transformable by DNA. There are no genes homologous to those involved in early steps of competence development in S. pneumoniae, which indicates that, in L. lactis, the regulation cascade upstream of ComX ς-factor is very different from that in Streptococcae.

Another difference between L. lactis and S. pneumoniae competence systems is that the leaderless mRNAs are present in the former organism only. The translation of such mRNAs requires that they start precisely at the initiation codon of the gene (Kravchenko et al. 1988; Van Etten and Janssen 1998). Synthesis of competence-related proteins would, therefore, not take place on spurious transcription of the cognate genes by leakage from upstream operons. This might tighten the control of the competence development and does limit it to very strict environmental conditions.

Horizontal Gene Transfer between Lactococci and Gram-Negative Enteric Bacteria

We detected a gene of unknown function, designated ycdB, which appears to be present in all bacteria and some eukaryotes. The level of identity between the YcdB protein and a homolog from S. pyogenes or S. pneumococcus, phylogenetically close toL. lactis, is ∼80%, while the identity with the homologous genes from gram-negative bacteria is ∼40%. Very surprisingly, theE. coli and S. typhimurium genomes encode not only a protein that is 40% identical with YcdB but also a protein that is 94% identical to YcdB. We conclude that this second ycdB gene has been transferred from lactococci to enteric bacteria. The divergence of the synonymous nucleotide sites in L. lactisIL1403, compared with Salmonella and E. coli, is ∼10%. If the rate of nucleotide changes at such sites is ∼1% per million years (Ochman et al. 1999), the genes inSalmonella/E. coli and L. lactis IL1403 started to diverge 10 million years ago. However, comparison of theycdB genes in different strains of lactococci and in gram-negative enteric bacteria may reveal even more closely related genes and allow us to better assess the time of the gene transfer, the species that may have been involved in the transfer, and the mechanism of the transfer. Nevertheless, anticipating that closer homologs will be found, it is tempting to speculate that the transfer may have taken place in the digestive tract of ruminants, if it involved wild-type lactococci, or of humans, if it involved the domesticated lactococci, massively introduced there by cheese consumption.

Analysis of completely sequenced genomes, available from the NCBI server, revealed that most bacteria have only one homolog to YcdB. Some (E. coli, S. typhimurium, B. subtilis,E. faecalis, and Shewanella putrefaciens), however, have two, indicating that the family might be undergoing an expansion where, at least for enteric bacteria, a lateral gene transfer from lactococci might be a driving force. As the function of this gene is unknown, the advantage that the second copy confers is not known. Elucidation of the gene function would help to answer this question.

METHODS

Genome Cloning, Sequencing, and Data Verification

The strain IL1403 is a plasmid-free derivative of the strain IL594, isolated from a cheese starter culture (Chopin et al. 1984). Diagnostic sequencing, involving 10,235 sequencing reactions and yielding a total of 4,687,630 bases, has been described previously (Bolotin et al. 1999). Further sequencing was carried out to assure us that each nucleotide in the genome was read at least four times and at least once on each strand. For this purpose, a collection of short insert clones was constructed. A total of 9,888,620 bases, covering 93% of the total genome, were produced by 15,578 more sequencing reactions. To reduce the error rate level to <0.01%, 978 more reactions, with average read length of 632 bases, were carried out using genome-specific primers. The redundancy of the final assembling is 6.44.

Informatics and Gene Nomenclature

Assembling manual corrections of sequencing errors and consensus generation were carried out concurrently with data accumulation, using the XBAP program (Dear and Staden 1991; version 14.0). To predict protein-coding regions, we used a conceptual translation of the whole genome in six possible coding frames. The predicted proteins >60 amino acids were checked for the statistical consistency with the output of the GENMARK program (Borodovsky and McIninch 1993) using parameters for Streptococcal genes. EBI server (http://www2.ebi.ac.uk/genemark) and pyogenes_3.xdr matrix dated November 14, 1996, were used for this analysis. The presence of a putative ribosome-binding site upstream of the 5′ end of the candidate was searched next. As a ribosome binding site, we considered the presence of initiator codon ATG, TTG, or GTG and a short sequence homologous to the 3′ end of 16S rRNA of L. lactis(5′…GGAUCACCUCCUUUCUAA 3′) upstream of it (Chiaruttini and Milet 1993). Genome annotation was done by using several homemade shell or Perl scripts, generating convenient html format tables linked toBLAST (Altschul et al. 1990) output files. NCBI server (http://www.ncbi.nlm.nih.gov/Entrez) was used to generate updated bacterial protein databases. Homology analysis of YcdB with the unpublished genome sequences was carried out by using the relevant NCBI server (http://www.ncbi.nlm.nih.gov/Microb_blast/unfinishedgenome.html). The functional classification of genes was done according to the list of categories presented earlier (Bolotin et al. 1999). Fully automatic computer-generated classification was used as the starting material. Each protein was then analyzed by an expert to improve the category assignment, which is presented in Table 1 and Figure2. The expert usually used three means to confirm or to alter the automated function assignment and classification: first, phylogenetic or COGnitor (Tatusov et al. 1997) assisted scrutiny of BLAST or FASTA reslts (performed with different parameters); second, complete knowledge of particular biochemical pathways or biological systems, existing in other than L. lactis IL1403 organisms (such as protein secretion or the competence system). Phage-specific proteins were classified to those because of their clustering in the areas identified as prophages. Also, specialized databases (Quentin et al. 1999) were used by the expert to classify the ABC transporters; third, results of numerous experiments in L. Lactis, published previously (148 functional assignments). Although it is never absolutely explicit, the provided classification of gene functions in L. lactis IL1403 is biological, rather than biochemical.

Figure 2.

Linear map of the Lactococcus lactis ssp. lactisIL1403 chromosome. Coding regions are shown as arrows color-coded to the assigned functional categories. IS-elements and rRNA genes are shown as black arrows with white designation numbers inside. Symbols shown in pink identify genes in which frameshifts were detected.

5f2_C4TT

L. lactis paralogous gene families were constructed by searching each predicted protein against all predicted proteins, usingBLASTP with different parameters. Alignments of proteins in the identified families were then scrutinized to make a decision of how many proteins belong to a family. This decision was based either on the size of homologous domains or on the similarity levels. A protein was always assigned to only one family of paralogs.

We tried to keep the same gene symbols as proposed by the previous authors for ORFs with functions experimentally confirmed in L. lactis (148 genes). A y prefix with the gene symbol consistent with its position on the chromosome (Fig. 2) was kept for unascertained functions (1149 genes). Other gene symbols, consistent with those for homologs found in other bacteria, are proposed here (1017 genes).

Accessibility of Data

The nucleotide sequence of the L. lactis IL1403 genome is available from NCBI with accession no. AE005176. Updated annotations are supported at the Génétique Microbienne (INRA) server athttp://spock.jouy.inra.fr. A PatScan of Ross Overbeek (Dsouza et al. 1997) for pattern searches in DNA sequence and proteins, implemented for IL1403, and peptide spectrum identification tool PeptOko for L. lactis proteome research are also available from this server.

We thank Jacek Bordovski and Saulius Kulakauskas for giving samples of the L. lactis IL1403 strain and Marie-Christine and Alain Chopin, Patrick Duwat, Emmanuel Jamet, Alexandra Gruss, Emmanuelle Maguin, Isabelle Poquet, Pierre Renault, and Catherine Robert for helpful discussions. We thank also the Genome Centers that contributed to the Unfinished Microbial Genome Database available forBLAST search through the NCBI server (http://www.ncbi.nlm.nih.gov/Microb_blast/unfinishedgenome.html).

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.

Notes

[2] Present address: Génétique des Poissons, INRA, Domaine de Vilvert, 78352 Jouy en Josas CEDEX, France.

[3] Corresponding author.

Notes

[4] E-MAIL [email protected]; FAX 33-1-34-65-25-21.

[5] Article published on-line before print: Genome Res.,10.1101/gr.169701.

[6] Article and publication are at www.genome.org/cgi/doi/10.1101/gr.169701.

REFERENCES

  1. S.F. AltschulW. GishW. MillerE.W. MyersD.J. Lipman(1990) Basic local alignment search tool. J. Mol. Biol. 215:403–410.
  2. A. BolhuisC.P. BroekhuizenA. SorokinM.L. van RoosmalenG. VenemaS. BronW.J. QuaxJ.M. van Dijl(1998) SecDF of Bacillus subtilis, a molecular Siamese twin required for the efficient secretion of proteins. J. Biol. Chem. 273:21217–21224.
  3. A. BolotinS. MaugerK. MalarmeS.D. EhrlichA. Sorokin(1999) Low-redundancy sequencing of the entire Lactococcus lactis IL1403 genome. Antonie Leeuwenhoek 76:27–76.
  4. M. BorodovskyJ. McIninch(1993) GENMARK: A parallel gene recognition for both DNA strands. Comput. Chem. 17:123–133.
  5. G. BuistH. KarsensA. NautaD. van SinderenG. VenemaJ. Kok(1997) Autolysis of Lactococcus lactis caused by induced overproduction of its major autolysin, AcmA. Appl. Environ. Microbiol. 63:2722–2728.
  6. C. ChiaruttiniM. Milet(1993) Gene organization, primary structure and RNA processing analysis of a ribosomal RNA operon in Lactococcus lactis. J. Mol. Biol. 230:57–76.
  7. A. ChopinM.C. ChopinA. Moillo-BattP. Langella(1984) Two plasmid-determined restriction and modification systems in Streptococcus lactis. Plasmid 11:260–263.
  8. A. ChopinA. BolotinA. SorokinS.D. EhrlichM.C. Chopin(2001) Analysis of six prophages in Lactococcus lactis IL1403: Different genetic structure of temperate and virulent phage populations. Nucleic Acids Res. 29:644–651.
  9. J. ChristensenE. DudleyJ. PedersonJ. Steele(1999) Peptidases and amino acid catabolism in lactic acid bacteria. Antonie Leeuwenhoek 76:217–246.
  10. M. Cocaign-BousquetC. GarriguesP. LoubiereN.D. Lindley(1996) Physiology of pyruvate metabolism in Lactococcus lactis. Antonie Leeuwenhoek 70:253–267.
  11. B. CousineauD. SmithS. Lawrence-CavanaghJ.E. MuellerJ. YangD. MillsD. ManiasG. DunnyA.M. LambowitzM. Belfort(1998) Retrohoming of a bacterial group II intron: Mobility via complete reverse splicing, independent of homologous DNA recombination. Cell 94:451–462.
  12. S. DearR. Staden(1991) A sequence assembly and editing program for efficient management of large projects. Nucl. Acids Res. 19:3907–3911.
  13. F. DickelyD. NilssonE.B. HansenE. Johansen(1995) Isolation of Lactococcus lactis nonsense suppressors and construction of a food-grade cloning vector. Mol. Microbiol. 15:839–847.
  14. M. DsouzaN. LarsenR. Overbeek(1997) Searching for patterns in genomic data. Trends Genet. 13:497–498.
  15. P. DuwatA. CochuS.D. EhrlichA. Gruss(1997) Characterization of Lactococcus lactis UV-sensitive mutants obtained by ISS1 transposition. J. Bacteriol. 179:4473–4479.
  16. S. FernandezA. SorokinJ.C. Alonso(1998) Genetic recombination in Bacillus subtilis 168: Effects of recU and recS mutations on DNA repair and homologous recombination. J. Bacteriol. 180:3405–3409.
  17. P.F. Fox(1989) Cheese: An overview. in Cheese: Chemistry, Physics and Microbiology, ed P.F. Fox(Chapman & Hall, London), pp 1–36.
  18. C.M. FraserR.D. Fleischmann(1997) Strategies for whole microbial genome sequencing and analysis. Electrophoresis 18:1207–1216.
  19. M.J. Gasson(1983) Plasmid complements of Streptococcus lactis NCDO 712 and other lactic streptococci after protoplast-induced curing. J. Bacteriol. 154:1–9.
  20. J.J. GodonC. DelormeJ. BardowskiM.C. ChopinS.D. EhrlichP. Renault(1993) Gene inactivation in Lactococcus lactis: Branched-chain amino acid biosynthesis. J. Bacteriol. 175:4383–4390.
  21. H.G. GriffinS.R. SwindellM.J. Gasson(1992) Cloning and sequence analysis of the gene encoding L-lactate dehydrogenase from Lactococcus lactis: Evolutionary relationships between 21 different LDH enzymes. Gene 122:193–197.
  22. P.R. JensenK. Hammer(1993) Minimal requirements for exponential growth of Lactococcus lactis. Appl. Environ. Microbiol. 59:4363–4366.
  23. S. KarudapuramX. ZhaoG.J. Barcak(1995) DNA sequence and characterization of Haemophilus influenzae dprA +, a gene required for chromosomal but not plasmid DNA transformation. J. Bacteriol. 177:3235–3240.
  24. V.V. KravchenkoI.P. GilevaV.N. DobryninS.A. FilipovV.G. Korobko(1988) Location of the initiation codon AUG in relation to the 5′-end of mRNA mediates the effectiveness of translation in E. coli cells. Bioorg. Khim. 14:1387–1392.
  25. F. KunstN. OgasawaraI. MoszerA.M. AlbertiniG. AlloniV. AzevedoM.G. BerteroP. BessieresA. BolotinS. Borchert(1997) The complete genome sequence of the gram-positive bacterium Bacillus subtilis. Nature 390:249–256.
  26. P. Le BourgeoisM. LautierM. MataP. Ritzenthaler(1992) Physical and genetic map of the chromosome of Lactococcus lactis subsp. lactis IL1403. J. Bacteriol. 174:6752–6762.
  27. P. Le BourgeoisM. LautierL. van den BergheM. GassonP. Ritzenthaler(1995) Physical and genetic map of the chromosome of Lactococcus lactis subsp. cremoris MG1363 chromosome: Comparison with that of Lactococcus lactis subsp. lactis IL1403 reveals a large genome inversion. J. Bacteriol. 177:2840–2850.
  28. M.S. LeeD.A. Morrison(1999) Identification of a new regulator in Streptococcus pneumoniae linking quorum sensing to competence for genetic transformation. J. Bacteriol. 181:5004–5016.
  29. Y. Le LoirA. GrussS.D. EhrlichP. Langella(1998) A nine-residue synthetic propeptide enhances secretion efficiency of heterologous proteins in Lactococcus lactis. J. Bacteriol. 180:1895–1903.
  30. F.L. Lopez de FelipeM. KleerebezemW.M. de VosJ. Hugenholtz(1998) Cofactor engineering: A novel approach to metabolic engineering in Lactococcus lactis by controlled expression of NADH oxidase. J. Bacteriol. 180:3804–3808.
  31. M.N. Mickelson(1972) Glucose degradation, molar growth yields, and evidence for oxidative phosphorylation in Streptococcus agalactiae. J. Bacteriol. 109:96–105.
  32. H. OchmanS. ElwynN.A. Moran(1999) Calibrating bacterial evolution. Proc. Natl. Acad. Sci. 96:12638–12643.
  33. E.V. PestovaD.A. Morrison(1998) Isolation and characterization of three Streptococcus pneumoniae transformation-specific loci by use of a lacZ reporter insertion vector. J. Bacteriol. 180:2701–2710.
  34. J.A. PoglianoJ. Beckwith(1994) SecD and SecF facilitate protein export in Escherichia coli. EMBO J. 13:554–561.
  35. B. Poolman(1993) Energy transduction in lactic acid bacteria. FEMS Microbiol. Rev. 12:125–147.
  36. I. PouquetV. SaintE. SeznecN. SimoesA. BolotinA. Gruss(2000) HtrA in the unique surface housekeeping protease in Lactococcus lactis and is required for natural protein processing. Mol. Microbiol. 35:1042–1051.
  37. Y. QuentinG. FichantF. Denizot(1999) Inventory, assembly and analysis of Bacillus subtilus ABC transport systems. J. Mol. Biol. 287:467–484.
  38. G. RollanM.C.M. de NadraP.R. HolgadoG. Oliver(1985) Aspartate metabolism in Lactobacillus murinus CNRS 313. I. Aspartase. J. Gen. Appl. Microbiol. 31:403–409.
  39. J.W. SandersK.J. LeenhoutsA.J. HaandrikmanG. VenemaJ. Kok(1995) Stress response in Lactococcus lactis: Cloning, expression analysis, and mutation of the lactococcal superoxide dismutase gene. J. Bacteriol. 177:5254–5260.
  40. J.W. SandersK. LeenhoutsJ. BurghoornJ.R. BrandsG. VenemaJ. Kok(1998) A chloride-inducible acid resistance mechanism in Lactococcus lactis and its regulation. Mol. Microbiol. 27:299–310.
  41. K.H. SchleiferJ. KrausC. DvorakR. Kilpper-BälzM.D. CollinsW. Fischer(1985) Transfer of Streptococcus lactis and related streptococci to the genus Lactococcus gen. nov. Syst. Appl. Microbiol. 6:183–195.
  42. A.K. Sijpesteijn(1970) Induction of cytochrome formation and stimulation of oxidative dissimilation by hemin in Streptococcus lactis and Leuconostoc mesenteroides. Antonie Leeuwenhoek 36:335–348.
  43. A. SorokinA. LapidusV. CapuanoN. GalleronP. PujicS.D. Ehrlich(1996) A new approach using multiplex long accurate PCR and yeast artificial chromosomes for bacterial chromosome mapping and sequencing. Genome Res. 6:448–453.
  44. L. SteidlerW. HansL. SchotteS. NeirynckF. ObermeierW. FalkW. FiersE. Remaut(2000) Treatment of murine colitis by Lactococcus lactis secreting interleukin-10. Science 289:1352–1355.
  45. R.L. TatusovE.V. KooninD.J. Lipman(1997) A genomic perspective on protein families. Science 278:631–637.
  46. K. TedinI. MollS. GrillA. ReschA. GraschopfC.O. GualerziU. Blasi(1999) Translation initiation factor 3 antagonizes authentic start codon selection on leaderless mRNAs. Mol. Microbiol. 31:67–77.
  47. B. Ten BrinkR. OttoU.P. HansenW.L. Konings(1985) Energy recycling by lactate flux in growing and non-growing cells of Streptococcus cremoris. J. Bacteriol. 162:383–390.
  48. H. TettelinN.J. SaundersJ. HeidelbergA.C. JeffriesK.E. NelsonJ.A. EisenK.A. KetchumD.W. HoodJ.F. PedenR.J. Dodson(2000) Complete genome sequence of Naisseria meningitidis serogroup B strain MC58. Science 287:1809–1815.
  49. R. ValyaseviW.E. SandineB.L. Geller(1990) The bacteriophage kh receptor of Lactococcus lactis subsp. cremoris KH is the rhamnose of the extracellular wall polysaccharide. Appl. Environ. Microbiol. 56:1882–1889.
  50. W.J. Van EttenG.R. Janssen(1998) An AUG initiation codon, not codon-anticodon complementarity, is required for the translation of unleadered mRNA in Escherichia coli. Mol. Microbiol. 27:987–1001.
  51. R. Van KranenburgJ.D. MaruggI.I. van SwamN.J. WillemW.M. de Vos(1997) Molecular characterization of the plasmid-encoded eps gene cluster essential for exopolysaccharide biosynthesis in Lactococcus lactis. Mol. Microbiol. 24:387–397.
  52. M. YvonE. ChambellonA. BolotinF. Roudot-Algaron(2000) Characterisation and role of the branched-chain aminotransferase (BcaT) isolated from Lactococcus lactis subsp. cremoris NCDO763. Appl. Environ. Microbiol. 66:571–577.
Loading
Loading
Loading
Back to top