Evolution of Eukaryotic Transcription: Insights From the Genome of Giardia lamblia

Fig. S3
Best, A.A.

TBP Alignment Showing Exceptions to Consensus


                                                                                        *                     *
M. jannaschii                                 MEPEIKIVNVVVSTKIGDNIDLEEVAMI--LENAE--YEPEQFPGL--VCRLSVPKV---ALLIFRSGKVNCTGAKSKEEAEIAIKKIIKELKDAG-ID-
A. fulgidus                                  MQDYKIKIENVVASTQIGENIDLNKISRE--IKDSE--YKPKQFPGL--VLRTKEPKA---AALVFRSGKVVCTGSKSVEDARRAVKQIVKMLKEIG-IS-
M. thermoautotrophicum                       MTDVDIKIENIVASATLGKSIDLQTVAEA--LENVD--FNREQFPGL--VYKLKEPKT---AALIFGSGKLVCTGAKSIEDSKRAIKLTVDMMRTMD-PD-
P. woesei                                 MVDMSKVKLRIENIVASVDLFAQLDLEKVLDL--CPNSK--YNPEEFPGI--ICHLDDPKV---ALLIFSSGKLVVTGAKSVQDIERAVAKLAQKLKSIG-VK-
S. solfataricus                                         MATVTLEQSLDLYAMERS--IPNIE--YDPDQFPGL--IFRLEQPKV---TALIFKSGKMVVTGAKSTEELIKAVKRIIKTLKKYG-IK-
A. pernix                MAVVSEEISFVKEIDTGVEGLPKPEVKIENIVATVILENQLDLNLIETK--IQDVD--YNPDQFPGL--VYRLESPRV---TVLIFKSGKMVITGAKSINQLIHVVKKLLKAFADQG-IP-
A. thaliana                 MADQGTEGSQPVDLTKHPSGIVPTLQNIVSTVNLDCKLDLKAIALQ--ARNAE--YNPKRFAAV--IMRIREPKT---TALIFASGKMVCTGAKSEHLSKLAARKYARIVQKLG-FP-
S. pombe              ...NEATNETADSGDAEVSKNEGVSGIVPTLQNIVATVNLDCRLDLKTIALH--ARNAE--YNPKRFAAV--IMRIREPKS---TALIFASGKMVVLGGKSEDDSKLASRKYARIIQKLG-FN-
S. cerevisiae         ...FQSEEDIKRAAPESEKDTSATSGIVPTLQNIVATVTLGCRLDLKTVALH--ARNAE--YNPKRFAAV--IMRIREPKT---TALIFASGKMVVTGAKSEDDSKLASRKYARIIQKIG-FA-
H. sapiens            ...---SPMTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALR--ARNAE--YNPKRFAAV--IMRIREPRT---TALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLG-FP-
D. melanogaster       ...NIHQTMGPSTPMTPATPGSADPGIVPQLQNIVSTVNLCCKLDLKKIALH--ARNAE--YNPKRFAAV--IMRIREPRT---TALIFSSGKMVCTGAKSEDDSRLAARKYARIIQKLG-FP-
T. thermophila        ...DQNKNKNNILSTIETMDKSISEDLYPKLQNIVSTVNLSTKLDLKQIALR--ARNAE--YNPKRFAAV--IMRLRDPKT---TALIFASGKMVCTGAKTEEDSNRAARKYAKIIQKIG-FP-
D. discoideum             MSTATTTSTPAQNVDLSKHPSGIIPTLQNIVSTVNMATELYLKAIALG--ARNAE--YNPKRFAAV--IMRIREPKT---TALIFKSGKMVCTGAKSEDASRFAARKYARIIQKLD-FP-
N. locustae           ...RMDAPDLSRELEIKGQDMYRKSDILPALQNVVATVNLNCKLDLKAIALR--ARNAE--YNPKRFAAV--IMRIRDPKT---TALIFASGKMVVTGAKSEQTSKLAAQKFSRIIHKLG-FN-
E. histolytica        ...NICHAVICQLQLSHKKVLIIQTITHPEIVNVVSRFQLGVKLELRKIVQK--AINAI--YNPKRFAGA--IMRISSPKS---TALIFQTGKIVCTGTRSIEESKIASKKYAKIIKKIG-YP-
G. theta nucleomorph  ...RTRGPSTEAQSSVVPFRAVRANEITPNIQNVVSTVSLGIQLDLKRIALK--ARNAE--YNPRRFAAV--IMRIRDPKT---TALIFSSGKMVVTGAKSEDSARVACKKYARIIQRLG-YG-
A. castellanii        ...PAQSTAASDDMDSDVDRTKHPSGIVPTLQNIVSTVNLGCKLDLKNIALH--ARNAE--YNPKRFAAV--IMRIREPKT---TALIFASGKMVCTGAKSEEASRLAARKYARIIQKLG-FA-
P. falciparum         ...TSEYDNNEKEKSDDLKNKLVHKNISLNIHNIISSANLCIDINLRLVAVS--IRNAE--YNPSKINTL--IIRLNKPQC---TALIFKNGRIMLTGTRTKKDSIMGCKKIAKIIKIVT-KD-
G. lamblia                       MSSPGPSNIELAVQGLSVKVVGYNCRFSLGFNVDMRLLAAS--LLTAD--YNP-RYPTV--RVRLTSPQC---CISVSYHGHCTIFGCESVAQAATAAAVFLKLLNEIEEFVG
primary amino acid                                  l NiVst  l   lDL  i   --  n e--YNP rFaav--i Ri  Pk ---taLIF SGK V TG kS      A     r     g-  -
secondary amino acid                                i  v as  i   i    v       d d      q pgl  v  l   r    al             r             k     d    

secondary amino acid                              i v  m    n       d  v                      v     tri      v                   v   l                 
primary amino acid                                f iQNiV S d      Le  a   -     --YEPE FPGL--iYR   pkv---V LiF SGK V TGAK       a   i   L             
M. jannaschii                               VIENPEIKIQNMVATADLGIEPNLDDIALM--VEGTE--YEPEQFPGL--VYRLDDPKV---VVLIFGSGKVVITGLKSEEDAKRALKKILDTIKEVQEL
A. fulgidus                                 VIDEPEVKVQNIVASADLGVDLNLNAIAIGLGLENIE--YEPEQFPGL--VYRLDNPRV---VVLIFGSGKMVVTGGKSPEDARKAVERISEELRTLGLM
M. thermoautotrophicum                      IPEEFEIKIQNIVASANLGKPLNLEAVALG--LENTE--YEPEQFPGL--VYRLDDPKV---VLLLFGSGKVVCTGAKSAEDAKLGVEKTKARLAELDLI
P. woesei                                   FKRAPQIDVQNMVFSGDIGREFNLDVVALT--LPNCE--YEPEQFPGV--IYRVKEPKS---VILLFSSGKIVCSGAKSEADAWEAVRKLLRELDKYGLLEEEEEEL
S. solfataricus                             IVGKPKIQIQNIVASANLHVNVNLDKAAFL--LENNM--YEPEQFPGL--IFRMDDPRV---VLLIFSSGKMVITGAKREDEVSKAVKRIFDKLAELDCVKPIE...
A. pernix                                   ISGKPQIQIQNIVASANLKVYIDLEKAALE--FENSL--YEPEQFPGL--IYRMDEPRV---VMLIFSSGKMVITGAKMENEVYDAVKKVARKLKEADAIIGIAE
A. thaliana                                 -AKFKDFKIQNIVGSCDVKFPIRLEGLAYSH-SAFSS--YEPELFPGL--IYRMKLPKI---VLLIFVSGKIVITGAKMREETYTAFENIYPVLREFRKVQQ     
S. pombe                                    -AKFTDFKIQNIVGSCDVKFPIRLEGLAYSH-GTFSS--YEPELFPGL--IYRMVKPKV---VLLIFVSGKIVLTGAKVREEIYQAFEAIYPVLSEFRKH       
S. cerevisiae                               -AKFTDFKIQNIVGSCDVKFPIRLEGLAFSH-GTFSS--YEPELFPGL--IYRMVKPKI---VLLIFVSGKIVLTGAKQREEIYQAFEAIYPVLSEFRKM       
H. sapiens                                  -AKFLDFKIQNMVGSCDVKFPIRLEGLVLTH-QQFSS--YEPELFPGL--IYRMIKPRI---VLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT      
D. melanogaster                             -AKFLDFKIQNMVGSCDVKFPIRLEGLVLTH-CNFSS--YEPELFPGL--IYRMVRPRI---VLLIFVSGKVVLTGAKVRQEIYDAFDKIFPILKKFKKQS      
T. thermophila                              -VQFKDFKIQNIVGSTDVKFPINLDHLEQDH-KKFVQ--YEPEIFPGK--IYREFNTKI---VLLIFVSGKIVLTGAKTRENINKAFQKIYWVLYNYQKKDYRG...
D. discoideum                               -ARFTDFKIQNIVGSCDVKFPIKLELLHNAH-TSFTN--YEPEIFPGL--IYKMIQPKV---LLLIFVSGKIVLTGAKVREYIYEAFENIYPVLSAFKKVNAITQ
N. locustae                                 -TKFADFKIQNIVSSCDTQFSIRLEGLAFAH-SNFCS--YEPELFPGL--IYRMVKPKI---VLLIFVSGKIVLTGAKMRDEIYEAFDNIYPVLTQYKKI       
E. histolytica                              -IHYSNFNVQNIVGSCDVKFQIALRTLVDSD-LAFCQ--YEPEVFPGL--VYRMASPKV---TLLVFSTGKVVLTGAKDEESLNLAYKNIYPILLANRKEDISNQ
G. theta nucleomorph                        HAKFIDFRIQNIVASCDVRFPIRLESLAHAH-NQFCS--YEPELFPGL--IYRMITPKV---VLLIFVSGKLVLTGAKQRNDIFQAFSNIYSVLCLYKKT       
A. castellanii                              -AKFLDFKIQNIVGSCDVRFPIRLEGLAFAH-NHYCS--YEPELFPGL--IYRMVQPKI---VLLIFVSGKIVLTGAKVREEIYEAFENIYPVLTEYKKT       
P. falciparum                               KVKFCNFKIENIIASANCNIPIRLEVLAHDH-KEYCN--YEPELFAGL--VYRYKPTSNLKSVILIFVSGKIIITGCKSVNKLYTVFQDIYNVLIQYKN
G. lamblia                                  LARPSPLTVVSITCLTDLGHGIRLDAAAAATISVFSSAMYQPEIMPSLQVVFKIAERNI---CCSVFADGQVTIVGARNIFDARDVITKLYEGLFDYFIT
                                                                                        *                     *

Figure S3 The TATA-binding protein (TBP) sequences from selected Archaea and Eucarya are aligned to show the similarities of the first and second copies of the direct repeat sequence. Consensus amino acids are shown for each alignment column with at most 3 exceptions to a single amino acid, or at most 2 exceptions to two amino acids. The exceptions to the consensus are highlighted in cyan. The conserved phenylalanine residues that intercalate into the DNA helix are also marked with asterisks on the outside of the alignment. Complete names of the taxa included are: Methanococcus jannaschii, Archaeoglobus fulgidus, Methanobacterium thermoautotrophicum, Pyrococcus woesei, Sulfolobus solfataricus, Aeropyrum pernix, Arabidopsis thaliana, Schizosaccharomyces pombe, Saccharomyces cerevisiae, Homo sapiens, Drosophila melanogaster, Tetrahymena thermophila, Dictyostelium discoideum, Nosema locustae, Entamoeba histolytica, Guillardia theta nucleomorph, Acanthamoeba castellanii, Plasmodium falciparum and Giardia lamblia. For some sequences, amino-terminal and/or carboxy-terminal residues have been omitted (indicated by ellipsis).
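The consensus rule stated in the caption can be sketched in Python as follows. This is a minimal illustration, not the authors' procedure: the function names, the tie-breaking between equally frequent residues, and the treatment of gap characters as ordinary symbols are all assumptions, and the upper/lowercase convention of the consensus rows is not reproduced. The toy sequences in the usage note are invented for illustration and are not taken from the alignment.

```python
from collections import Counter


def column_consensus(column, max_single=3, max_double=2):
    """Return (primary, secondary) consensus residues for one column.

    A single-residue consensus is reported when at most `max_single`
    sequences differ from the most common residue. Otherwise a
    two-residue consensus is reported when at most `max_double`
    sequences differ from both of the two most common residues.
    Returns (None, None) when neither condition holds.
    """
    counts = Counter(column).most_common()
    total = len(column)
    top, top_n = counts[0]
    if total - top_n <= max_single:
        return top, None
    if len(counts) > 1:
        second, second_n = counts[1]
        if total - top_n - second_n <= max_double:
            return top, second
    return None, None


def consensus_rows(sequences, max_single=3, max_double=2):
    """Build primary and secondary consensus strings (blank = no consensus)."""
    if len({len(s) for s in sequences}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    primary, secondary = [], []
    for column in zip(*sequences):
        p, s = column_consensus(column, max_single, max_double)
        primary.append(p or " ")
        secondary.append(s or " ")
    return "".join(primary), "".join(secondary)


def exceptions(sequences, primary, secondary):
    """List, per sequence, the column indices that deviate from the consensus."""
    result = []
    for seq in sequences:
        positions = []
        for i, residue in enumerate(seq):
            if primary[i] == " ":
                continue  # no consensus in this column, so nothing to deviate from
            allowed = {primary[i]}
            if secondary[i] != " ":
                allowed.add(secondary[i])
            if residue not in allowed:
                positions.append(i)
        result.append(positions)
    return result
```

With eight hypothetical five-residue sequences such as `["YNPKR", "YNPKR", "YNPKR", "YEPKQ", "YEPEQ", "YEPEQ", "YNPAS", "YDPTT"]`, `consensus_rows` yields one primary and one secondary string, and `exceptions` returns the column indices that would be highlighted for each sequence.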

Genome Res. August 2004, vol. 14, no. 8, 1537-1547. doi: 10.1101/gr.2256604
