Escolar Documentos
Profissional Documentos
Cultura Documentos
TALKING
POINT TIBS 21 - M A Y 1 9 9 6
(a) (b)
66 9 9 82 472 481 A Me. therm. PNPLEYGMLDVMFSEHCSYKSSR? VLGFFPTEGEGVIIGPGDD
~-H.marismortui QDDYSVE ITIEGGAG LS P B.subtilis --YT-I-IFS--W ....... N-K- I-RK---S--R-LQ---EG
A|H.cutirubrum EE--T-A . . . . . . . . . . P La. easei --,,-,-,,,,-,----,--,-,- --RQ-W-KN-R-LM---EG
|T.acidophilum --FAA-- T--KFK ---TASTR -- N E.coli
~M.mazei -VY. . . . EAM-K-T -S-QKPG- -- E S.cerevi,
~My.leprae --R-V-- S-W-I- -K-QE-S- -- E D.melano.
I My.tuberculosis -VR-V-- S-W-I- -R-QE-S- --
l St.griseus --R-V-- T-WKID M-VT--SS -P
p|C.perfringens --M . . . . T--K-N ---TASTN -- A Me.therm. AGWEV TDELANA IG I ESHNH PSA IEPYGGAGTGIGG I LRD
I L.lactis --I---~ TSEK-S -V-KSNS- -- P B.subtilis --I-DI G - N Q - W F K . . . . . . . . L-- -Q--A- -V- - - I - -
l Sta.aureus -V .... T--K-D ---QSSSS -- P La.casei ---IDI G E G K - W F K A ........ V---E--A--V---I--
L B.subtilis --M .... T--K-- ---KSSS- -- N E.coli
B.megaterium --I-V-- T-HK-- ---KSST- -- E S.cerevi.
- E.coli -LFA---L] RRFQDEEVQRDVS IMPFRI IAAD NG-AW-- ---KASS- -N E D,melano,
P.cepacia -LFAV--~i ---EEK---K- IGL--YS--K-- NG-AW-- ---KANS---
R.meliloti -LFA---~ -T-E-PTT-K-KGMV-Y -VK-- NG-AW-- -R-QASG- --
N Ca.crescentus -LFA---~i -TAS - PV-EK-KGMV-YRSSR-R AG-AW-K -R-QANG- --
Rru.ovis -LFAV--~- --YD-PM-TK-RDLV-Y -VKG- NG-AW-- -R QASG- --
Bo.burgdorferi
Chl.pneumoniae
--Y .... ~- . . . . EE -ASEI KMV-Y--EKGL
-LG-T--~I- -KY S--ASEIQTV-YTVTSGS
NG-AR-N
KG-AVF-
-R--SSS- --
-R--ASS- -Q
(c)
- Synechocystis sp. -FY-V--~I- -R- D-ITNEATEVAYSVVKDG NGNVKLD -S-T-AST -P
~
- S.cerevisiae (m) -LFAT--~[ . . . . E-A ..... IKQV-Y--VKHS NG-AW-- --VA-SS- --
Saccharomyces Escherichiacoli
I
D.melanogaster(m) - F Y A T - - ~ . . . . D-P--KK- ITNLSY-WK-S NG-AW-S -V-QSSG- --
Pea (m) -LFGT--~I . . . . D-AQT-KEM}tMV- Y -VR-P NG-AW-- ---RSSG- -- cerevisiae
O Tr.cruzi {m) -FFAV--[,I . . . . E-SNI-H-IKNV-Y -GRSS NG-AW-Q ---TASG- --
Mouse (m) -FYAT--~---YD-R---~-T~V VR-S .... NG-AW-- -V-QSSG- --
Pa.lutherii(chl) -F~-V--~i'~--ps R--SREr.RQT-~--EDS R~KIRr, R ---S-AST -P / ~ Methanobacteriurn
Cr.phi (chl) W-S- - -SEELKQVSY IVKTDS NGNIKLD ---T-AST -P
- Pea (chl) -~ S--DEESRQVSY~V-RD- NG~LD ---T-AST -P
- S.cerevisiae(BiP) LKYN-RS - -K- IKHL - - N V ~ - G KPAVE- S
H~iman (BiP) -TWN- PS - -Q- IKFL- - -VVEKKT KPYIQ-D
G.lamblia(BiP) -K- D- P---K-MKLL-Y-V-NK-G RPFVQLS
Drosophila "~.
E
G.lamblia
En.histolytica
Maize
Soybean
~ -N- P- - -A-LKHFRSRSSCGPTR
- - -S -PAI -N-MKHWS --V-DDGH
---SSPA--SSMKLW-SRHLGL G
---S-SS--N-MKLW-- VGGSPC
TPQIQ-V
KPLIE--
KPMIVFN
KPMIV-N
melanogaster I"~LaLactobaci/lus
Ch.reinhardtii -K-S-PI --S- IKLW-SQVAP-H VPEIV-S Bacillus casei
S.cerevisiae AI -N-N-P---A-~[KHF-- L-DV-G KPQIQ--
D.melanogaster 2 -KYD-PKIAE-MKHW---WSDGG KPK[G--
subtilis
X.laevis -K-N- PV--C- L K H W - - Q W S D E G KPKVK--
Chicken -KYD- PT- - S-MKHW- -RVVNEGG KPKVQ--
- Human --D-AV--S-MKHW--MVVNDAG RPKVQ--
Rgure 2
(a) Sequence signatures in the hsp70 protein family showing the relatedness of eukaryotic cytosolic homologs to the Gram-negative
eubacteria. The letters A, E, N, P and 0 refer to sequences from archaebacteria (A), eukaryotes (cytosolic and endoplasmic reticulum; E),
Gram-negative bacteria (N), Gram-positive bacteria (P) and eukaryotic organellar sequences [namely mitochondria (m) and chloroplasts (chl);
0], respectively. The numbers on the top refer to the position in the Halobacterium marismortui hsp70 sequence. The dashes (-) indicate
identity with the amino acid in the top line. Symbols above or below the shaded regions refer to different types of sequence signatures: II,
sequence signatures common to only archaebacteria and Gram-positive bacteria; e , sequence features present only in eukaryotic cytosolic
and ER (BiP) homologs; and A, shared sequence features between Gram-negative eubacteria and eukaryotic homologs. Not all signature
sequences of the above kinds are shown. The boxed region shows the large insert present in only Gram-negative eubacteria and eukaryotic
homologso Sequences from only representative species are shown, however, homologs from all other species examined also contain the
indicated signature sequences. Abbreviations used in the species names are as follows: B, Bacillus; Bo, Borellia; Bru, Brucefa; C,
Clostridium; Ca, Caulobactec, Ch, Chlamydornonas; Chl, Chlamydia; Cr, Cryptomona; D, Drosophila; E, Escherichia; En, Entamoeba; G,
Giardia; H, Halobacterium; L, Lactococcus; M, Methanosarcina; My, Mycobacterium; P, Pseudomonas; Pa, Pavlova; R, Rhizobium; S,
Saccharomyces; St, Streptomyces; Sta, Staphylococcus; T, Thermoplasma; Tr, Trypanosoma; X, Xenopus. The notations (mit), (chl) and (BiP)
in parenthesis denote mitochondrial, chloroplast and ER-resident forms of hsp70, respectively. The database accession numbers of these
sequences are described in earlier work24. (b) Alignment of sequences of phosphoribosylformylglycinamidine synthase (FGARAT) from
various species. The species names and GenBank accession numbers of the sequences are as follows: Escherichia coil (M19501),
Saccharomyces cerevisiae (U14566), Drosophila melanogaster (U00683), Methanobacterium thermoautotrophicurn (Me. therrn.) (M59245),
Bacillus subtilis (J02732), Lactobacillus (La.) casei (M85265). The shaded regions show the shared insertions in Gram-negative bacteria and
eukaryotes. (r A distance tree based on the FGARATsequences.
Gram-negative bacteria, the phylogenetic between the eucarya and the Gram- these groups is also strongly supported
trees based on these protein sequences negative bacteria is the enzyme phos- by phylogenetic analysis of the se-
are otherwise conventional and indi- phoribosylformylglycinamidine synthase quence data (Fig. 2).
cate relationships between organisms (FGARAT). From a partial alignment of To investigate this problem further,
that would have been expected (in the FGARAT sequences presented in Fig. 2, we have collected all proteins in the
sense that plants cluster together, it is readily seen that a number of con- databases for which sequences were
vertebrates cluster together and mito- served indels (i.e. either insertions or available at the time from Gram-positive
chondria and chloroplast homologs are deletions) are uniquely shared by bacteria (P), Gram-negative bacteria
specifically related to ~-proteobacteria Gram-negative bacteria and eukaryotic (N), archaea (A) and eucarya (E)27. A
and cyanobacteria, respectively) n-m3~ homologs on one hand, and Gram-posi- total of 24 proteins were found that ful-
Another example of a protein sequence tive bacteria and archaebacteria on the filled these criteria. After alignment of
that demonstrates a close relationship other. A specific relationship between the sequences, they were evaluated to
168
T1BS 21 - MAY 1996 TALKINGPOINT
determine which of the three possible forms of hsp70 consistently tree within
TaMe I. Global phylogeny of protein
monophyletic arrangements they sup- the ~-proteobacteria, and the chloro- sequences a
port [namely the archaebacterial tree plast forms within the cyanobacte-
(P,N)(A,E), the tree supported by hsp70, ria 23,24,30,31. The cytosolic and ER forms Eukaryote-Gram-negative clades
gdh and glna (P,A)(N,E), or the third of hsp70, which are found in all eukary- Aspartate aminotransferase
possible topology (P,E)(N,A)]27. The otic species, including the amitochon- Hsp70
analysis was performed using both the drial protist Giardia lamblia 25, contain Ferredoxin
Glutamate dehydrogenase
maximum likelihood and the maximum distinctive signature sequences that are
Glutamine synthetase
parsimony methods27. The results are not present in any organellar homologs Phosphoribosylformylglycinamidinesynthase
summarized in Table I. In addition to (see Fig. 2; Refs 24, 25). Thus, it is Pyrroline-5-carboxylatereductase
hsp70, glutamine synthetase, glutamate highly unlikely that the unusual branch-
dehydrogenase and FGARATproteins, a ing pattern for this protein might be a Eukaryote-Archaebacterial clades
number of other proteins significantly consequence of comparison between Arginino-succinatesynthase
ATP synthase a-chain
preferred a phylogeny that clusters the non-orthologous sequences.
ATP synthase 13-chain
eucarya with Gram-negative bacteria Second, it has been suggested that Dihydrofolate reductase
and the archaea with Gram-positive some of the unusual topologies could Elongationfactor 1
bacteria. In fact, the protein-based trees be because of horizontal gene trans- Elongationfactor 2
are almost evenly split between this fers2~ but there are several lines of Indole-3-glycerolphosphate synthase
topology and the presently favored evidence that show this is not the Phosphog[yceratekinase
topology that groups archaea with cause. For the hsp70 sequence data, DNA-directedRNA polymerase
eucarya. The proteins for which no this possibility has been ruled out 24'25.
Equivocal clades
significant preference was observed do Furthermore, if horizontal transfer is in- [~-galactosidase
not support or refute either phylogeny. voked to explain the unusual phylogeny Carbamoyl phosphate synthase
However, it is significant to note that of approximately half of all proteins, Deoxyribopyrimidinephotolyase
none of the proteins supported the then why must it occur only between Gtyceraldehyde3-phosphatedehydrogenase
third phylogeny (P,E)(N,A), which eucarya and Gram-negative bacteria on Superoxidedismutase
groups the Gram-positive bacteria with the one hand, and archaea and Gram- Anthranilate synthase components I and II
Anthranilate phosphoribosyitransferase
the eucarya. This result provides strong positive bacteria on the other? All of
N-(5'-phosphoribosyl)anthranilate isomerase
evidence that the methods are not the gene/protein sequences give: (1) a
generating false significance scores, be- similar relationship between various eu- Eukaryote-Gram-posltlveclade
cause the sequences are too distantly karyotic species; and (2) they also indi- None
related or no longer contain traces of cate a similar origin of mitochondria
their ancestry. If this were the case, an and chloroplasts from specific groups aA summary of the phylogenetic analysis
based on different protein sequences27. For
equal number of proteins would be of bacteria, thus arguing strongly the proteins listed under the first two cat-
expected to support each of the three against extensive gene transfers. egories above, the observed phylogenetic re-
alternative phylogenies. lationships in alI cases is statistically signifi-
cant. The proteins listed under the equivocal
Chimeric model for the origin of the clades did not yield statistically significant
Orthology-paralogy and horizontal gene eukaryotic nuclear genome relationships27.
transfer If the observed results are not anoma-
How can the two different kinds of lous, then to explain them it is necess-
phylogenies inferred from these protein ary to postulate that the eukaryotic the different characteristics of the two
sequences be reconciled? Because some nuclear genome received major contri- parents. This in turn explains the many
of the proteins suggest the same topol- butions from both an archaebacterium cellular and biochemical features that
ogy as the rRNA sequences, should we as well as a Gram-negative eubacterium. are shared by eukaryotes with the
prefer that particular inference and ig- Or in other words, it is a chimera archaea on the one hand and with the
nore the others? It is important to ex- formed by the fusion of these groups of eubacteria on the other4-6.
clude two possibilities that could lead parental organisms. Based on Rivera During the evolution of the eukary-
to incorrect phylogenies before consid- and Lake's results with the EF-la and otic genome from the chimeric ances-
ering other models to explain these re- EF-2 sequences ~6, the archaebacterial tor, it is expected that the genes/pro-
sults. First, all sequences used to recon- partner is likely to be a thermoacido- teins that were already functioning as a
struct phylogenies must be orthologous. philic 'eocyte' group of archaebac- unit within cells (e.g. translation and
If they are not then an incorrect phy- terium. The nature of the Gram-negative perhaps also transcription apparatus)
logeny will be inferred. However, for fusion partner is not clear at present, were taken as a whole from one of the
many of the proteins that fail to cluster but in the hsp70 phylogeny, a closer parents. Indeed, the sequence data on
the archaea and eucarya (e.g. hsp70, relationship of eukaryotic cytoplasmic various translational/transcriptional re-
glutamine synthetase and glutamate homologs to the cyanobacterial species lated genes (e.g. rRNA, ribosomal pro-
dehydrogenase), multiple forms are is often observed 23,z4. Following the fu- teins, EF-G,EF-Tu,RNA polymerase, TATA-
well known and have been incorporated sion event, the two sets of genes/pro- element-binding protein, etc.) s2 suggest
into their phylogenies2]-26. In fact, the teins that were present in the resulting that they originated from the archae-
distinct clustering of various hsp70 iso- pro-eukaryote cell had then to be re- bacterial ancestor. In view of this, it is
forms into their phylogenetic trees solved into a single functioning cell. By probable that the translational/tran-
gives further confidence that the phy- choosing a mixture or set of genes from scriptional-related gene sequences do
logeny inferred from these proteins is each of the two parental organisms, a not provide independent estimates of
reliable. For example, the mitochondrial eukaryotic cell evolved that inherited the eukaryotic phylogeny. Hence, the
169
TALKINGPOINT TIBS 21 - MAY 1996
Concludingremarks
The model proposed here can ex-
Figure 3 plain many of the distinctive genetic,
Origin of the eukaryotic cell nucleus and endomembrane system by the chimeric model. biochemical and structural characteris-
(a) The key event in the origin of the eukaryotic cell is postulated to be the engulfment of an tics of the eukaryotic cells, including the
'eocyte' archaebacterium by a Gram-negativeeubacterium that presumably lacked a cell wail.
(b) As the membrane of the host surrounded the guest species, its own membrane (con-
ester-linked nature of their membrane
taining ether-linked lipids) became redundant and was lost. (c} Eventual separation of the lipids 1-6,12. The origin of some of the
internalized membrane from the plasma membrane led to the formation of the nuclear en- other distinctive features of eukaryotic
velope and the endoplasmic reticulum. The formation of these new compartments was pre- cells (e.g. cytoskeleton) remains unclear
ceded or accompanied by duplication of the genes for the chaperone proteins (e.g. hsp70, at present, however, structural com-
hsp90, etc.). The transfer of the host genome to the newly formed nucleus, and an assortment ponents for these could also be derived
of the genes from the two parents, led to the formation of the ancestral eukaryotic cell. from one of the prokaryotic parents 3~-a7.
By contrast to the presently accepted
interpretation of the rooting of the uni- which evolved by a gene duplication view, this model proposes that the an-
versal tree based on such sequences event that took place very early in cestral eukaryotic cell evolved by a
(namely EF-Tu/G) is questionable ~4,~7,33. the evolution of eukaryotic cells25. A very different mechanism than that re-
similar inference has been reached for sponsible for the diversity seen within
Origin of nucleus and endoplasmicreticulum the ER and cytosolic forms of the hsp90 the prokaryotic world or subsequently
Important insight into the origin of family of proteins 34. These observations within the eukaryotic world. Unlike the
the eukaryotic cell nucleus is provided strongly indicate that the formation of prokaryotic species, which are homo-
by the proteins that are found in the ER. ER (and through inference, nuclear en- genomic, all of the eukaryotic species
As the ER is contiguous with the nu- velope) was accompanied by dupli- are indicated to be heterogenomic 38.
clear envelope as well as the cytomem- cation of the genes for proteins that Furthermore, our observation that all of
brane in eukaryotic ceils, its evolution facilitate membrane transport of other the eukaryotic, cytosolic and ER hsp70s
most probably took place in concert proteins. contain a number of shared sequence
with the nucleus 5,12,2s. For a number of These observations and the chimeric signatures, which are not found in any
proteins (namely hsp70 and hsp90), nature of the eukaryotic nuclear genome prokaryotic or organellar homologs,
which function as molecular chaper- discussed above could be explained if strongly suggests that the major evolu-
ones in the transport of other 'passen- the origin of eukaryotic ceils involved tionary transition from the prokaryotes
ger proteins' across membranes, dis- an endosymbiotic event between a Gram- to the eukaryotes was the result of a
tinct homologs are found in the ER and negative eubacterial host and an archae- single, unique endosymbiotic fusion
cytosolic compartments 2s'34. Recently, bacterial 'eocyte' species 12,24,25. In one event 2s,39. Thus, the model proposes
the genes for both ER and cytosolic possible scenario, shown in Fig. 3, a that, by contrast to the 'pre-karyotic'
hsp70s have been cloned from G. lamblia, Gram-negative eubacterium lacking a nature of the prokaryotes, all of the
which is one of the most primitive cell wall developed a symbiotic relation- eukaryotic species are 'post-karyotes',
eukaryotic species presently known25. ship with an archaebacterium. The host where karyosis refers to the original
The cloning of a gene encoding an ER eubacterium, either naturally or as a re- endosymbiotic event. Lastly the protein-
hsp70 homolog from G. lamblia pro- sult of this association, developed nu- based phylogenies also raise important
vides strong evidence that the ER origi- merous membrane infolds. Over a period questions concerning the relationship
nated very early in the eukaryotic cell of time the membrane of the host organ- between archaebacterial and eubac-
history. Both the ER and cytosolic ism completely surrounded the guest terial species, some of which have been
hsp70s from all eukaryotic species share species, which became the nucleus and discussed previously23,z427'4~
a number of sequence features with some of the membrane infolds devel-
the Gram-negative bacteria (including oped into the ER. (The membrane of the Acknowledgements
the 23-27 amino acid insert in the guest species, which became redundant We thank J. Cechetto for assistance
amino-terminal quadrant), but they also under these conditions was eventually with the artwork. Work from R. S. G.'s
contain numerous unique shared se- lost.) The formation of the nuclear en- laboratory is supported by a research
quence signatures that are not present velope, or ER, involved the creation of a grant from the Medical Research Council
in any prokaryotic or organellar homo- new compartment in the cell that had of Canada. G. B. G. is a fellow of the
logs (see Fig. 2; Ref. 25). Phylogenetic to communicate (i.e. import/export pro- Canadian Institute for Advanced
analyses of ER and cytosolic hsp70s teins and other molecules) with the rest Research and is supported by a Natural
from various eukaryotic species indi- of the cell. Therefore, formation of this Sciences and Engineering Research
cate that they form distinct subfamilies, compartment was either accompanied Council of Canada grant.
170
TIBS 21 - MAY 1996 LETTER
References 15 Woese, C. R., Kandler, O. and Wheelis, M. L. 28 Steel, M. A., Lockhart, P. J. and Penny, D.
1 Margulis, L. (1970) Origin of Eukaryotic Cells, (1990) Proc. Natl Acad. Sci. USA 87, 4576-4579 (1993) Nature 364, 440-442
Yale University Press 16 Rivera, M. C. and Lake, J. A. (1992) Science 29 Morimoto, T. I., Tissieres, A. and
2 Stanier, R. Y., Dondroff, M. and 257, 74-76 Georgopoulos, C. (eds), (1994) The Biology of
Adelberg, E. A. (1990) The Microbial World 17 Hartman, H. (1984) Spec. Sci. Technol. 7, 77-83 Heat Shock Proteins and Molecular Chaperones,
(3rd edn), Prentice Hall 18 Sogin, M. L. (1991) Curr. Opin. Genet. Dev. 1, Cold Spring Harbor Laboratory Press
3 Knoll, A. H. (1992) Science 256, 622-627 457-463 30 Boorstein, W. R., Zeigelhoffer, T. and Craig, E. A.
4 Woese, C. R. (1981) Sci. Am. 244, 98-122 19 Puhler, G. et al. (1989) Proc. Natl Acad. Sci. (1994) J. Mol. Evol. 38, 1-17
5 Cavalier-Smith, T. (1987) Ann. NYAcad. Sci. USA 86, 4569-4573 31 Falah, M. and Gupta, R. S. (1994) J. Bacteriol.
503, 17-54 20 Smith, M. W., Feng, D. F. and Doolittle, R. F. 176, 7748-7753
6 Zillig, W. (1991) Curr. Opin. Genet. Dev. 1, (1992) Trends Biochem. Sci. 17,489-493 32 Klank, H-P. and Doolittle, R. F. (1994) Curr. Biol.
544-551 21 Tiboni, 0., Cammarano, P. and 4, 920-922
7 Gray, M. W. (1992) Int. Rev. Cytol. 141, 233-357 Sanangelantoni, A. M. (1993) J. Bacteriol. 175, 33 Doolittle, R. F. (1995) Proc. Natl Acad. Sci. USA
8 Margulis, L. (1993) Symbiosis in Cell Evolution 2961-2969 92, 2421-2423
(2nd edn), W.H. Freeman & Co. 22 Benachenhou-Lahfa,N., Forterre, P. and 34 Gupta, R. S. (1995) Mol. Biol. Evol. 12,
9 Mereschkowsky,C. (1905) Biol. Zbl. 25, 595-604 Labedan, B. (1993) J. Mol. Evol. 36, 335-346 1063-1073
10 Woese, C. R. (1987) Microbiol. Rev. 51, 221-271 23 Gupta, R. S. and Golding, G. B. (1993) J. Mol. 35 Bermudes, D., Hinkle, G. and Margulis, L.
11 Lake, J. A., Henderson, E., Clark, M. W. and Evol. 37,573-582 (1994) Microbiol. Rev. 58, 387-400
Matheson, A. T. (1982) Proc. Natl Acad. Sci. 24 Gupta, R. S. and Singh, B. (1994) Curr. Biol. 4, 36 Erickson, H. (1995) Cell 80, 367-370
USA 79, 5948-5952 1104-1114 37 Gupta, R. S. and Soitys, B. J. Biochem. Mol.
12 Lake, J. A. and Rivera, M. C. (1994) Proc. Natl 25 Gupta, R. S., Aitken, K., Falah, M. and Singh, B. Biol. Int. (in press)
Acad. Sci. USA 91, 2880-2881 (1994) Proc. Natl Acad. Sci. USA 91, 2895-2899 38 Margulis, L. (1996) Proc. Natl Acad. Sci. USA
13 Iwabe, N. et al. (1989) Proc. Natl 4cad. Sci. 26 Brown, J., Masuchi, Y., Robb, F. T. and Doolittle, 93, 1071-1076
USA 86, 9355-9359 W. F. (1994) J. Mol. Evol. 38, 566-576 39 Szathmary, E. and Maynard-Smith, J. (1995)
14 Gogarten, J. P. et al. (1989) Proc. Natl Acad. 27 Golding, G. B. and Gupta, R. S. (1995) Mol. Nature 374, 227-232
Sci. USA 86, 6661-6665 Biol. Evol. 12, 1-6 40 Gupta, R. S. (1995) Mol. Microbiol. 15, 1-11