CH391L/S14/Ancestral sequence

From SynBioCyc
Revision as of 13:38, 10 February 2014 by Cindy Chan (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Contents

Overview

What is Ancestral Sequence Reconstruction?

Ancestral Sequence Reconstruction is a method used to test molecular evolutionary hypotheses by deriving ancestral gene sequences from extant species. The gene sequences (usually in the form of amino acids) provide the template to synthesize biomolecules present in extinct species. This enables scientists to compare and contrast the extinct species from their diverged descendants and trace the origins and evolution of different proteins.

History

1963: L. Pauling and E. Zuckerkandl introduced the idea of reconstructing genes from extinct species from the inferred ancestral sequences (Pauling) 1971: W. Fitch proposed an algorithm to resurrect genes from extinct ancestors using the parsimony principle [1, 2, 2, 2, 2, 3, 4, 4, 4, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 6, 7, 8, 9, 9, 10, 10, 10, 11, 11, 11, 11, 11, 11, 11, 12, 12, 12, 12, 13, 13, 13, 13, 13, 13, 13, 14, 14, 15, 15, 16, 16, 17, 17, 17, 17, 17, 17, 17, 17, 17, 17, 18, 19, 20, 20, 20, 20, 20, 20, 20, 20, 20, 21, 22, 22, 23, 24, 25, 25, 26, 27, 28, 28, 29, 30, 31, 32, 32, 32, 32, 32, 32, 32, 33, 33, 33, 33, 33, 33, 34, 35, 35, 35, 35, 36, 37, 38, 39, 40, 40, 41, 41, 42, 43, 43, 44, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 45, 46, 47, 47, 47, 48, 48, 49, 50, 51, 52, 52, 52, 53, 53, 54, 54, 55, 56, 57, 58, 58, 58, 58, 58, 58, 58, 59, 60, 60, 60, 60, 61, 62, 62, 63, 63, 63, 63, 63, 63, 63, 63, 64, 64, 64, 64, 65, 65, 65, 65, 65, 65, 65, 65, 65, 65, 65, 65, 65, 65, 65, 65, 66, 67, 68, 68, 68, 68, 68, 68, 68, 69, 69, 69, 69, 70, 71, 72, 72, 73, 73, 73, 74, 75, 75, 76, 76, 76, 76, 76, 76, 77, 77, 78, 79, 80, 80, 81, 81, 82, 83, 84, 84, 85, 86, 86, 87, 87, 88, 88, 89, 90, 90, 91, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 105, 105, 105, 106, 106, 107, 108, 109, 109, 110, 110, 110, 110, 110, 111, 111, 112, 113, 113, 114, 115, 115, 116, 116, 117, 117, 117, 117, 118, 119, 120, 121, 121, 121, 121, 122, 123, 124, 125, 125, 125, 126, 127, 128, 129, 129, 130, 131, 132, 133, 133, 134, 135, 136, 136, 136, 136, 137, 138, 139, 140, 141, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 150, 150, 151, 152, 153, 153, 153, 153, 154, 155, 156, 157, 158, 159, 159, 160, 161, 162, 163, 163, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 173, 174, 175, 175, 176, 177, 178, 179, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261]. Synthesized genes are typically cloned into a vector and used to transform a host organism.

Testing Ancestral Variants

After synthesis is complete, ancestral genes can be tested for their function. For example, the ancestral genes can be cloned into bacterial expression vectors and transformed into E.coli. The proteins can be purified and tested for their activity, whether that be fluorescence spectra, binding affinity, thermoactivity, reporter-gene expression, ligand or substrate binding, etc.

Examples of Ancestral Sequence Reconstructions

As the technology for ancestral sequence reconstruction has advanced the technique has popularized. It offers a unique way to peer into the past and revisit the biomolecules of our forefathers. The sensitivity of genetic information to degrade limits our understanding to extant organisms or circumstantially preserved ones, such as specimens preserved by ice. The ability to resurrect sequences from the dead and test them relieves us from these temporal limitations. Also, predicted mutations and their effects on proteins can be tested out and backed up with data by ASRs, which cannot be done by just structural observations.

Evolution of Coral Pigments

One example of ancestral sequence reconstruction was done by the Matz group (currently residing at the University of Texas at Austin). Fluorescent proteins from related coral species had wavelengths corresponding to Cyan, Green, and Red[262]. The details of the evolution of fluorescent color in the GFP superfamily was not fully understand. That is, what fluorescent spectra did the common ancestors of the modern corals have?

Different models for reconstruction based on amino acids, codons and nucleotides resulted in reconstructed proteins differing in 4-8 amino acids out of 217. Gene synthesis utilized codons designed "to be degenerate in order to incorporate alternative predictions." [262] The reconstructed sequences included red, pre-red, Red/Green and ALL. The ancestral sequences revealed an interesting evolutionary history. The ancestor to all the coral fluorescent proteins had a single green peak at 505nm. Gradually, a shift towards greater expression of lower wavelengths developed, culminating in the pre-red protein. One branch of pre-red went on to lose the ancestral green peak and develop a strong peak around 580nm. Modern red features significantly less emission at outside of the primary peak, thereby increasing its specificity compared to recent ancestors such as pre-red and Red/Green. According to Ugalde et al., incremental gains in protein function "[have] been anticipated since Darwin, but has only recently been demonstrated in computer simulation experiments"


Inferring the Paleoenvironment of ancient Earth

Ancestral sequence reconstruction has been used to infer the environmental conditions on the early Earth[263]. The study sought to explore the evolutionary history of an elongation factor thermo unstable (EFtu). This elongation factor functions optimally at the temperature which the organism lives, for example thermophilic organisms have an EFtu optimized for high temperatures. By resurrecting the EFtu protein in the common ancestors of bacteria, the temperature profile might elucidate what the environmental conditions were. Interestingly the EFtu common ancestor to all mesophilic bacterium (~1BYA) has an optimal temperature of a thermophile. This suggests that the hypothesis of a hot early Earth is true.

Precambrian β lactamases

The structure of the last common ancestor to extant β lactamase has been reconstructed and tested using Ancestral Sequence Reconstruction[264]. Sequences for β lactamase from 75 bacterial strains, whose last CA lived 2-3 Gya, were compared. To exclude the effects of recent antibiotic driven evolution, only non-clinical varieties were chosen. Using Bayesian statistics, Risso et al. calculated the most probabilistic ancestral amino acid for each point in the sequence. Resistance to various β lactam based antibiotics was conferred to modern microbes engineered to produce the Precambrian enzyme. The extremely high promiscuity of the Precambrian enzyme compared to extant sequences indicates that over the past few billion years, β lactamases have evolved greater substrate specificity. The Precambrian enzyme featured a denaturation temperature of ~90°C, which is 35°C above that of the highest extant sequence.

Precambrian Thioredoxin

Perez-Jiminez et. al resurrected various Precambrian Thioredoxins. These belonged to the last common ancestors of the last bacterial common ancestor (LBCA), the last archaeal common ancestor (LACA) and the archaeal-eukaryotic common ancestor (AECA)"[261]. All of these sequences are believed to have been present over 3.5 Gya. The amino-acid sequence in each case was determined by maximum likelihood inference from 200 extant sequences. The genes to code for these proteins were synthesized and cloned into E. coli for expression and purification.

All three reconstructed proteins showed a Tm around 113°C, with a ΔTm between the highest extant Thioredoxin and the ancestors of around 25°C. The three paleo Thioredoxins also showed substantially greater activity at pH5 than representative extant enzymes from the each domain. The lower substrate specificity exhibited by the reconstructed enzymes indicates the abundance of sulfur rich compounds in the early oceans of Earth, and also hints at the generalist nature of archaic enzymes.

Ancestral Alcohol Dehydrogenase (Thomson)

Present-day yeasts use two homologs of alcohol dehydrogenases (Adhs): Adh1 converts acetaldehyde to ethanol and Adh2 oxidizes ethanol to acetaldehyde.

Although the pathways to accumulate and consume ethanol consume ATP, the outcome of being able to outcompete other microorganisms for carbohydrates in fruits seems to outweigh the cost in energy. Thomson "et. al" reconstructed the Adh1A sequence from Adh1 and Adh2 sequences from S. cerevisiae and their relatives through the maximum likelihood approach. Studies on Adh1A show that its functions and kinetic properties more closely resembled Adh1 and conclusions are made that ancestral yeasts only make and do not attempt to accumulate ethanol.

Steroid Hormone Receptors (Brigham)

Mineralocorticoid receptor (MR) is mostly activated by aldoesterone and sometimes cortisol, while the glucocorticoid recepot (GR) is specifically activated by cortisol. Brigham "et. al" computed the ancestral gene sequence AncCR from 59 steroid receptor sequences using ML, BMCMC, and maximum parsimony to study the ancestral corticoid receptor. The existence of MR and GR is caused by the gene duplication of AncCR more than 450 million years ago, and AncCR is found to be more functionally similar to MR because it is has high affinity for aldosterone. Two crucial mutations in AncCR sequence, S106P followed by L111Q, converts the receptor to have GR phenotype.

Interestingly, aldosterone molecules and AncCR did not co-exist because aldosterone molecules only emerged during the evolution of tetrapods. Therefore, AncCR and its descendant genes were structurally preadapted to a hormone that wasn't produced at the time.

HCH degrading genotype (Sangwan)

Hexachlorocyclohexane (HCH) is a pesticide and pollutant that have only be found to be degraded by two Sphingobium species, Sphingobium japonicum UT26 found in Japan and Sphingobium indicum B90A found in India. Both species have the lin genes that enable them to degrade HCH; particularly significant are the linA, linB and linC genes. ASR results produced an ancestral sequence that only have the genes lin D, E, R, F, G, and H which suggest that the lin gene components that are responsible for degrading HCH appeared through horizontal gene transfer.

Further Applications

Ancestral sequence reconstruction holds various benefits. The most direct benefit is a better understanding of archaic organisms and the world they inhabited. Numerous ancient protein reconstructions have yielded enzymes that are much more thermally stable than extant derivatives, even those found in thermophiles, leading to stronger evidence for extremely warm global conditions prior to 1 Gya[265]. Archaic proteins also allow for greater study of evolvability, as many reconstructed proteins feature high promiscuity[264]. The patterns in the evolution of high promiscuity Precambrian β lactamase, for instance, could help further the fight against the development of antibiotic resistance. High promiscuity is also key to directed protein evolution. In addition, the reconstruction of archaic enzymes serves as a sort of paleo-bioprospecting[261], and is currently the best approach to expand thermal stability and pH tolerance in enzymes[266]. The enzymes discovered through this form of prospecting and those developed through directed evolution of promiscuous enzymes could serve functional roles in human engineered systems.

References

  1. Thornton, J. W. (2004). Resurrecting ancient genes: experimental analysis of extinct molecules. Nature reviews. Genetics, 5(5), 366–75. doi:10.1038/nrg1324 [Thornton]
  2. Perez-Jimenez, R., Inglés-Prieto, A., Zhao, Z.-M., Sanchez-Romero, I., Alegre-Cebollada, J., Kosuri, P., Garcia-Manyes, S., et al. (2011). Single-molecule paleoenzymology probes the chemistry of resurrected enzymes. Nature structural & molecular biology, 18(5), 592–6. doi:10.1038/nsmb.2020 [Perez]
  3. Ugalde, J. a, Chang, B. S. W., & Matz, M. V. (2004). Evolution of coral pigments recreated. Science (New York, N.Y.), 305(5689), 1433. doi:10.1126/science.1099597 [Ugalde]
  4. [http://www.nature.com/nature/journal/v451/n7179/full/nature06510.html Gaucher, E. a, Govindarajan, S., & Ganesh, O. K. (2008).

    Palaeotemperature trend for Precambrian life inferred from resurrected proteins. Nature, 451(7179), 704–7. doi:10.1038/nature06510]

    [Gaucher1]
  5. Risso, V. a, Gavira, J. a, Mejia-Carmona, D. F., Gaucher, E. a, & Sanchez-Ruiz, J. M. (2013). Hyperstability and Substrate Promiscuity in Laboratory Resurrections of Precambrian β-Lactamases. Journal of the American Chemical Society. doi:10.1021/ja311630a [Risso]
  6. Gaucher, E. a, Thomson, J. M., Burgan, M. F., & Benner, S. a. (2003). Inferring the palaeoenvironment of ancient bacteria on the basis of resurrected proteins. Nature, 425(6955), 285–8. doi:10.1038/nature01977 [Gaucher2]
  7. Kaganman, I. (2011). Resurrected enzymes. Nature Methods, 8(6), 452–452. doi:10.1038/nmeth0611-452 [Kaganman]
  8. Pauling, L. and E. Zuckerkandl. 1963. Chemical paleogenetics: molecular restoration studies of extinct forms of life. Acta Chem. Scand 17:89. CrossRef doi=10.3891 [Pauling]