Difference between revisions of "Plant paleopolyploidy"

From CoGepedia
Jump to: navigation, search
(17 Banana Beta)
(17 Banana Beta)
Line 92: Line 92:
 
===17 Banana Beta===
 
===17 Banana Beta===
  
The publication of the banana genome reported two whole genome duplications with roughly similar rates of divergence indicating that the two events occurred at roughly the same time. This is the older of the two events and is dated to approximately 65 million years ago.
+
The publication of the banana genome reported two whole genome duplications with roughly similar rates of divergence indicating that the two events occurred at roughly the same time<ref name="banana">D'Hont A et al (2012) "The banana (Musa acuminata) genome and the evolution of monocotyledonous plants." ''Nature'' DOI:[http://dx.doi.org/10.1038/nature11241 10.1038/nature11241]</ref>. This is the older of the two events and is dated to approximately 65 million years ago.
  
 
===18 Banana Alpha===
 
===18 Banana Alpha===

Revision as of 17:49, 4 September 2012

Identifying and characterizing plant paleopolyploidies is ongoing research. They are identified through whole genome comparisons using a combination of the data derived from genomic structure (e.g. syntenic dotplots) and evolutionary distances (e.g. synonymous mutation rates). As such, detecting these events and determining which lineages share what subset are continually changing. The images presented here represent our views right now, but are subject to change. Events that were previously undetected or missed can suddenly be seen with an improved build of a genome or the sequencing of a fortuitously placed outgroup.

This webpage in maintained by James Schnable, a member of the Freeling Lab and the CoGe development team. As new genomes become available, and previous genomes are updated, I will continue to improve these figures. If you know of a new whole genome duplication not listed here, a paper that should be credited but isn't, or interesting information about any specific whole genome duplication that you think should be included in the summaries please don't hesitate to contact me so I can get the information up here.


Plant Phylogenetic Tree With WGD Marked

Phylogenetic tree of plant species with sequenced genomes, with ancient whole genome duplications marked. Branch lengths are not proportional to anything.

Discussion of individual tetraploidies

1 Eudicot Hexaploidy

Synonyms: Arabidopsis Gamma

This hexaploidy (genome tripling) is shared by the core eudicots (the rosids and asterids), and may be present in additional, basal eudicots, although it will not be possible to reach this conclusion until the genomes of species from basal lineages are sequences -- AND WELL ASSEMBLED!

This whole genome duplication was first identified as the most ancient of three whole genome duplication present in the genome of Arabidopsis thaliana, and assigned the name "gamma" by in 2003 by Bowers et al. [1]:

In this paper, hampered by the fact that only two plant genomes had yet been sequenced, and the ridiculously accelerated rate of base pair substitution in arabidopsis the authors concluded the gamma event was likely shared by both monocots and eudicots and could potentially be as old as the split between gymnosperms and flowering plants 300 million years ago.

With the publication of the grape genome in 2007[2] which has not experienced any duplications since the eudicot hexaploidy and doesn't show the same acceleration of nucleotide substitutions, it became possible to conclude that the eudicot hexaploidy was NOT shared with monocots and was shared by all rosids.

More recent work in the asterids[3] indicate that this highly successful eudicot clade share the same ancient whole genome duplication.

2 Arabidopsis alpha

The alpha tetraploidy of arabidopsis was first given that name in Bowers et al 2003[1]. It is shared by most or all of the crucifers (family Brassicaceae).

3 Arabidopsis beta

As of yet not a single lineage has been identified in which the beta tetraploidy (naming conventions from Bowers et al 2003[1]) is the most recent whole genome duplication. Despite what the image above might indicate, this duplication is significantly older than arabidopsis alpha, however precise dating is difficult given the acceleration of synonymous substitution rate in the arabidopsis lineage.

4 Brassica hexaploidy

The hexaploidy shared by all species in the genus Brassica is well known. So well known, I don't know what the proper citation for the discovery of this whole genome duplication is. If you know, please get in touch so the scientists responsible for this discovery -- however long ago -- can get proper credit.

5 Poplar tetraploidy

When the genome of poplar was released back in 2006, researchers announced that they had identified a new ancient whole genome duplication[4]. Poplar retains around 8000 pairs of duplicated genes.

7 Apple tetraploidy

The genome paper of apple also discussed an ancient whole genome duplication identified in that linage which they estimated to be >50 million years old [5]

8 Soybean tetraploidy

The relatively recent whole genome duplication in soybean was long suspected based a number of different forms of analysis, from analysis of Ks peaks[6], phylogenies of individual gene families[7] and analyzing the fractionation of individual sequenced regions[8]. As expected, when the genome of soybean was published in 2010, researchers did identify a recent whole genome duplication (peak ks=0.13, estimated age 13 million years ago).

9 Legume tetraploidy

The whole genome duplication shared by all legumes was initially inferred by comparison of the medicago and lotus genome assemblies to data from peanut (Arachis hypogaea) a basal lineage of that clade[9]

10 Columbine tetraploidy

11 Flowering plant tetraploidy

An analysis of conserved orthologous gene groups (COGs) and huge numbers of ESTs identified evidence of two ancient whole genome duplications shared by both monocots and eudicots.[10] The more recent of the two, placed at 192 million years ago, occurred after the split of gymnosperms (non-flowering seed plants) but is shared by all extant flowering plant species including Amborella trichopoda.

12 Seed Plant Tetraploidy

An analysis of conserved orthologous gene groups (COGs) and huge numbers of ESTs identified evidence of two ancient whole genome duplications shared by both monocots and eudicots.[10] The more ancient of the two events is shared by all flowering plants as well as gymnosperms, but after the divergence from Selaginella, a basal vascular plant.

13 Maize Tetraploidy

The suspicion that maize is an ancient polyploid can be traced back through at least a generation of maize geneticists, and finds its earlier roots in the large number of duplicate mutant loci found in maize, sometimes found in parallel orders along different chromosomes. Perhaps the most famous of these are the pairs of duplicate regulators of anthocyanin biosynthesis: aleurone1 and Purple plant1.

Brandon Gaut and John Doebley concluded maize was an allopolyploid back in 1997.[11] While whether maize is an allo- or auto- polyploid has been argued back and forth over the years, the polyploidy question was settled more than a decade before the publication of the first draft of the maize genome.

The two subgenomes of maize are estimated to have diverged ~12 million years ago[12]. If maize is an autopolyploid, the two genomes also merged into a single genome 12 million years ago, but if maize is an allopolyploid the two genomes could have evolved as separate species for several million years before the wide cross that created the polyploid ancestor of modern maize. In either case, the two ancestral genomes of maize have been contained in the same nucleus for at least five million years[12]. By comparing the organization of the maize genome to other grass species it is possible to reconstruct the ten pairs of homeologous chromosomes present in that first polyploid maize ancestor [13]. By measuring biased gene loss (fractionation) and biased expression of duplicate pairs, it is possible to assign one copy of each ancestral chromosome pair in maize to a parental subgenome, either maize1 or maize2[14]

14 Grass Tetraploidy

Synonyms: Rho

All grass species sequenced to date share a common whole genome duplication. This duplication is estimated to have occurred between 70-90 million years ago, and homeologs from this duplication have a modal synonymous substitution rate ~0.9.[15]

While the fact that a significant portion of the rice is covered by duplicate syntenic region has been known for many years, the reason for this duplication was disputed, ranging from multiple segmental duplications to one -- or more -- whole genome duplications.

There were two reasons for this confusion.

  • Only 65.7% of the rice genome is covered by syntenic duplicate regions.[16]
  • The duplicate homeologous region located on chromosomes 11 and 12 of rice and 5 and 7 of sorghum have continued to experience gene conversion, so this pair duplicate regions appears much younger than the rest of the duplicate regions in grass genomes.[17]

15 Monocot Tetraploidy A

Synonyms: Sigma

By using the genomes of both rice and sorghum to reconstruct a gene order present in duplicate segments prior to the shared tetraploidy in all grasses, Tang and coworkers were able to identify an even more ancient whole genome duplication in the monocot lineage. [18] They estimated age of 130 million years and a median synonymous substitution rate between gene pairs of ~1.7 for this duplication. However, the authors caution that these estimates could be very off as synonymous substitution rates of these gene pairs are close to saturation and there are potentially confounding effects from mutation rate variations between multiple lineages.

16 Monocot Tetraploidy B

By comparing sigma duplicate regions in the grasses to the grape genome, the same research group as above[18] found that at least in some cases eight sigma regions showed detectable synteny to a region of the grape genome. This suggests there were in fact two whole genome duplications in the monocot lineage following the monocot-eudicot split but before the pre-grass duplication shared by all grasses.

17 Banana Beta

The publication of the banana genome reported two whole genome duplications with roughly similar rates of divergence indicating that the two events occurred at roughly the same time[19]. This is the older of the two events and is dated to approximately 65 million years ago.

18 Banana Alpha

19 Cotton WGD

20 Solanum hexaploidy

References

  1. 1.0 1.1 1.2 Bowers JE et al (2003) "Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events." Nature DOI: 10.1038/nature01521
  2. Jaillon O et al (2007) "The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla." Nature DOI: 10.1038/nature06148
  3. Cenci A et al (2010) "Comparative sequence analyses indicate that Coffea (Asterids) and Vitis (Rosids) derive from the same paleo-hexaploid ancestral genome." Molecular Genetics and Genomics DOI: 10.1007/s00438-010-0534-7
  4. Tuskan GA et al (2006) "The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray)" Science DOI: 10.1126/science.1128691
  5. Valasco R et al (2010) "The genome of the domesticated apple (Malus × domestica Borkh.)" Nature Genetics DOI: 10.1038/ng.654
  6. Schlueter JA et al (2004) "Mining EST databases to resolve evolutionary events in major crop species." Genome DOI: 10.1139/g04-047
  7. Pfeil BE et al (2005) "Placing paleopolyploidy in relation to taxon divergence: a phylogenetic analysis in legumes using 39 gene families." Systematic Biology DOI: 10.1080/10635150590945359
  8. Schlueter JA et al (2008) "ractionation of synteny in a genomic region containing tandemly duplicated genes across Glycine max, Medicago truncatula, and Arabidopsis thaliana." Journal of Heredity DOI: 10.1093/jhered/esn010
  9. Bertioli DJ et al (2009) "An analysis of synteny of Arachis with Lotus and Medicago sheds new light on the structure, stability and evolution of legume genomes" BMC Genomics DOI:10.1186/1471-2164-10-45.
  10. 10.0 10.1 Jiao Y et al (2011) Ancestral polyploidy in seed plants and angiosperms." Nature DOI: 10.1038/nature09916
  11. Gaut BS (1997) "DNA sequence evidence for the segmental allotetraploid origin of maize." Proceedings of the National Academy of Sciences DOI: NA Link.
  12. 12.0 12.1 Swigonova Z et al (2004) "Close Split of Sorghum and Maize Genome Progenitors." Genome Research DOI: 10.1101/gr.2332504
  13. Wei F et al (2007) "Physical and Genetic Structure of the Maize Genome Reflects Its Complex Evolutionary History." PLoS Genetics DOI: 10.1371/journal.pgen.0030123
  14. Schnable JC et al (2011) "Differentiation of the maize subgenomes by genome dominance and both ancient and ongoing gene loss." Proceedings of the National Academy of Sciences DOI: 10.1073/pnas.1101368108
  15. Paterson AH et al (2004) "Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics." Proceedings of the National Academy of Sciences DOI: 10.1073/pnas.0307901101
  16. Yu J et al (2005) "The Genomes of Oryza sativa: A History of Duplications." PLoS Biology DOI: 10.1371/journal.pbio.0030038
  17. Wang X et al (2011) "Seventy Million Years of Concerted Evolution of a Homoeologous Chromosome Pair, in Parallel, in Major Poaceae Lineages." The Plant Cell DOI: 10.1105/tpc.110.080622
  18. 18.0 18.1 Tang, H et al (2010) "Angiosperm genome comparisons reveal early polyploidy in the monocot lineage." Proceedings of the National Academy of Science DOI:10.1073/pnas.0908007107
  19. D'Hont A et al (2012) "The banana (Musa acuminata) genome and the evolution of monocotyledonous plants." Nature DOI:10.1038/nature11241