Syntenic dotplot

From CoGepedia
Revision as of 11:05, 18 February 2010 by Elyons (Talk | contribs)

Jump to: navigation, search

Syntenic dotplots are a type of scatter-plot. Each axis represents a sequence laid end-to-end, and each dot in the scatter-plot represents a putative homologous match between the two sequences. Often, these dotplots are used for whole genome comparisons within the same genome or across two genomes from different taxa in order to identify synteny. Synteny is defined as two or more genomic regions that are derived from a common ancestral genomic region. The evidence for synteny is the identification of a set of homologous genes in two genome that have a collinear arrangement. When such a pattern of gene-order conservation is discovered, the most parsimonious explanation is that the two regions are related through a common ancestor. While syntenic dotplots are useful for identifying related genomic regions, they are also useful for identifying genomic regions that have undergone an evolutionary change in one of the two genomes being compared. For example, insertions, deletions, duplications, and inversions are readily identifiable from these plots.

CoGe's tool SynMap makes it easy to create a syntenic dotplot for any two genomes in CoGe.

Syntenic dotplot of E-coli B strain REL606(x-axis) and E-coli K12 strain DH10B (y-axis). The "green" line represents the regions of similarities between the two genomes while the discontinuities in this syntenic line represent regions of genomic variations at a given locus between the two substrains of E-coli. Variations of this size (10s of kb) are usually the result of phage insertions, horizontal gene transfers, deletions, and transposon activity. More information about this comparison can be found here. This dotplot can be regenerated here.
Syntenic dotplot with Ks coloration of sorghum (x-axis) versus maize (y-axis). Genes are used for axis metrics; black lines separate chromosomes in each genome. Results can be regenerated at: http://tinyurl.com/y9e778s. Red syntenic lines are from the maize-specific whole genome duplication event and are orthologous to sorghum. Purple are from the older pre-grass whole genome duplication event are are out-paralogs. More information about this analysis can be found here.