Difference between revisions of "Syntenic dotplot"

From CoGepedia
Jump to: navigation, search
 
(56 intermediate revisions by 3 users not shown)
Line 1: Line 1:
[[Image:Dotplot.png|thumb|center|1000px|Syntenic dotplot between two '''Escherichia coli''' genomes:  (x-axis) B strain REL606 and (y-axis) strain K-12 substrain DH10BThis dotplot was generated using [[SynMap]] and can be regenerated [http://synteny.cnr.berkeley.edu/CoGe/SynMap.pl?dsgid1=7454;dsgid2=4241;D=20;g=10;A=5;w=0;b=1;ft1=1;ft2=1;dt=geneorder;autogo=1 here].The green line (collection of dots) represent regions of similarity along the two genomes. The arrows point towards the spots where insertion or deletion has occurred in genome of either strains. The numbers correspond to the descriptions in the table below that accounts for these gaps in the dotplots.]]  
+
[[Image:Dotplot.png|thumb|right|600px| Syntenic dotplot of E-coli B strain REL606(x-axis) and E-coli K12 strain DH10B (y-axis). The "green" line represents the regions of similarities between the two genomes while the discontinuities in this syntenic line (marked by numbered arrows) represent regions of genomic variations at a given locus between the two substrains of E-coli. Variations of this size (10s of kb) are usually the result of phage insertions, horizontal gene transfer events, deletions, and transposon activityMore information about this comparison can be found [[Analysis of variations found in genomes of Escherichia coli strain K12 DH10B and strain B REL606 using SynMap and GEvo analysis | here]].  More examples of bacterial syntenic dotplots and [[x-alignments]] can be found [[x-alignments | here]]. This dotplot can be regenerated [http://genomevolution.org/CoGe/SynMap.pl?dsgid1=7454;dsgid2=4241;D=20;g=10;A=5;w=0;b=1;ft1=1;ft2=1;dt=geneorder here].]]
  
   
+
[[Image:Master 6807 8082.CDS-CDS.blastn geneorder D40 g20 A10.w1200.gene.ks.png|thumb|right|600px|Syntenic dotplot with Ks coloration of sorghum (x-axis) versus maize (y-axis). Genes are used for axis metrics; black lines separate chromosomes in each genome. Results can be regenerated at: https://genomevolution.org/r/dfjy. Red syntenic lines are from the maize-specific [[whole genome duplication]] event and are orthologous to sorghum.  Purple are from the older pre-grass [[whole genome duplication]] event are are [[out-paralogs]].  More information about this analysis can be found [[Maize_Sorghum_Syntenic_dotplot | here]]. ]]
  
{| width="1000" cellspacing="1" cellpadding="1" border="1"
+
[[Image:Master 8154 8154.CDS-CDS.blastn.dag.go c4 D40 g20 A5.aligncoords.gcoords ct0.w2000.gene.ks.png|thumb|right|600px|Syntenic dotplot of poplar versus itself. Syntenic gene-pairs are colored by the [[synonymous mutation]] values. This reveals intragenomic synteny derived from a recent [[whole genome duplication]] event (dark blue) and the older [[eudicot paleohexaploidy]] event (green-cyan). This analysis can be regenerated at http://genomevolution.org/CoGe/SynMap.pl?dsgid1=8154;dsgid2=8154;c=4;D=40;g=20;A=5;Dm=;gm=;w=0;b=1;ft1=1;ft2=1;do1=1;do2=1;do=40;dt=geneorder;ks=1;am=g]]
|-
+
| Variation observed in B and K12<br>
+
| Description of variation<br>
+
| Link leading to GEvo <br>
+
|-
+
| 1. Deletion in B strain <br>
+
| DNA pol II, genes in metabolic pathway, thiamine ABC transporter present in K12 strain Transposase in K12 strain. Prophage CP46 DNA insertion in K12 strain. <br>
+
| http://tinyurl.com/yexrzpb<br>
+
|-
+
| 2. Insertions in K12 strain <br>
+
| Transposase in K12 strain. Prophage CP46 DNA insertion in K12 strain. <br>
+
| http://tinyurl.com/yd2quy7<br>
+
|-
+
| 3. Deletion in B strain or Insertion in K12 strain<br>
+
| Presence of lac operon and other metabolic enzymes genes in K12 strain. Pseudogene made by transposon insertion in K12.  
+
| http://tinyurl.com/yedn5tb<br>
+
|-
+
| 4. Insertion in B strain and DNA duplication event in K12. <br>
+
| Prophage DNA insertion in B strain. Recent DNA duplication event in K12 as seen by 100% identity between paralogs.
+
| http://tinyurl.com/yea8bu6<br>
+
|-
+
| 5. Insertion in B strain<br>
+
| Bacteriophage DNA insertion in B strain
+
| http://tinyurl.com/yevlb2w<br>
+
|-
+
| 6. Insertion in B strain<br>
+
| Prophage DNA insertion in B strain
+
| http://tinyurl.com/ybokuag<br>
+
|-
+
| 7. Insertion in K12 strain<br>
+
| Prophage DNA insertion in K12
+
| http://tinyurl.com/yaxlh7o<br>
+
|-
+
| 8. Insertion of transposon in K12 and B strain. Deletion of phenylacetic acid genes in B strain or insertion of those in K12. <br>
+
| Rac prophage DNA insertion in K12. Insertion of IS3 element protein and transposase in B strain. Insertion of IS30 transposon in B strain created pseudogene. Presence of phenylacetic acid degradation genes in K12
+
| http://tinyurl.com.ye7wcxe<br>
+
|-
+
| 9. Insertion in K12 strains. <br>Flagella synthesizing genes either deleted in B strain or inserted in K12. <br><br>
+
| Presence of flagella encoding/regulatory genes in K12. IS5 sequence insertion in K12. CP4-44 prophage DNA insertion in K12.
+
| http://tinyurl.com/ybh84x7
+
<br>
+
  
|-
+
[[Image:Master 8154 8154.CDS-CDS.blastn.dag.go c4 D40 g20 A5.aligncoords.gcoords ct0.w2000.gene.ks.hist.png|thumb|right|600px|Histogram of the [[synonymous mutation]] (Ks) values (log 10 transformed) of the syntenic gene pairs within poplar. Smaller values on left infers young gene pairs, and larger values on right infers older gene pairs. The two middle peaks are from poplar's recent whole genome duplication event (blue) and a more ancient [[eudicot paleohexaploidy]] event (green-cyan). The peak on the far right, with non-log10 transformed Ks values of 50-100 are noise in the analysis. Perhaps from the alignment of pseudogenes, mis-called syntenic gene pairs, and erroneous gene models.  These colors correspond to the colors used in the syntenic dotplot shown above.]]
| 10. Insertion in B strain. <br>Pili associated genes and metal resistance genes either inserted in K12 or deleted in B strain.<br><br>
+
| Bacteriophage DNA insertion in B strain. <br>IS1 sequence insertion in B strain. <br>Presence of pili associated genes in K12 together with genes conferring resistance to Co and Ni.<br><br>
+
| http://tinyurl.com/yammj7v<br>
+
|-
+
| 11. Insertion in K12 strain. ParB family protein and recombinase deleted in K12. <br>
+
| CP4-57 prophage DNA insertion in K12 strain. Presence of ParB family protein gene and recombinase in B strain<br>
+
| http://tinyurl.com/ycdog2c<br>
+
|-
+
| 12. Insertion in B and K12 strain. Inversion. Insertion of saframycin synthetase in B strain or deletion of that in K12. <br>
+
| Prophage DNA insertion in B strain. <br>Inversion of prepilin leader peptidase/methylase and M-type protein. Is5 insertion in K12. Presence of saframycin synthetase gene in B strain.  
+
<br>
+
  
| http://tinyurl.com/ycoagxh<br>
+
'''Syntenic dotplots''' are a type of scatter-plot. Each axis represents a sequence laid end-to-end, and each dot in the scatter-plot represents a putative [[homologous]] match between the two sequences. Often, these dotplots are used for whole genome comparisons within the same genome or across two genomes from different taxa in order to identify [[synteny]]. Synteny is defined as two or more genomic regions that are derived from a common ancestral genomic region. The evidence for synteny is the identification of a set of homologous genes in two genome that have a collinear arrangement. When such a pattern of gene-order conservation is discovered, the most parsimonious explanation is that the two regions are related through a common ancestor. While syntenic dotplots are useful for identifying related genomic regions, they are also useful for identifying genomic regions that have undergone an evolutionary change in one of the two genomes being compared. Example of such events are:
|-
+
*[[insertions]]
| 13. Insertion of transposon, ShiA-like, TrbC-like proteins in B strain. <br>
+
*[[horizontal gene transfers]]
| IS30 transposon insertion in B strain. Presence of ShiA-like and TrbC-like protein in B strain<br>
+
*[[deletions]]
| http://tinyurl.com/ydkrcv8<br>
+
*duplications
|-
+
*[[inversions]]
| 14. Insertion in B strain<br>
+
| Insertion of fimbriae synthesis genes, multi drug transporters, Is1 protein gene in B strain.<br>
+
CoGe's tool [[SynMap]] makes it easy to create a syntenic dotplot for any two genomes in CoGe.
| http://tinyurl.com/ycwsmsl<br>
+
|}
+
 
+
<br>
+

Latest revision as of 15:02, 24 July 2014

Syntenic dotplot of E-coli B strain REL606(x-axis) and E-coli K12 strain DH10B (y-axis). The "green" line represents the regions of similarities between the two genomes while the discontinuities in this syntenic line (marked by numbered arrows) represent regions of genomic variations at a given locus between the two substrains of E-coli. Variations of this size (10s of kb) are usually the result of phage insertions, horizontal gene transfer events, deletions, and transposon activity. More information about this comparison can be found here. More examples of bacterial syntenic dotplots and x-alignments can be found here. This dotplot can be regenerated here.
Syntenic dotplot with Ks coloration of sorghum (x-axis) versus maize (y-axis). Genes are used for axis metrics; black lines separate chromosomes in each genome. Results can be regenerated at: https://genomevolution.org/r/dfjy. Red syntenic lines are from the maize-specific whole genome duplication event and are orthologous to sorghum. Purple are from the older pre-grass whole genome duplication event are are out-paralogs. More information about this analysis can be found here.
Syntenic dotplot of poplar versus itself. Syntenic gene-pairs are colored by the synonymous mutation values. This reveals intragenomic synteny derived from a recent whole genome duplication event (dark blue) and the older eudicot paleohexaploidy event (green-cyan). This analysis can be regenerated at http://genomevolution.org/CoGe/SynMap.pl?dsgid1=8154;dsgid2=8154;c=4;D=40;g=20;A=5;Dm=;gm=;w=0;b=1;ft1=1;ft2=1;do1=1;do2=1;do=40;dt=geneorder;ks=1;am=g
Histogram of the synonymous mutation (Ks) values (log 10 transformed) of the syntenic gene pairs within poplar. Smaller values on left infers young gene pairs, and larger values on right infers older gene pairs. The two middle peaks are from poplar's recent whole genome duplication event (blue) and a more ancient eudicot paleohexaploidy event (green-cyan). The peak on the far right, with non-log10 transformed Ks values of 50-100 are noise in the analysis. Perhaps from the alignment of pseudogenes, mis-called syntenic gene pairs, and erroneous gene models. These colors correspond to the colors used in the syntenic dotplot shown above.

Syntenic dotplots are a type of scatter-plot. Each axis represents a sequence laid end-to-end, and each dot in the scatter-plot represents a putative homologous match between the two sequences. Often, these dotplots are used for whole genome comparisons within the same genome or across two genomes from different taxa in order to identify synteny. Synteny is defined as two or more genomic regions that are derived from a common ancestral genomic region. The evidence for synteny is the identification of a set of homologous genes in two genome that have a collinear arrangement. When such a pattern of gene-order conservation is discovered, the most parsimonious explanation is that the two regions are related through a common ancestor. While syntenic dotplots are useful for identifying related genomic regions, they are also useful for identifying genomic regions that have undergone an evolutionary change in one of the two genomes being compared. Example of such events are:

CoGe's tool SynMap makes it easy to create a syntenic dotplot for any two genomes in CoGe.