Maize v1 v2: Difference between revisions

From CoGepedia
Jump to navigation Jump to search
No edit summary
No edit summary
Line 6: Line 6:
[[Image:GEvo analysis of maize B73 refgen v1 v2.png|thumb|600px|right|Figure 3.  GEvo analysis of 1MB of chromosome 3 from maize between refgen version 1 and 2. Version 1 has gene models and non-CDS sequences masked (purple). Note the sets of genes that have been reoriented in version 2. These show up where regions of sequence similarity (pink blocks) are drawn below the dashed line in both panels.  Results can be regenerated at http://genomevolution.org/r/4du]]
[[Image:GEvo analysis of maize B73 refgen v1 v2.png|thumb|600px|right|Figure 3.  GEvo analysis of 1MB of chromosome 3 from maize between refgen version 1 and 2. Version 1 has gene models and non-CDS sequences masked (purple). Note the sets of genes that have been reoriented in version 2. These show up where regions of sequence similarity (pink blocks) are drawn below the dashed line in both panels.  Results can be regenerated at http://genomevolution.org/r/4du]]


These analyses compare the genomic sequence assembled of maize B73 refgen versions 1 and 2.  Maize was sequenced bac by bac, and bacs were chosen that tile across all of maize's chromosomes.  This means that the relative order of most bacs was correctly determined between and within a chromosome.  However, the sequences within a bac were often unordered, and the position of contig sequences within a bac relative to one another is not necessarily correct.  Therefore version 1 of maize contained many localized misassemblies.  Version 2 of maize aimed to correct many of these errors.   
These analyses compare the genomic sequence assembled of maize B73 refgen versions 1 and 2.  [[Sequenced_plant_genomes#Maize.2FCorn | Maize was sequenced bac by bac]], and bacs were chosen that tile across all of maize's chromosomes.  This means that the relative order of most bacs was correctly determined between and within a chromosome.  However, the sequences within a bac were often unordered, and the position of contig sequences within a bac relative to one another is not necessarily correct.  Therefore version 1 of maize contained many localized misassemblies.  Version 2 of maize aimed to correct many of these errors.   


To determine the extent of these corrected errors, [[syntenic dotplots]] can be generated between two different versions of a genome.  [[SynMap]] makes these comparisons easy to perform and provides a variety of visualization options to help identify assembly differences.  Figure 1 shows a [[syntenic dotplot]] between maize genome assemblies refgen v1 and v2.  Each
Please note that at the time of these anlayses, no gene models or annotations were available for version 2 of the maize genome.
 
To determine the extent of these corrected errors, [[syntenic dotplots]] can be generated between two different versions of a genome.  [[SynMap]] makes these comparisons easy to perform and provides a variety of visualization options to help identify assembly differences.  Figure 1 shows a [[syntenic dotplot]] between maize genome assemblies refgen v1 and v2.  In this dotplot, syntenic regions are given a colored dot (which form lines when the density is high).  These dots are colored green and blue if they are in the same or opposite orientations respectively.  There are two sets of sytnenic lines in this dotplot.  The strong lines that mostly form continuous lines from the lower-left corner of a chromosome-v-chromosome grip to the upper-right corner, and several smaller regions with a lower density of dots.  The latter regions are from the most recent whole genome duplication event in maize (for additional information on this please see [[Maize_Sorghum_Syntenic_dotplot | the maize versus sorghum dotplot]] and [[Splitting_maize_genome | splitting the maize genome into its two ancestral genomes]].) 
 
This dotplot reveals that the overall structure of these two assemblies is highly similar (for an example of comparing genome assemblies with many more differences, please see [[Syntenic_dotplot_medicago_truncatula_version_3_versus_version_2 | medicago version 1 versus version 2]].)  There is a large obvious inverted region on chromosome 3 (close-up Fig 2), and several breaks in the syntenous line showing areas where sequence was added or removed from the assembly.  However, close examination shows many blue dots intermixed with green.  These point to regions where a small inversion was made between the two version of maize assemblies.  However, at this resolution, it is not possibly to identify small movements of assembled pieces.
 
High-resolution analysis of these regions can show the details of these inversion as well as changes in the arrangement of contigs.  Figure 3 uses GEvo to analyze a 1MB region of chromosome three.  Since maize contains many highly repetitive sequences, which will severely obfuscate the results of pair-wise sequence analyses

Revision as of 16:51, 3 May 2010

Figure 1. Maize B73 refgen version 1 (x-axis) and version 2 (y-axis). Version 1 has gene models and version 2 is using only genomic sequence. Syntenic pairs (dots) are colored green and blue if in the same or opposite orientation respectively. Analysis can be regenerated at http://genomevolution.org/r/4dq


Figure 2. Syntenic dotplot of maize B73 chromosome 3 between refgen version 1 and version 2. Syntenic gene-pairs (dots) are colored green and blue if in the same or opposite orientation respectively. Note the large inversion near the middle (blue line) and many smaller inversion (blue dots). Results can be regenerated by visiting the master dotplot (http://genomevolution.org/r/4dq) and clicking on the chromosome 3 versus chromosome 3 comparison.
Figure 3. GEvo analysis of 1MB of chromosome 3 from maize between refgen version 1 and 2. Version 1 has gene models and non-CDS sequences masked (purple). Note the sets of genes that have been reoriented in version 2. These show up where regions of sequence similarity (pink blocks) are drawn below the dashed line in both panels. Results can be regenerated at http://genomevolution.org/r/4du

These analyses compare the genomic sequence assembled of maize B73 refgen versions 1 and 2. Maize was sequenced bac by bac, and bacs were chosen that tile across all of maize's chromosomes. This means that the relative order of most bacs was correctly determined between and within a chromosome. However, the sequences within a bac were often unordered, and the position of contig sequences within a bac relative to one another is not necessarily correct. Therefore version 1 of maize contained many localized misassemblies. Version 2 of maize aimed to correct many of these errors.

Please note that at the time of these anlayses, no gene models or annotations were available for version 2 of the maize genome.

To determine the extent of these corrected errors, syntenic dotplots can be generated between two different versions of a genome. SynMap makes these comparisons easy to perform and provides a variety of visualization options to help identify assembly differences. Figure 1 shows a syntenic dotplot between maize genome assemblies refgen v1 and v2. In this dotplot, syntenic regions are given a colored dot (which form lines when the density is high). These dots are colored green and blue if they are in the same or opposite orientations respectively. There are two sets of sytnenic lines in this dotplot. The strong lines that mostly form continuous lines from the lower-left corner of a chromosome-v-chromosome grip to the upper-right corner, and several smaller regions with a lower density of dots. The latter regions are from the most recent whole genome duplication event in maize (for additional information on this please see the maize versus sorghum dotplot and splitting the maize genome into its two ancestral genomes.)

This dotplot reveals that the overall structure of these two assemblies is highly similar (for an example of comparing genome assemblies with many more differences, please see medicago version 1 versus version 2.) There is a large obvious inverted region on chromosome 3 (close-up Fig 2), and several breaks in the syntenous line showing areas where sequence was added or removed from the assembly. However, close examination shows many blue dots intermixed with green. These point to regions where a small inversion was made between the two version of maize assemblies. However, at this resolution, it is not possibly to identify small movements of assembled pieces.

High-resolution analysis of these regions can show the details of these inversion as well as changes in the arrangement of contigs. Figure 3 uses GEvo to analyze a 1MB region of chromosome three. Since maize contains many highly repetitive sequences, which will severely obfuscate the results of pair-wise sequence analyses