Alligator v. Chicken: Difference between revisions
No edit summary |
No edit summary |
||
(7 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
[[File:Alligator.png]] vs. [[File:Chicken.png]] | [[File:Alligator.png]] vs. [[File:Chicken.png]] | ||
==Syntenic Path assembly of coding sequences== | |||
[[File:Screen shot 2012-10-03 at 8.38.05 AM.png|thumb|none|800px|[[SynMap]] analysis of Alligator (x-axis) to Chicken (y-axis). Alligator contigs ordered and oriented according to their [[Syntenic Path]] to chicken chromosomes, and syntenic gene pairs are colored by their synonymous mutation rates. Note that nearly the entire chicken genome is covered by alligator contigs, but there are many alligator contigs that are not syntenically mapped to chicken. These probably contain few genes to extract a syntenic signal (minimum of three genes required by this analyis) or are made up of only non-genic (repetitive) sequences.]] | [[File:Screen shot 2012-10-03 at 8.38.05 AM.png|thumb|none|800px|[[SynMap]] analysis of Alligator (x-axis) to Chicken (y-axis). Alligator contigs ordered and oriented according to their [[Syntenic Path]] to chicken chromosomes, and syntenic gene pairs are colored by their synonymous mutation rates. Note that nearly the entire chicken genome is covered by alligator contigs, but there are many alligator contigs that are not syntenically mapped to chicken. These probably contain few genes to extract a syntenic signal (minimum of three genes required by this analyis) or are made up of only non-genic (repetitive) sequences. Also, the regions of secondary synteny (not on the 45-degree line) are probably due to most recent paleo-tetraploidy event in the vertebrate lineage (1R or 2R). Results may be regenerated at: http://genomevolution.org/r/5ec3]] | ||
[[File:Master 19558 2731.CDS-CDS.last.dag.all.go D20 g10 A3.aligncoords.gcoords ct0.w1000.spa1.ks.sr.cs1.csoS.log.nsd.hist.png|thumb|400px|none|Ks value histogram. Ks values have been log transformed and colors in histogram correspond to colored dots in dotplot. Note that the red-orange peak on the right represets noise in the analysis for incorrectly called syntenic gene pairs, pseudogenes, bad gene models, etc.]] | [[File:Master 19558 2731.CDS-CDS.last.dag.all.go D20 g10 A3.aligncoords.gcoords ct0.w1000.spa1.ks.sr.cs1.csoS.log.nsd.hist.png|thumb|400px|none|Ks value histogram. Ks values have been log transformed and colors in histogram correspond to colored dots in dotplot. Note that the red-orange peak on the right represets noise in the analysis for incorrectly called syntenic gene pairs, pseudogenes, bad gene models, etc.]] | ||
---- | ---- | ||
==Ortholog data sets== | |||
Links to data are in the "Links and Downloads" section of the results. Links of interest: | Links to data are in the "Links and Downloads" section of the results. Links of interest: | ||
# SynMap syntenic gene pairs: "Final syntenic gene-set output with GEvo links" | # SynMap syntenic gene pairs: "Final syntenic gene-set output with GEvo links" | ||
## Information on parsing the file: [http://genomevolution.org/wiki/index.php?title=SynMap#Results] | ## Information on parsing the file: [http://genomevolution.org/wiki/index.php?title=SynMap#Results] | ||
# Mapping of alligator contigs to chicken: "Syntenic Path Assembly mapping" | # Mapping of alligator contigs to chicken: "Syntenic Path Assembly mapping" | ||
Note: There is the option to screen syntenic regions using [[syntenic depth]], but this option is not compatible with the [[syntenic path assembly]]. Screening with a 1:1 syntenic depth will remove most of the older syntenic regions derived from the most recent shared tetraploidy event in these lineages (useful for identifying syntenic orthologous genes. However, you won't produce as pretty of a graphic as when you use the syntenic path assembly. | |||
#Syntenic depth 1:1 : http://genomevolution.org/r/5ed6 | |||
[[SynFind]] is another program that identifies syntenic genes using a different algorithm and procedure than SynMap. One difference is the ability to identify syntenic regions where a given gene is missing. Using this on a genome like alligator, with many small contigs, produces some interesting results. Namely, SynFind will identify many alligator contigs that are syntenic to a given chicken genomic region. | |||
# Master syntenic table between chicken and alligator: http://genomevolution.org/CoGe/SynFind.pl?fid=21867937;qdsgid=2731;dsgid=19527;ws=20;co=3;sf=1;algo=last;get_master=1 | |||
## Information on parsing this table: http://genomevolution.org/wiki/index.php/SynFind#Master_syntenic_pairs_table | |||
# SynFind analysis: http://genomevolution.org/r/5eea | |||
---- | |||
==[[Pseudoassembly]] of alligator genome to chicken== | |||
Link to generate an [[SPA]] of alligators genomics sequence to chicken: http://genomevolution.org/r/5utq | |||
By not using alligator gene models and using chicken gene models, more alligator contigs should be assembled against chicken: | |||
*more hits will be counted in alligator on an exon-by-exon basis instead of gene-by-gene (may capture smaller alligator contigs) | |||
*Missing gene models in Alligator do not create false negatives | |||
*Should capture exons at alligator contigs boundaries that were not annotated (and join alligator contigs broken in a gene. | |||
Download syntenic mapping of alligator contigs to chicken: [http://genomevolution.org/CoGe/data/diags//Alligator_mississippiensis_American_alligator/Gallus_gallus_red_jungle_fowl_strain_inbred_line_UCD001/html/master_19558_2731.genomic-CDS.last.dag.all.go_D20_g10_A3.aligncoords.gcoords_ct0.w1000.spa1.sr.cs1.csoS.log.nsd.spa_info.txt Download here!] | |||
Download pseudoassembled genomic sequence with AGP file: [http://data.iplantcollaborative.org/quickshare/2fab1573014d5df9/2731-19558.tar.gz Download here!] |
Latest revision as of 14:55, 29 October 2012
Syntenic Path assembly of coding sequences


Ortholog data sets
Links to data are in the "Links and Downloads" section of the results. Links of interest:
- SynMap syntenic gene pairs: "Final syntenic gene-set output with GEvo links"
- Information on parsing the file: [1]
- Mapping of alligator contigs to chicken: "Syntenic Path Assembly mapping"
Note: There is the option to screen syntenic regions using syntenic depth, but this option is not compatible with the syntenic path assembly. Screening with a 1:1 syntenic depth will remove most of the older syntenic regions derived from the most recent shared tetraploidy event in these lineages (useful for identifying syntenic orthologous genes. However, you won't produce as pretty of a graphic as when you use the syntenic path assembly.
- Syntenic depth 1:1 : http://genomevolution.org/r/5ed6
SynFind is another program that identifies syntenic genes using a different algorithm and procedure than SynMap. One difference is the ability to identify syntenic regions where a given gene is missing. Using this on a genome like alligator, with many small contigs, produces some interesting results. Namely, SynFind will identify many alligator contigs that are syntenic to a given chicken genomic region.
- Master syntenic table between chicken and alligator: http://genomevolution.org/CoGe/SynFind.pl?fid=21867937;qdsgid=2731;dsgid=19527;ws=20;co=3;sf=1;algo=last;get_master=1
- Information on parsing this table: http://genomevolution.org/wiki/index.php/SynFind#Master_syntenic_pairs_table
- SynFind analysis: http://genomevolution.org/r/5eea
Pseudoassembly of alligator genome to chicken
Link to generate an SPA of alligators genomics sequence to chicken: http://genomevolution.org/r/5utq
By not using alligator gene models and using chicken gene models, more alligator contigs should be assembled against chicken:
- more hits will be counted in alligator on an exon-by-exon basis instead of gene-by-gene (may capture smaller alligator contigs)
- Missing gene models in Alligator do not create false negatives
- Should capture exons at alligator contigs boundaries that were not annotated (and join alligator contigs broken in a gene.
Download syntenic mapping of alligator contigs to chicken: Download here!
Download pseudoassembled genomic sequence with AGP file: Download here!