FractBias
Background
Whole genome duplications (WGDs) and genome fractionation are covered more thoroughly in other CoGepedia entries. In short, WGDs create two or more copies of a genome: which are referred to as subgenomes. The duplicate subgenomes then undergo gene loss in a process called fractionation which is part of returning to a diploid state, diploidization. All things being equal, one may assume that fractionation would occur randomly across the redundant genes created after a WGD, however bias towards gene loss on one genome, called fractionation bias, has been observed in several species including: maize [1], Brassica rapa [2], .
Overview
What goes in
- Two assembled genomes that have annotated coding sequences (CDS)
- A syntenic ratio set by the user (identified by empiric tests outside of the FractBias tool)
- The full GFF of the target
What comes out
FractBias Methods
data:image/s3,"s3://crabby-images/345e7/345e738ee44c14bb5959674131c0f0d758178d1a" alt=""
data:image/s3,"s3://crabby-images/11900/11900d025fa7ad59d7afd094d90bfaa5f4567a1b" alt=""
FractBias is a tool used to assess fractionation bias between genomes. This is accomplished by:
User Input
- Select two genomes to compare in the SynMap tool.
- Select the SynMap 'Syntenic Depth' option under 'Analysis Options.'
- Set syntenic depth ratio between genomes (determined by empirically outside of this tool).
FractBias tool analysis
- The coordinates for syntenic regions between the genomes are determined by the SynMap tool
- The syntenic genes are then parsed according to the 'target' and 'query' genomes. The genome with the lower syntenic depth ratio is set as the target genome; the genome with the higher ratio is set as the query genome.
- A list of genes present on every target genome chromosome is made and ordered according to start site (bp) in the annotation (gff/gtf file).
- The FractBias tool then goes through the list of each target genome gene, and determines if it has a syntenic pair on one (or more) of the query chromosomes.
- Finally, the FractBias tool runs a sliding window analysis to calculate how many genes are retained for each query chromosome.
- A figure is generated that contains a subplot for every target genome chromosome
- The x-axis: target genome gene order number in sliding window according to order of start site in genome annotation (gff/gtf).
- The y-axis:
Example Output
data:image/s3,"s3://crabby-images/a261b/a261ba2399c52ebeb8c6247f939611be7aea0ca3" alt=""
data:image/s3,"s3://crabby-images/052e0/052e0a6bee3d9e654f79505542ec10b2493e9f27" alt=""
All Genes Example Data Table | |||||
---|---|---|---|---|---|
Window Iteration | X-axis Value | Genes counted in Subgenome 1 | Y-axis Value Sub 1 | Genes counted in Subgenome 2 | Y-axis Value Sub 2 |
1 | 1 | 1,2,3,4 | 100 | 1,2,3,4 | 50 |
2 | 2 | 2,3,4,5 | 75 | 2,3,4,5 | 25 |
3 | 3 | 3,4,5,6 | 75 | 3,4,5,6 | 50 |
4 | 4 | 4,5,6,8 | 75 | 4,5,6,8 | 50 |
data:image/s3,"s3://crabby-images/3a0f0/3a0f024b408d22cf6d3e9c2c247256594b5b612c" alt=""
Only Retained Genes Example Data Table | ||||||
---|---|---|---|---|---|---|
1 | ||||||
2 | ||||||
3 | ||||||
4 | ||||||
5 | ||||||
6 |
Biological Examples
Sorghum and Maize Fractionation Bias
Arabidopsis and Brassica Fractionation Bias
Esox and Oncorhynchus Fractionation Bias
References
- ↑ Schnable, J.C. et al. Dose–sensitivity, conserved non-coding sequences, and duplicate gene retention through multiple tetraploidies in the grasses. Front. Plant Sci. http://dx.doi.org/10.3389/fpls.2011.00002 (2011)
- ↑ Cheng, F. et al. Biased gene fractionation and dominant gene expression among the subgenomes of Brassica rapa. PLOS ONE DOI: 10.1371/journal.pone.0036442 (2012)