Fractionation Mutagenesis
What is fractionation mutagenesis?
Natural promoter bashing is a technique for developing testable hypothesis about the function of promoter regions or conserved promoter elements by taking advantage of fractionation mutagenesis using a combination of comparative genomics and comparative expression studies.
Whole genome duplications creates two copies of every gene in a genome, each copy with identical promoters containing identical regulatory elements. We already know that duplicate copies of many genes are lost following whole genome duplications, usually by short to medium sized deletions. Genomes of species like maize, where a whole genome duplication occurred 5-12 million years ago, still contain gene fragments that show evidence of "bites" taken out of them. (Should I show an example of this?)
Deletions are not confined to the coding regions of genes but can also remove regions of upstream regulatory sequence. (Show example).
Natural promoter bashing starts by identifying duplicate genes which show dissimilar patterns of expression with regards to some criteria a researcher is interested in. Perhaps one copy of the gene is expressed only in a certain cell type and the other is not. Or one gene is upregulated in response to a stimulus like drought stress and the other is not. Or one gene shows a change of expression in a mutant background and the other does not.
It is important that the difference observed is a difference in pattern of expression rather than absolute level of expression. Whole genome duplicates can show identical patterns of expression while being expressed at very different absolute levels. It is thought that these differences are mediated by chromatin environment rather than specific deletions/insertions in promoters of one gene or the other.
For this example we will use a duplicate pair of genes in maize which show a very different pattern of expression in pollen relative to other tissues. (Example)
Based on this pattern of expression we can hypothesize that gene X has lost a pollen specific enhancer or gene Y has lost a pollen specific repressor. Both of the models proposed above make assumption that a a regulatory sequence has been LOST from one gene rather than one gene gaining a new piece of regulatory DNA. While loss of function mutations should be much more common than gain of function ones, check out the "Gotchas to Look Out For" section below.
To track changes in the promoter sequence surrounding these two genes we can compare both genes to their shared sorghum ortholog. Sorghum diverged from maize around the same time as the maize whole genome duplication, so even functionless sequence should still show some detectable similarity between sorghum and maize (assuming it hasn't been deleted).
(Example).
As you can see (x,y,z). However, if we want to narrow down our candidate promoter regions even more, we can do that using At the time of a whole genome duplication, both copies of a gene possess identical regulatory sequences, and we therefore expect both duplicate genes to show comparable patterns of expression. When two duplicate genes from a whole genome duplication show dissimilar patterns of expression it seems reasonable to assume changes have occurred to their promoters.