Conserved Non-Coding Sequence
Conserved noncoding sequences (CNS) are regions of the genome which do not code for proteins, yet show significantly slower rates of sequence change than truly nonfunctional sequences. Current evidence suggests that many or most conserved noncoding sequences are involved in regulating the expression of neighboring genes.[1]
CNS in plants tend to be much smaller than those found in animals.
An example of some five prime conserved noncoding sequences identified by comparing syntenic orthologs in the genomes of peach and chocolate using the CNS Discovery Pipeline. To regenerate this analysis click here . Another CNS example in the grasses.
Contents
Examples of regulatory CNS
Vgt1
Vgt1 (Vegetative to generative transition 1) was a quantitative trait locus identified based on studied of changes in flowering time within maize mapping populations. The quantitative trait was fine mapped to one a pair of conserved noncoding sequences which regulate a AP2-like gene called ZmRap2.7. In one allele of this QTL the CNS was disrupted by the insertion of a MITE transposon. These CNS are 70 KB upstream of the gene they regulate (ZmRap2.7).
Salvi, S. et al. (2007). Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize. Proceedings of the National Academy of Sciences 104: 11376 -11381.
Knotted1
Knotted1 is a gene involved in the regulation of meristem identity and is the tenth most studied gene in maize. Knockouts of knotted1 tend to be lethal so much of the early work characterizing this gene was conducted on dominant mutants which show ectopic expression of knotted one in leaves resulting in very cool phenotypes as leaf cells take on a somewhat meristematic identity.[2]
These dominant mutant phenotypes are caused by transposon insertions landing within a 310 bp region of the first intron of knotted1 [2]. This region, a negative regulated of knotted1 has been shown to contain a cluster of conserved noncoding sequences (identified by comparing knotted1 to its ortholog in rice.)[3]
Lateral Suppressor
Lateral Suppressor (LAS) is a tightly regulated gene expressed specifically at the adaxial boundary of newly initiating leaf primordia. In 2011 Bodo Raatz and co-workers reported that this very specific expression pattern was regulated by a 3' prime (downstream) enhancer/suppressor which induces expression of reporter genes within the usual expression domain of LAS and represses their expression in other contexts. The authors also showed the the equivalent sequence in tomato (an asterid) retains the same function and that this conserved noncoding sequence is present downstream of orthologs of LAS in both eudicots and grasses (monocots).
Raatz, B. et al. (2011). Specific expression of LATERAL SUPPRESSOR is controlled by an evolutionarily conserved 3′ enhancer. The Plant Journal 68: 400-412.
Identifying CNS
Different groups have developed different criteria for what constitutes a CNS. For our own research we define a CNS as a blast hit at least as significant as a 15 base pair exact match present at a syntenic location relative to pair of homeologous or orthologous genes. For these criteria to work, the two genomes being compared should have a modal synonymous substitution rate of between .5 and .99. In more closely related genomes sequences which are not functionally constrained can still retain significant sequence similarity (carry over), while in genomes with greater divergence, base pair substitutions often render even functional CNS undetectable.
In animals, where conserved noncoding elements tend to be larger and evolve at a (relative to plants) glacial rate, different parameters may apply.
To automate the process of identifying and filtering conserved noncoding sequences, the Freeling lab has developed the CNS Discovery Pipeline.
- ↑ Freeling, M. and Subramaniam, S. (2009). Conserved noncoding sequences (CNSs) in higher plants. Curr. Opin. Plant Biol 12: 126-132.
- ↑ 2.0 2.1 Greene, B. et al. (1994). Mutator Insertions in an Intron of the Maize knotted1 Gene Result in Dominant Suppressible Mutations. Genetics 138: 1275 -1285.
- ↑ Inada, D.C. et al. (2003). Conserved Noncoding Sequences in the Grasses. Genome Res 13: 2030 -2041.