CNS Discovery Pipeline

From CoGepedia
Revision as of 21:18, 30 March 2012 by Jschnable (talk | contribs)
Jump to navigation Jump to search

 The CNS Discovery Pipeline is a suite of software developed within the Freeling lab to identify orthologs or homeologs and compare the regions surrounding the genes to identify regions of noncoding DNA which have retained greater than expected sequence similarity, indicating the sequences (conserved noncoding sequences or CNS) are under functional constraint.

The CNS Discovery Pipeline was originally created by Brent Pedersen. His version of the software is avaliable on github.

Ongoing development of the CNS Discovery Pipeline in now handled by Gina Turco. Download the latest version of the source code here

Example

The above example compares the coding sequence surrounding two homeologs from [Plant paleopolyploidy#2 Arabidopsis alpha|the Arabidopsis alpha tetraploidy] AT1G03170 and AT4G02810. Manually identified CNS based on blast searchers are marked below the dotted line in teal and CNS identified above the dotted line are drawn above the dotted line in purple. This example analysis can be regenerated using this link: http://genomevolution.org/r/4bpo

Publications Utilizing the CNS Discovery Pipeline

May be out of date.

Public CNS Datasets

Rice-Sorghum orthologous CNS

These CNS were identified by comparing syntenic orthologs in rice (TIGR5) And Sorghum (v1.4) using version 2.0 of the CNS Discovery Pipeline. The citation for this dataset is Schnable JC et al 2011, above.