Ramosa2 orthologs and CNSs
A CoGe Walkthrough – ramosa2 ortholog identification and CNS analysis
In this exercise you will use genomic synteny to identify putative ramosa2 orthologs across several cereal genomes. In addition you will identify Conserved Non-Coding Sequences (CNSs) in the ramosa2 promoter region.
First we need to get some ra2 sequence to use in CoGe Blast. Search the NCBI nucleotide database for: NM_001138446. Copy the Fasta sequence of the mRNA.
Now go to the website for CoGe: http://synteny.cnr.berkeley.edu/CoGe/ (you can just google CoGe and you’ll see it a few pages down). Click on “CoGe Blast” and paste your sequence in the “Sequences field”. It is not necessary to have a Fasta header but it helps keep track of things.
Now we need to decide which organisms we will Blast. Only (mostly) completed genome sequences are available from this window (more on how to add other sequences later on). In the “Organism Name” field search for Sorghum, Rice, Zea mays, and Brachypodium and add them in turn by hitting the “+add” button. I find that it is best to add both the pseudomolecule maize sequence as well as the sequenced BACs as shown in the picture below. Make sure you don’t accidently add the chloroplast genomes instead of the nuclear!
When you’re ready, hit “Run CoGe Blast” to start your analysis.
Note, this is just like Blast, only you are blasting several whole genome sequences at the same time. You can enter protein sequences, change the Blast algorithm and adjust parameters just as you would in any Blast analysis.
Your results will show both the genomic locations and a table of the “blast hits” or high scoring pairs (HSPs). The HSPs are initially ordered by organism, but you can order them by any of the criteria at the top. To sort HSPs by multiple criteria simply hit one criteria then hold shift while selecting another. I often sort by HSP# then organism. This will give you the top hits in all blasted genomes. You can evaluate the results much like any blast analysis. At this point we will pick which genomic locations to examine for gene synteny. Select the top hit from each genome.