Difference between revisions of "GEvo"

From CoGepedia

Revision as of 22:20, 31 January 2010

GEvo - Genome Evolution Analysis
GEvo-logo.png
GEvo Screenshot.png

Typical GEvo Analysis
Software company: CoGe Team
Analysis Type: Compare multiple genomic regions for synteny and other forms of genome evolution
Working state: Released
Tools Utilized: blastn, tblastx, blastz, CHAOS, LAGAN, DiAlign 2
Website: http://synteny.cnr.berkeley.edu/CoGe/GEvo.pl

GEvo is CoGe's Genome Evolution Analysis tool, designed to visually compare genomic regions using both local and global alignment algorithms.


Introduction

The purpose of GEvo is to compare multiple genomic regions from any number of organisms, using a variety of sequence comparison algorithms, in order to quickly identify patterns of genome evolution.

Getting started

Screenshot of where a GEvo analysis is configured. Two genomic regions have been specified by gene name and the amount of additional upstream/downstream sequence.
  1. Select genomic regions to analyze
  2. Select a sequence alignment algorithm appropriate for the sequences and questions in mind
  3. Press the "Run GEvo Analysis!" button

To alternate between areas to configure an analysis, select the appropriate tab.

Sequence Submission

Select the "Sequence Submission" tab to open these options. Here, you can specify sequence submission boxes for each sequence that will be submitted for a GEvo analysis. This is also where you can adjust the amount of sequence analyzed, select which sequences are analyzed, reverse complement a sequence, mask a sequence according to the genomic features it contains, and change the display order of sequences.

Adding a sequence

To add another sequence submission box, press the "Add sequence" button. A new sequence submission box will then appear.

Select the type of sequence

There are three ways to submit a sequence to GEvo:

  1. Use a CoGe genomic feature name
  2. Specify a GenBank accession for automatic retrieval from NCBI
  3. Paste a sequence in FASTA or GenBank format
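
For the third option, it can help to check which format a pasted sequence is in before submitting it. The following is a minimal sketch, not GEvo's own code: it relies only on the conventions that a FASTA record begins with a ">" header line and a GenBank flat-file record begins with a "LOCUS" line. The function name and the example records are made up for illustration.

```python
def detect_sequence_format(text: str) -> str:
    """Guess the format of pasted sequence text.

    Returns "fasta", "genbank", or "unknown". This is an illustrative
    helper, not part of GEvo.
    """
    stripped = text.lstrip()
    if stripped.startswith(">"):
        # FASTA records start with a ">" header line.
        return "fasta"
    if stripped.startswith("LOCUS"):
        # GenBank flat files start with a LOCUS line.
        return "genbank"
    return "unknown"


# Made-up example records, for illustration only.
example_fasta = ">my_region\nATGCGTACGTTAGC\n"
example_genbank = "LOCUS       EXAMPLE01     14 bp    DNA     linear\n"

print(detect_sequence_format(example_fasta))
print(detect_sequence_format(example_genbank))
```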

You can select the sequence submission type from a drop-down menu located

Specifying the amount of sequence analyzed

Skip a sequence

Making a sequence a "reference sequence"

Reverse complement a sequence

Masking a sequence

Changing the display order of sequences

Alignment Algorithms

Currently, GEvo can use:

  • BlastZ: DNA-DNA Local Alignment Algorithm. Good for finding large regions of conserved sequence.
  • BlastN: DNA-DNA Local Alignment Algorithm. Good for finding small regions of conserved sequence.
  • TBlastX: Translated DNA-Translated DNA Local Alignment Algorithm. Good for finding small regions of divergent, but evolutionarily conserved, genomic sequence where protein translated sequence is more conserved than DNA sequence.
  • Chaos: DNA-DNA Local Alignment Algorithm. Good for finding small regions of conserved sequence. Uses fuzzy matches, so it can seed its alignment on smaller sequences than BlastN. However, it is slower than BlastN.
  • DiAlign: DNA-DNA Global Alignment Algorithm. The global alignment can be seeded using a local alignment algorithm. Good for aligning the entire sequence.

GEvo supports using Chaos, BlastN, and BlastZ for seeding DiAlign.

  • Lagan: DNA-DNA Glocal Alignment Algorithm. Uses a hybrid alignment approach.
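
The algorithm list above can be summarized as a small lookup table, which may be handy when deciding what to try first. This is only a restatement of the descriptions on this page in Python; the dictionary layout and the helper function are assumptions made for illustration and do not correspond to any GEvo interface.

```python
# Summary of the algorithms described above. The structure is hypothetical;
# the contents are taken from the descriptions on this page.
ALGORITHMS = {
    "BlastZ": {"type": "local", "good_for": "large regions of conserved sequence"},
    "BlastN": {"type": "local", "good_for": "small regions of conserved sequence"},
    "TBlastX": {"type": "local", "good_for": "divergent regions where protein sequence is more conserved than DNA"},
    "Chaos": {"type": "local", "good_for": "small regions of conserved sequence (fuzzy seeds, slower than BlastN)"},
    "DiAlign": {"type": "global", "good_for": "aligning the entire sequence"},
    "Lagan": {"type": "glocal", "good_for": "hybrid alignment approach"},
}

# Local aligners that GEvo supports for seeding DiAlign.
DIALIGN_SEEDERS = ("Chaos", "BlastN", "BlastZ")


def algorithms_of_type(kind: str) -> list[str]:
    """Return the names of all algorithms of the given alignment type."""
    return sorted(name for name, info in ALGORITHMS.items() if info["type"] == kind)


print(algorithms_of_type("local"))
print(algorithms_of_type("global"))
```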

Results

GEvo's results are displayed in an interactive system called Gobe that lets you connect regions of similar sequence and get additional information about genomic features. Please follow this link for more information.

Regenerating/Saving a GEvo Analysis

GEvo-links.png

GEvo Links

After results are generated by GEvo, a URL is created that links back to GEvo with your analysis pre-configured. To regenerate the results, all you need to do is press the "Run GEvo Analysis!" button and wait for the analysis to run. This link is stored in two places:

  1. At the bottom of the results under "GEvo Links" (see example image). This link has been condensed using the tinyurl redirecting service.
  2. At the bottom of the log file. The link to the log file can also be found at the bottom of the results (see example image).
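
Because a GEvo link is an ordinary URL, the settings it carries can be inspected with any URL parser. The sketch below uses Python's standard urllib.parse and assumes you have the full, uncondensed URL (a tinyurl link would first have to be followed to its target). The parameter names in the example URL are hypothetical placeholders, not documented GEvo parameters.

```python
from urllib.parse import parse_qsl, urlsplit


def link_parameters(url: str) -> list[tuple[str, str]]:
    """Return the query parameters of a URL as (name, value) pairs, in order."""
    return parse_qsl(urlsplit(url).query, keep_blank_values=True)


# Hypothetical example: "seq1", "up1", and "down1" are placeholder names,
# not GEvo's actual parameter names.
example_link = "http://synteny.cnr.berkeley.edu/CoGe/GEvo.pl?seq1=geneA&up1=10000&down1=10000"

for name, value in link_parameters(example_link):
    print(f"{name} = {value}")
```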

GEvo Direct

GEvo Direct is a tool for quickly viewing the results of a previously run analysis without having to re-run the analysis. Please Note: CoGe saves all the files from a GEvo analysis for ~24 hours. After that time, the data-files are deleted and the GEvo Direct link will no longer work.

Save Work History

Registered CoGe users can save a link to a GEvo analysis for later retrieval from their work history. This also permits a GEvo analysis to be named and annotated for future reference.

Modifying result graphics

Showing Contigs

Example GEvo result with contigs, HSP labels, and genomic feature labels drawn.
Where to find GEvo's options for viewing contigs, HSP labels, and genomic feature labels.

Some genomes have contig assembly information. To view this in GEvo's results:

  1. Select the "Results Parameters" tab from GEvo's configuration box
  2. Select "yes" for the option "Color contigs red".

Turning on labels for HSPs (blast hits) in GEvo's results

If you want to have the HSP number drawn on the HSP:

  1. Select the "Results Parameters" tab from GEvo's configuration box
  2. Select "yes" for the option "Label HSPs".
  • You can have the labels drawn linearly, so that every label in a track sits at the same vertical position, or staggered, so that successive labels alternate between top, middle, and bottom.
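
The difference between the two label layouts can be sketched as follows. This is an illustration of the idea only, not GEvo's drawing code; in particular, placing linear labels at the "middle" position is an assumption, since the page says only that they share one vertical position.

```python
def label_rows(n_labels: int, staggered: bool) -> list[str]:
    """Assign a vertical position to each of n_labels labels in a track."""
    positions = ("top", "middle", "bottom")
    if not staggered:
        # Linear layout: every label shares one vertical position
        # ("middle" is an arbitrary choice for this sketch).
        return ["middle"] * n_labels
    # Staggered layout: cycle through top, middle, bottom.
    return [positions[i % len(positions)] for i in range(n_labels)]


print(label_rows(4, staggered=False))
print(label_rows(4, staggered=True))
```

Staggering keeps neighboring labels from overwriting one another when HSPs or features sit close together.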

Turning on labels for Genomic Features (e.g. genes) in GEvo's results

If you want to have the feature names drawn on the feature:

  1. Select the "Results Parameters" tab from GEvo's configuration box
  2. Select "yes" for the option "Label Genomic Features".
  • You can have the labels drawn linearly, so that every label in a track sits at the same vertical position, or staggered, so that successive labels alternate between top, middle, and bottom.

Expanding Overlapping Features and Regions of Sequence Similarity

Where to find GEvo's options for viewing overlapping genomic features and regions of sequence similarity.
Example of a GEvo result with local duplications that are obscured because overlapping HSPs are not drawn separately. The comparison is between orthologous regions of Arabidopsis thaliana and Arabidopsis lyrata. (A) No wedges drawn connecting regions of sequence similarity. (B) Wedges drawn connecting regions of sequence similarity. Note the "messy" regions where the local duplication is. Results can be regenerated at http://tinyurl.com/mokdnn .
GEvo results with "auto adjust" HSP and Genomic Features turned on. This causes GEvo to find genomic features and blast hits that overlap at the same position and draw them separately, which helps identify local duplications in a genomic region, repeat sequences, and alternatively spliced transcripts. This is a comparison between orthologous regions of Arabidopsis thaliana and Arabidopsis lyrata, and can be regenerated at http://tinyurl.com/mokdnn . Wedges have been drawn connecting regions of sequence similarity for one gene in the bottom panel. This shows that this one gene has sequence similarity to four regions in the orthologous genomic region, which is indicative of a local gene duplication. Also, there is a "stack" of HSPs, which is caused by repeated sequences. Note that two genes are annotated as alternatively spliced, which is visualized by drawing the overlapping genomic features separately.

By default, GEvo draws overlapping genomic features and regions of sequence similarity on top of one another. However, this sometimes hides interesting complexities in a genomic region, such as local duplications or regions containing repeated sequences. To view these, select the "Results Parameters" tab and select "Yes" for "Auto adjust overlapping features" and/or "Auto adjust overlapping HSPs". These options are set to "No" by default because finding and drawing overlapping features can take a long time to process and is not always useful.
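
One common way to draw overlapping items separately is to stack them into rows so that no two items in the same row overlap. The sketch below shows a simple greedy version of that idea. It is not GEvo's actual algorithm, which this page does not describe; the function name and the (start, end) tuple representation are assumptions.

```python
def assign_rows(intervals: list[tuple[int, int]]) -> list[int]:
    """Assign each (start, end) interval a row so overlapping intervals differ.

    Returns one row index per interval, in the same order as the input.
    """
    # Visit intervals from left to right.
    order = sorted(range(len(intervals)), key=lambda i: intervals[i])
    row_ends: list[int] = []  # end position of the last interval in each row
    rows = [0] * len(intervals)
    for i in order:
        start, end = intervals[i]
        for row, last_end in enumerate(row_ends):
            if start > last_end:
                # The interval fits after the last one in this row.
                row_ends[row] = end
                rows[i] = row
                break
        else:
            # It overlaps every existing row, so open a new one.
            row_ends.append(end)
            rows[i] = len(row_ends) - 1
    return rows


# Made-up coordinates: the second interval overlaps the first and third.
hsps = [(1, 10), (5, 15), (12, 20), (30, 40)]
print(assign_rows(hsps))
```

Even this simple version has to compare each item against the rows built so far, which gives a sense of why overlap detection adds processing time on regions with many features or HSPs.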

Merging Analyses

There are often times when you will want to merge two or more separate GEvo analyses. To do this, copy a GEvo link into the text box next to the text "Merge Previous GEvo Analysis (paste in URL)", located at the top of the sequence submission tab. Then press the "Merge" button. The sequences specified in the pasted URL will appear as new sequence submission boxes, configured as specified in the link (extra up/downstream sequence, reverse complement, masked, etc.).
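
Conceptually, merging appends the sequences of one analysis to another while keeping each sequence's own settings. The sketch below models that behavior with a small data class. The class, its field names, and the gene names are hypothetical, chosen to mirror the settings listed above; this is not how GEvo stores or merges analyses internally.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SequenceBox:
    """Hypothetical model of one sequence submission box."""
    name: str
    upstream: int = 0          # extra upstream sequence
    downstream: int = 0        # extra downstream sequence
    reverse_complement: bool = False
    masked: bool = False


def merge_analyses(*analyses: list[SequenceBox]) -> list[SequenceBox]:
    """Concatenate analyses, preserving order and per-sequence settings."""
    merged: list[SequenceBox] = []
    for boxes in analyses:
        merged.extend(boxes)
    return merged


# Made-up analyses for illustration.
first = [SequenceBox("geneA", upstream=10000, downstream=10000)]
second = [
    SequenceBox("geneB", reverse_complement=True),
    SequenceBox("geneC", masked=True),
]

for box in merge_analyses(first, second):
    print(box)
```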

Refining an analysis

Once a GEvo analysis has run, you can change any of the analysis parameters and re-run the analysis by pressing the "Run GEvo Analysis!" button again. The most commonly changed parameters are:

  • The extent of the genomic region analyzed. The interactive results make this easy with slider bars.
  • The algorithm used in the analysis
  • Masking sequences
  • Skipping sequences
  • Reverse complementing sequences
  • The coloration and information displayed in the result's graphics

Example Analyses

Analysis of syntenic regions from Arabidopsis thaliana, Carica papaya, and Vitis vinifera


Linking to GEvo

Linking to GEvo is easy! Please see this page on how.

Tutorials

References