OrganismView

From CoGepedia
Revision as of 12:31, 29 December 2009 by Elyons

OrganismView is CoGe's tool for finding the genome of an organism of interest and getting an overview of its genomic information.

Introduction

CoGe is designed to store multiple versions of any genome from multiple organisms from all domains of life in any state of assembly and annotation. This includes bacteria, archaea, eukaryotes, organelles, viruses, and sub-genomes such as plasmids. The genomic sequence can also exist in different states, such as partially assembled, fully assembled, completely unmasked, or masked for repeats. There can also be different sets of genomic features and annotations. OrganismView allows users to get detailed information about the genomes available for a given organism, and provides links to other tools in CoGe to extract and visualize various types of genomic information.

Getting Started

How OrganismView appears when first loaded. You can search for your organism by name (Genus species) or by description (Linnaean lineage).

Most organisms in CoGe use the scientific binomen (i.e. Genus species; e.g. Escherichia coli) for their name and the full Linnaean lineage for their description (e.g. Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; Enterobacteriaceae; Escherichia). To search for an organism, type any part of its name or description in OrganismView's "Organism Name" or "Organism Description" search box, respectively. OrganismView will start searching for anything that matches and display the matching organisms in a selectable list below the header "Organisms:". The small number next to the "Organisms:" header is the number of organisms whose name or description matched your search term. Next, scroll through the list and select your organism. Information about it will automatically start to appear in the other sections of OrganismView.
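The matching behavior described above can be illustrated with a minimal Python sketch. This is not CoGe's actual implementation: the Organism class and the search_organisms function are hypothetical names, the Arabidopsis lineage is abbreviated sample data, and case-insensitive substring matching is an assumption.

```python
from dataclasses import dataclass


@dataclass
class Organism:
    name: str          # scientific binomen, e.g. "Escherichia coli"
    description: str   # Linnaean lineage


def search_organisms(organisms, name_term="", description_term=""):
    """Return organisms whose name and description contain the given terms.

    Matching is a case-insensitive substring test (an assumption, not
    confirmed CoGe behavior). An empty term matches every organism.
    """
    name_term = name_term.lower()
    description_term = description_term.lower()
    return [
        o for o in organisms
        if name_term in o.name.lower()
        and description_term in o.description.lower()
    ]


# Illustrative sample data only.
organisms = [
    Organism(
        "Escherichia coli",
        "Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; "
        "Enterobacteriaceae; Escherichia",
    ),
    Organism(
        "Arabidopsis thaliana",
        "Eukaryota; Viridiplantae; Streptophyta; Brassicales; Brassicaceae; "
        "Arabidopsis",
    ),
]

matches = search_organisms(organisms, name_term="arabid")
# The count corresponds to the small number shown next to "Organisms:".
print(len(matches), [o.name for o in matches])
```

Searching by description works the same way, for example search_organisms(organisms, description_term="proteobacteria").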

Organization and Information

Searching for organisms whose name contains 'arabid'.

When an organism is selected, information is shown at several levels of scope (listed from largest to smallest):

  • Organism -- top level list of organisms
  • Genome -- whole genome information
  • Dataset -- a given genome is composed of one or more datasets. Different genomic resources organize genomic information differently, and this level represents how an organism's genome was acquired. For example, each chromosome may come from a separate data file.
  • Chromosome -- the list of chromosomes for a selected dataset.

OrganismView is organized so that the above information is listed from the top to the bottom of the screen. At each scope level, a selectable list is shown on the left of the screen, and information about the current selection is shown to the right.
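The four scope levels can be pictured as a nested data structure. The sketch below is only a way to think about the hierarchy, not CoGe's database schema: all class names, field names, file names, and lengths are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Chromosome:
    name: str
    length: int  # in base pairs


@dataclass
class Dataset:
    # One dataset may correspond to one source data file.
    name: str
    chromosomes: List[Chromosome] = field(default_factory=list)


@dataclass
class Genome:
    version: str
    sequence_type: str  # e.g. "unmasked" or "masked"
    datasets: List[Dataset] = field(default_factory=list)

    def chromosome_count(self) -> int:
        """Number of chromosomes across all datasets of this genome."""
        return sum(len(d.chromosomes) for d in self.datasets)

    def total_length(self) -> int:
        """Combined length of every chromosome in every dataset."""
        return sum(c.length for d in self.datasets for c in d.chromosomes)


@dataclass
class Organism:
    name: str
    description: str
    genomes: List[Genome] = field(default_factory=list)


# Toy example: each chromosome comes from a separate data file.
genome = Genome(
    version="1",
    sequence_type="unmasked",
    datasets=[
        Dataset("chr1.fasta", [Chromosome("1", 1000)]),
        Dataset("chr2.fasta", [Chromosome("2", 800)]),
    ],
)
organism = Organism("Example organism", "Example; lineage", [genome])

print(genome.chromosome_count(), genome.total_length())
```

Selecting an item at one level narrows what is listed at the levels below it, which is why the summary methods sit on Genome and aggregate over its datasets.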

Organism Information

Shows the name and description for the selected organism.

Genome Information

Overview of the genome:

  • Chromosome count (will be very high for partially assembled genomes)
  • Sequence type: Unmasked sequence, masked sequence
  • Total length: The combined length of all datasets making up this genome, which may include plasmids, organelles, etc., depending on how the "genome" was defined by whoever sequenced it. Percent GC is calculated automatically for genomes smaller than 10 megabases; for larger genomes, the user can click a link to calculate percent GC content.
  • Non-coding sequence: A link that will calculate the length and GC content of non-protein-coding sequence
  • Link to generate a summary table of all features in the genome.
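The GC and non-coding figures in the list above can be sketched in Python as follows. This is an illustration under stated assumptions, not CoGe's code: the function names are hypothetical, percent GC is computed over unambiguous A, C, G, and T bases only (so N or X masking characters are ignored), and coding regions are given as 0-based, half-open (start, end) intervals. Only the 10 megabase threshold comes from the text.

```python
AUTO_GC_LIMIT = 10_000_000  # 10 megabases, the threshold stated in the text


def percent_gc(sequence: str) -> float:
    """Percent of G and C among unambiguous bases (A, C, G, T).

    Ignoring other symbols such as N or X is an assumption of this sketch.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    return 100.0 * gc / total if total else 0.0


def should_auto_calculate(total_length: int) -> bool:
    """True when percent GC would be calculated without a user click."""
    return total_length < AUTO_GC_LIMIT


def noncoding_sequence(sequence: str, coding_intervals) -> str:
    """Return the sequence with all protein-coding positions removed.

    coding_intervals holds 0-based, half-open (start, end) pairs
    (a hypothetical input format chosen for this sketch).
    """
    coding = [False] * len(sequence)
    for start, end in coding_intervals:
        for i in range(max(start, 0), min(end, len(sequence))):
            coding[i] = True
    return "".join(base for base, is_cds in zip(sequence, coding) if not is_cds)


# Toy sequence: positions 0-5 are coding, the remainder is non-coding.
sequence = "ATGGGCCCTAAT"
noncoding = noncoding_sequence(sequence, [(0, 6)])

print(should_auto_calculate(len(sequence)))
print(len(noncoding), round(percent_gc(noncoding), 2))
```

Building the mask once and then filtering handles overlapping coding intervals without counting any position twice.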

Dataset Information

Chromosome Information

Genomic Data

GC content

  • Total
  • Non-coding

Feature Lists

Links

Genome Viewer

Get Sequence

Linking to OrganismView

It is relatively easy to link directly into OrganismView to search for an organism or retrieve a specific organism. Please see Linking to OrganismView for more information.