MAKER Test

From CoGepedia
Jump to navigation Jump to search

MAKER is a genome annotation pipeline[1]. It allows for a researcher or group of researchers to take a genome, some amount of evidence (for example, an EST file, a protein file (both in FASTA format), and a repeat file, and potentially more), and create structural annotations for a genome. It is capable of training HMM files in order to provide better annotations for a genome with little evidence, although this takes many runs. This page is an attempt to document the work being done to add MAKER into CoGe.

How to Download and Install MAKER

MAKER may be downloaded from the Yandell lab, here: http://www.yandell-lab.org/software/maker.html. The full installation instructions may be found here: http://weatherby.genetics.utah.edu/MAKER/wiki/index.php/MAKER_Tutorial. The instructions here will just serve as a brief overview for getting MAKER running on the command line in UNIX.

1. Register and download MAKER from the Yandell lab MAKER software page.

2. Unpack MAKER in whichever folder it will be run from.

3. Download and install prerequisites if they are not installed. The minimum prerequisites are:

    a. BioPerl and various other Perl modules (see the MAKER documentation for a complete list[2]).
    b. SNAP
    c. Exonerate
    d. RepeatMasker
    e. NCBI BLAST

4. Add MAKER and its prerequisites to $PATH. For example, the paths might look something like:

    a. MAKER: export PATH="/home/user/maker/bin:$PATH"
    b. RepeatMasker: export PATH="/home/user/RepeatMasker:$PATH"
    c. Exonerate: export PATH="/home/user/exonerate-2.2.0-x86_64/bin:$PATH"
    d. SNAP: export PATH="/home/user/snap:$PATH"


Running MAKER from the UNIX command line

1. Set up the MAKER control files by typing "maker -CTL".

    a. To set the genome, enter the path to the genome after "genome=". So, this might look like "genome=dpp_contig.fasta".
       Leave a space between this and the commented description (which starts with a "#" symbol).
    b. Set the EST or mRNA data by typing the path to the desired EST or mRNA fasta file after "est=".
    c. To have MAKER generate structural annotations directly from EST or mRNA data, change "est2genome=0" to "est2genome=1".
    d. Change any other desired settings. For more details on these settings and how to set them, see the full MAKER tutorial[3].

2. Open the MAKER boot options file ("maker_bopts.ctl") and ensure that the correct BLAST search type is selected.

    a. For example, to use NCBI-BLAST, set "blast_type=ncbi+".

3. Edit the MAKER options file ("maker_opts.ctl") to the desired settings.

4. Run MAKER by typing "maker" on the command line.

    a. To prevent MAKER from eating up too many system resources, use the "nice" command.
    b. To run MAKER in the background, and to avoid being spammed with constant messages, send the messages to both stdout and stderr with "&> somefile.txt".
    c. Putting both "a" and "b" together, the command might look something like, "nice maker &> log.txt".

5. MAKER may take about a day or more to run, depending on the size of the genome it is attempting to annotate and the evidence it is given. Once finished, it should place all files into a "genomename.maker.output" folder, with all the data split into separate folders by contigs. Each one of these folders will have several sub-folders, one of which should contain a "chromosomename.gff" file which contains the structural annotations, repeats, and will always end with the contig sequence.


<UNDER CONSTRUCTION>