ChIP-seq Analysis Pipeline: Difference between revisions

Latest revision as of 16:53, 4 March 2016

Note: this document is a draft and still under revision.

See the LoadExperiment tool to use the new pipeline.

This analysis pipeline was developed by Xiang Ju (in the lab of Brian Gregory at UPenn).

Three FASTQ input files are required: one input and two replicates.

The pipeline performs the following steps:

1. Trim FASTQ files (optional)
2. Align FASTQ files to the reference genome sequence using the selected alignment tool (GSNAP, Bowtie, etc.)
   1. Build index of reference sequence
   2. Individually map FASTQ files to reference
3. Create tag directories (Homer)
4. Find peaks (Homer)
5. Load results
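
The steps above can be sketched as a list of command lines. This is only an illustrative sketch, not CoGe's actual implementation: it assumes Bowtie 2 as the aligner (`bowtie2-build`, `bowtie2`), assumes each replicate is the target and the input is the control when calling Homer's `findPeaks`, and assumes `-style factor`. The function name `build_commands`, the file names, and the output directory are hypothetical. The optional trimming step, conversion of alignments to BAM, and loading of results into CoGe are not shown.

```python
from pathlib import Path


def build_commands(reference, input_fastq, replicate_fastqs, workdir="chipseq_out"):
    """Return the pipeline's commands as argv lists, in execution order.

    Hypothetical helper: nothing is executed here, the commands are only built.
    """
    out = Path(workdir)
    index = str(out / "ref_index")

    # Build the aligner index of the reference sequence (done once).
    commands = [["bowtie2-build", reference, index]]

    # Map every FASTQ file to the reference individually.
    sam_files = {}
    for fastq in [input_fastq, *replicate_fastqs]:
        sam = str(out / (Path(fastq).stem + ".sam"))
        sam_files[fastq] = sam
        commands.append(["bowtie2", "-x", index, "-U", fastq, "-S", sam])

    # Create one Homer tag directory per alignment.
    tag_dirs = {}
    for fastq, sam in sam_files.items():
        tag_dir = str(out / (Path(fastq).stem + "_tags"))
        tag_dirs[fastq] = tag_dir
        commands.append(["makeTagDirectory", tag_dir, sam])

    # Find peaks once per replicate, with the input as control (assumption).
    for rep in replicate_fastqs:
        peaks = str(out / (Path(rep).stem + "_peaks.txt"))
        commands.append([
            "findPeaks", tag_dirs[rep],
            "-style", "factor",            # assumed analysis style
            "-i", tag_dirs[input_fastq],   # control tag directory
            "-o", peaks,
        ])
    return commands
```

With the tools installed, each returned list could be passed to `subprocess.run(cmd, check=True)` in order. For one input and two replicates this yields one index build, three alignments, three tag directories, and two peak-finding runs.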

The pipeline produces five outputs (represented as "Experiments" in CoGe):

Three BAM files, one for each FASTQ input mapped to the reference genome sequence
Two peak tracks, one per replicate, each from the input analyzed with respect to that replicate

@@ Line 1: / Line 1: @@
+<span style="color:red">Note: this document is a draft and still under revision.</span>
 CoGe can analyze [https://en.wikipedia.org/wiki/ChIP-sequencing chromatin immunoprecipitation sequencing (ChIP-seq)] data using the software package [http://homer.salk.edu/homer/ Homer].
@@ Line 14: / Line 16: @@
 # Trim FASTQ files (optional)
-# Align FASTQ files to reference genome sequence -- these steps depend on which alignment software tool is selected (GSNAP, Bowtie, etc)
+# Align FASTQ files to reference genome sequence using selected alignment software tool (GSNAP, Bowtie, etc)
 ## Build index of reference sequence
 ## Individually map FASTQ files to reference
@@ Line 22: / Line 24: @@
 ==Outputs==
+The pipeline produces 5 outputs (represented as "Experiments" in CoGe):
+* Three BAM files corresponding to each FASTQ input mapped to the reference genome sequence
+* Two peaks tracks corresponding to the input analyzed with respect to each replicate.