Difference between revisions of "Marijuana assembly"

From CoGepedia
Jump to: navigation, search
(Created page with '== Obtain raw reads == Sequences obtained from: http://csativa.elasticbeanstalk.com/ Info: The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. The...')
 
Line 4: Line 4:
  
 
Info:
 
Info:
  The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. There are 7 Lanes in total which add up to 131Gb of sequence. The genome is estimated to be 400Mb thus an estimated 327X coverage.  
+
  The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. There are 7 Lanes in total which add up to 131Gb of sequence.  
 +
The genome is estimated to be 400Mb thus an estimated 327X coverage.  
  
 
== Merge read files ==
 
== Merge read files ==
 
  cat *_1_sequence* > R1_all.fastq.gz &
 
  cat *_1_sequence* > R1_all.fastq.gz &
 
  cat *_2_sequence* > R2_all.fastq.gz &
 
  cat *_2_sequence* > R2_all.fastq.gz &

Revision as of 09:50, 20 August 2011

Obtain raw reads

Sequences obtained from: http://csativa.elasticbeanstalk.com/

Info:

The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. There are 7 Lanes in total which add up to 131Gb of sequence. 
The genome is estimated to be 400Mb thus an estimated 327X coverage. 

Merge read files

cat *_1_sequence* > R1_all.fastq.gz &
cat *_2_sequence* > R2_all.fastq.gz &