Difference between revisions of "Marijuana assembly"
From CoGepedia
(Created page with '== Obtain raw reads == Sequences obtained from: http://csativa.elasticbeanstalk.com/ Info: The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. The...') |
|||
Line 4: | Line 4: | ||
Info: | Info: | ||
− | The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. There are 7 Lanes in total which add up to 131Gb of sequence. The genome is estimated to be 400Mb thus an estimated 327X coverage. | + | The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. There are 7 Lanes in total which add up to 131Gb of sequence. |
+ | The genome is estimated to be 400Mb thus an estimated 327X coverage. | ||
== Merge read files == | == Merge read files == | ||
cat *_1_sequence* > R1_all.fastq.gz & | cat *_1_sequence* > R1_all.fastq.gz & | ||
cat *_2_sequence* > R2_all.fastq.gz & | cat *_2_sequence* > R2_all.fastq.gz & |
Revision as of 09:50, 20 August 2011
Obtain raw reads
Sequences obtained from: http://csativa.elasticbeanstalk.com/
Info:
The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. There are 7 Lanes in total which add up to 131Gb of sequence. The genome is estimated to be 400Mb thus an estimated 327X coverage.
Merge read files
cat *_1_sequence* > R1_all.fastq.gz & cat *_2_sequence* > R2_all.fastq.gz &