Marijuana assembly
Obtain raw reads
Sequences obtained from: http://csativa.elasticbeanstalk.com/
Info:
The sequence data is derived from an ILMN HiSeq v2.0 chemistry with 2x100 reads. There are 7 Lanes in total which add up to 131Gb of sequence. The genome is estimated to be 400Mb thus an estimated 327X coverage.
Merge read files
cat *_1_sequence* > R1_all.fastq.gz & cat *_2_sequence* > R2_all.fastq.gz &