Difference between revisions of "Genome masking"
From CoGepedia
(Created page with 'CoGe masks genomes by using NCBI's windowmasker: http://www.ncbi.nlm.nih.gov/pubmed/16287941 Currently, our pipeline for masking is: * Generate frequency counts for the masking ...') |
|||
(4 intermediate revisions by the same user not shown) | |||
Line 8: | Line 8: | ||
*Convert soft masking (lower case sequence data) to "X". | *Convert soft masking (lower case sequence data) to "X". | ||
** CoGe's visualizations often use purple to represent genome sequence "X" to make the identification of masked reasons easy | ** CoGe's visualizations often use purple to represent genome sequence "X" to make the identification of masked reasons easy | ||
+ | |||
+ | For users that are logged in, CoGe provides an option to mask a genome through [[GenomeInfo]] | ||
+ | |||
+ | [[File:Screen Shot 2016-02-01 at 9.20.58 AM.png]] |
Latest revision as of 10:22, 1 February 2016
CoGe masks genomes by using NCBI's windowmasker: http://www.ncbi.nlm.nih.gov/pubmed/16287941
Currently, our pipeline for masking is:
- Generate frequency counts for the masking database for the genome
- windowmasker -in <genome> -mk_counts -out <counts>
- Mask genome
- windowmasker -in <genome> -ustat <counts> -outfmt fasta -dust T -out <masked genome>
- Convert soft masking (lower case sequence data) to "X".
- CoGe's visualizations often use purple to represent genome sequence "X" to make the identification of masked reasons easy
For users that are logged in, CoGe provides an option to mask a genome through GenomeInfo