Difference between revisions of "Dotplot"
[[Image:K12-MG1655-DH10B-syn_dotplot.png|thumb|500px|right|Syntenic dotplot between two substrains of Escherichia coli K12.]]
Latest revision as of 15:45, 21 September 2009
Dotplots are used to visualize regions of similarity between two sequences, usually DNA or protein, and are also used on whole-genome alignments to detect syntenic regions. Each axis of a dotplot represents the linear arrangement of one of the sequences being compared. A short segment of one sequence (its length is called the window size) is compared with every segment of the same length in the second sequence. This generates a matrix of pairwise segment comparisons between the two sequences. For each pair of segments, the similarity between the aligned residues is scored based on the probability with which the various pairs of aligned residues replace one another. If the score for a pair of segments exceeds a threshold value, a dot is displayed at the corresponding position in the dotplot. Weak signals of homology can also be detected by lowering this threshold value, at the cost of more background noise.
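The sliding-window comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not the method of any particular dotplot tool. The function names `dotplot` and `render` are hypothetical. The default scoring (1 for identical residues, 0 otherwise) is an assumed stand-in for a real substitution matrix. A dot is recorded when the window score is greater than or equal to the threshold, which is also an assumption.

```python
def dotplot(seq_a, seq_b, window=3, threshold=3, score=None):
    """Return the set of (i, j) start positions whose windows score >= threshold.

    i indexes seq_a and j indexes seq_b. The default score is simple identity
    (an assumption); pass a function based on a substitution matrix instead
    to score residue pairs by how readily they replace one another.
    """
    if score is None:
        score = lambda x, y: 1 if x == y else 0

    dots = set()
    # Compare every window of seq_a with every window of seq_b.
    for i in range(len(seq_a) - window + 1):
        for j in range(len(seq_b) - window + 1):
            total = sum(score(seq_a[i + k], seq_b[j + k]) for k in range(window))
            if total >= threshold:
                dots.add((i, j))
    return dots


def render(dots, n_rows, n_cols):
    """Draw the dot matrix as text: one row per seq_a window, '*' marks a dot."""
    return "\n".join(
        "".join("*" if (i, j) in dots else "." for j in range(n_cols))
        for i in range(n_rows)
    )


if __name__ == "__main__":
    a, b, w = "ACGTACGT", "ACGTTCGT", 3
    dots = dotplot(a, b, window=w, threshold=3)
    print(render(dots, len(a) - w + 1, len(b) - w + 1))
```

Lowering `threshold` below `window` admits windows with mismatches, which is how weaker similarities appear in the plot, along with more spurious dots. The nested loops take time proportional to the product of the sequence lengths times the window size, so this sketch is only practical for short sequences.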