Difference between revisions of "Dotplot"

From CoGepedia
Jump to: navigation, search
(Created page with 'Dot plots are used to determine the percent similarity between two protein sequences. A short segment of one sequence also called window size, is compared with all possible segme...')
 
 
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:
Dot plots are used to determine the percent similarity between two protein sequences. A short segment of one sequence also called window size, is compared with all possible segments of the same length in the second sequence. This generates a matrix of several alignments between the two sequences. For each pair of alignment, the similarity between the amino acid residues is scored based on the probability with which the various pairs of aligned residues replace each other and the similarity in five physical parameters of the amino acid residues. If the similarity between any two segments exceeds the threshold value then a positive score is registered which is displayed as a dot. Weak signals of homology could also be detected by manipulating this threshold value specifically by increasing the mismatch limit and the window size. <br>
+
[[Image:K12-MG1655-DH10B-syn_dotplot.png|thumb|500px|right|Syntenic dotplot between two substrains of Escherichia coli K12.]]
 +
 
 +
Dotplots are used to determine the percent similarity between two sequences, usually DNA or protein, but are also used for whole genome alignments to detect syntenic regions. Each axis of a dotplot represents the linear arrangement of one sequence being compared. A short segment of one sequence (also called window size) is compared with all possible segments of the same length in the second sequence. This generates a matrix of several alignments between the two sequences. For each pair of alignments, the similarity between the sequences' residues is scored based on the probability with which the various pairs of aligned residues replace one another. If the similarity between any two segments exceeds a threshold value, then a positive score is registered, and a dot is displayed in the dotplot. Weak signals of homology could also be detected by manipulating this threshold value.

Latest revision as of 15:45, 21 September 2009

Syntenic dotplot between two substrains of Escherichia coli K12.

Dotplots are used to determine the percent similarity between two sequences, usually DNA or protein, but are also used for whole genome alignments to detect syntenic regions. Each axis of a dotplot represents the linear arrangement of one sequence being compared. A short segment of one sequence (also called window size) is compared with all possible segments of the same length in the second sequence. This generates a matrix of several alignments between the two sequences. For each pair of alignments, the similarity between the sequences' residues is scored based on the probability with which the various pairs of aligned residues replace one another. If the similarity between any two segments exceeds a threshold value, then a positive score is registered, and a dot is displayed in the dotplot. Weak signals of homology could also be detected by manipulating this threshold value.