Interpreting Dotplot - Identifying Matches, Mutations, Inversions, etc

Using a dotplot graphic, you can identify the following differences between the sequences:

1. Matches

A match between sequences appears as a diagonal line on the dotplot graphic, representing a continuous match (or repeat).

2. Frame shifts

a. Mutations

Mutations are differences between sequences. On the graphic, they are represented by gaps in diagonal lines. These gaps interrupt matches.

b. Insertions

Insertions are parts of one sequence that are missing in another, while the surrounding parts match. In other words, an insertion is a subsequence that was inserted into a sequence.

Graphically, insertions are represented by gaps that lie only on one axis. A slight shift towards the other axis indicates a mutation is involved.

c. Deletions

A deletion is a subsequence that has been removed from a sequence.

A deletion from sequence A, found in sequence B, can be considered an insertion into sequence B when contained in sequence A.

Example of Dotplot

3. Inverted repeats

The Dotplot plugin also allows for the search of inverted repeats. Inverted repeats are shown as opposed to direct repeats.

Use the Search direct repeats and Search inverted repeats options in the Dotplot parameters dialog to select which repeats to depict (the dialog is described here).

Inverted Repeats Example

4. Low-complexity regions

A low-complexity region is characterized by redundancy in a particular part of the sequence. It is represented on a plot as a rectangular area filled with matches.

Low-complexity Regions Example

Hint

Compare a sequence with itself to easily find low-complexity regions within it.