Variation Annotation with SnpEff

SnpEff is a tool for variant annotation and effect prediction. It annotates and predicts the effects of genetic variants, such as amino acid changes.

A typical SnpEff use case would be:

  • Input: The inputs are predicted variants (SNPs, insertions, deletions, and MNPs). The input file is usually obtained as a result of a sequencing experiment and is typically in variant call format (VCF).
  • Output: SnpEff analyzes the input variants. It annotates the variants and calculates the effects they produce on known genes, such as amino acid changes.

How to Use This Sample

If you haven’t used the workflow samples in UGENE before, take a look at the “How to Use Sample Workflows” section of the documentation.

Workflow Sample Location

The workflow sample “Variation Annotation with SnpEff” can be found in the “NGS” section of the Workflow Designer samples.

Workflow Image

The opened workflow appears as follows:

Workflow Wizard

The wizard consists of 4 pages.

  1. Input Variations: On this page, you must input variations file(s).

  2. Change Chromosome Notation for Variations: On this page, you can change the chromosome notation for variations.

    The following parameters are available:

    ParameterDescription
    Replace prefixesInput the list of chromosome prefixes you would like to replace (e.g., “NC_000”). Separate different prefixes by semicolons.
    Replace byInput the prefix that should be set instead, for example “chr”.
  3. SnpEff Parameters: On this page, you can modify SnpEff parameters.

    The following parameters are available:

    ParameterDescription
    GenomeSelect the target genome. Genome data will be downloaded if it is not found.
    Canonical transcriptsUse only canonical transcripts.
    HGVS nomenclatureAnnotate using HGVS nomenclature.
    Annotate Loss of FunctionAnnotate loss of function (LOF) and nonsense-mediated decay (NMD).
    Annotate TFBSs motifsAnnotate transcription factor binding site motifs (only available for the latest GRCh37).
    Upstream/downstream lengthSet upstream and downstream interval size. Eliminate any upstream and downstream effect by using 0 length.
  4. Output: On this page, you need to input output parameters.