Filter Sequence That Matches a Pattern

This workflow filters input sequences, keeping those that match (or do not match) user-specified patterns.


How to Use This Sample

If you haven’t used workflow samples in UGENE before, refer to the “How to Use Sample Workflows” section of the documentation.


Workflow Sample Location

The workflow sample “Filter Sequence That Match a Pattern” is available in the Scenarios section of the Workflow Designer samples.


Workflow Image

[Image: diagram of the workflow]


Workflow Wizard

The wizard consists of 3 pages:

1. Input sequence(s)

On this page, select the input sequence files.


2. Find pattern

On this page, specify the patterns to search for and, optionally, configure the search parameters.

Parameters

| Parameter | Description | Default Value | Type |
|---|---|---|---|
| Pattern | Semicolon-separated list of patterns to search for | (user-defined) | string |
| Use pattern name | Use names of pattern sequences as annotation names (if loaded from file) | False | boolean |
| Max Mismatches | Maximum number of mismatches allowed between a substring and a pattern | 0 | numeric |
| Allow Insertions/Deletions | Consider insertions/deletions in search | False | boolean |
| Search in Translation | Translate nucleotide sequence into protein and search in translated sequence | False | boolean |
| Support ambiguous bases | Handle ambiguous bases correctly (disables insertions/deletions when enabled) | False | boolean |
| Qualifier name | Name of the qualifier in result annotations containing the pattern name | pattern | string |
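
To make the Pattern, Max Mismatches, and Support ambiguous bases parameters more concrete, the following Python sketch shows one way such a filter could behave. It is not UGENE's implementation: the function names (`symbols_match`, `contains_pattern`, `filter_sequences`), the `keep_matching` flag, and the rule that two IUPAC codes match when their base sets overlap are all assumptions made for illustration. The sketch ignores insertions/deletions, translation, and the complementary strand.

```python
# Illustrative sketch only; not UGENE's actual search algorithm.

# IUPAC nucleotide codes mapped to the bases they may represent.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC", "B": "CGT", "D": "AGT",
    "H": "ACT", "V": "ACG", "N": "ACGT",
}


def symbols_match(a, b, ambiguous=False):
    """Compare two symbols; with ambiguous=True, overlapping IUPAC sets match.

    The overlap rule is an assumption of this sketch.
    """
    if a == b:
        return True
    if not ambiguous:
        return False
    return bool(set(IUPAC.get(a, a)) & set(IUPAC.get(b, b)))


def contains_pattern(sequence, pattern, max_mismatches=0, ambiguous=False):
    """Return True if some substring matches pattern within max_mismatches."""
    sequence, pattern = sequence.upper(), pattern.upper()
    if not pattern or len(pattern) > len(sequence):
        return False
    for start in range(len(sequence) - len(pattern) + 1):
        mismatches = 0
        for s, p in zip(sequence[start:], pattern):
            if not symbols_match(s, p, ambiguous):
                mismatches += 1
                if mismatches > max_mismatches:
                    break  # too many mismatches at this position
        else:
            return True  # window scanned without exceeding the limit
    return False


def filter_sequences(records, patterns, max_mismatches=0,
                     ambiguous=False, keep_matching=True):
    """Filter a {name: sequence} dict by a semicolon-separated pattern list.

    keep_matching=True keeps sequences that match at least one pattern;
    keep_matching=False keeps those that match none (hypothetical flag).
    """
    pattern_list = [p.strip() for p in patterns.split(";") if p.strip()]
    kept = {}
    for name, seq in records.items():
        matched = any(
            contains_pattern(seq, p, max_mismatches, ambiguous)
            for p in pattern_list
        )
        if matched == keep_matching:
            kept[name] = seq
    return kept


if __name__ == "__main__":
    records = {"seq1": "ACGTTGCA", "seq2": "GGGGGGGG", "seq3": "ACGATGCA"}
    print(filter_sequences(records, "GTTG;CCCC"))
    print(filter_sequences(records, "GTTG;CCCC", max_mismatches=1))
    print(filter_sequences(records, "GTTG;CCCC", keep_matching=False))
```

With the default of 0 mismatches only exact occurrences count, so raising the limit widens the set of sequences that pass the filter.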

3. Output data

This page allows configuring the output file.