Extract Transcript Sequences with gffread Element

This workflow element uses gffread to extract transcript sequences from provided genomic sequences and GFF/GTF annotation files.

Element type: gffread


Parameters

| Parameter | Description | Default Value | Parameter in Workflow File | Type |
| Output sequences | Path to the output FASTA file with the extracted sequences. | (not set) | url-out | string |

Input/Output Ports

Input Port

| Name in GUI | Name in Workflow File | Slot | Slot in Workflow File | Type |
| Input transcripts | in-data | Genomic sequence URL | genome | string |
| | | Transcripts URL | transcripts | string |

Output Port

| Name in GUI | Name in Workflow File | Slot | Slot in Workflow File | Type |
| Extracted sequences URL | extracted-data | sequences | sequences | string |

Example

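You can use this element in a workflow to extract coding sequences (CDS) or full transcript sequences from a reference genome and a GFF/GTF annotation file. The command below is schematic: it shows which values go to the genome and transcripts slots and to the url-out parameter.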

ugene workflow run \
  --element gffread \
  --genome genome.fa \
  --transcripts annotations.gtf \
  --url-out transcripts.fa

This writes transcripts.fa, a FASTA file containing the sequence of every transcript described in annotations.gtf, extracted from genome.fa.
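
For comparison, here is a minimal sketch of calling gffread directly with the same three files. It assumes the standalone gffread tool is installed and uses its -g (genome FASTA), -w (spliced exon sequence per transcript) and -x (spliced CDS per transcript) options. Which of these options the workflow element passes is not stated on this page, so treat the mapping as an assumption. The gffread_cmd helper is a hypothetical name introduced for illustration.

```shell
#!/bin/sh
# Placeholder file names taken from the example above.
GENOME="genome.fa"
ANNOTATION="annotations.gtf"
OUTPUT="transcripts.fa"

# Hypothetical helper: print the gffread command line for a given mode.
#   transcripts -> -w  (spliced exons of each transcript)
#   cds         -> -x  (spliced CDS of each transcript)
gffread_cmd() {
    case "$1" in
        transcripts) flag="-w" ;;
        cds)         flag="-x" ;;
        *) echo "unknown mode: $1" >&2; return 1 ;;
    esac
    echo "gffread $flag $OUTPUT -g $GENOME $ANNOTATION"
}

# Run the command only if gffread and both input files are present;
# otherwise just show what would be run.
cmd=$(gffread_cmd transcripts)
if command -v gffread >/dev/null 2>&1 && [ -f "$GENOME" ] && [ -f "$ANNOTATION" ]; then
    $cmd
else
    echo "would run: $cmd"
fi
```

Calling gffread_cmd cds instead builds the command for CDS extraction, which corresponds to the coding-sequence use case mentioned above.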