Extract Transcript Sequences with gffread Element
This workflow element uses gffread to extract transcript sequences from provided genomic sequences and GFF/GTF annotation files.
Element type: gffread
Parameters
| Parameter | Description | Default Value | Parameter in Workflow File | Type |
|---|---|---|---|---|
| Output sequences | Path to the output FASTA file with the extracted sequences. | (not set) | url-out | string |
Input/Output Ports
Input Port
| Name in GUI | Name in Workflow File | Slot | Slot in Workflow File | Type |
|---|---|---|---|---|
| Input transcripts | in-data | Genomic sequence URL | genome | string |
| | | Transcripts URL | transcripts | string |
Output Port
| Name in GUI | Name in Workflow File | Slot | Slot in Workflow File | Type |
|---|---|---|---|---|
| Extracted sequences URL | extracted-data | sequences | sequences | string |
Example
You can use this element in a workflow to extract coding sequences (CDS) or full transcript sequences using a reference genome and GFF/GTF file.
ugene workflow run \
    --element gffread \
    --genome genome.fa \
    --transcripts annotations.gtf \
    --url-out transcripts.fa
This writes transcripts.fa, which contains the sequence of every transcript described in annotations.gtf, extracted from genome.fa.
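To check the result outside UGENE, the same extraction can be approximated by calling gffread directly. The sketch below assumes the element corresponds to a call of the form `gffread -w <output> -g <genome> <annotations>`; this page does not confirm that mapping. The helper name `gffread_cmd` and the file names are hypothetical. The options `-w` (spliced exon sequence per transcript), `-x` (spliced CDS per transcript), and `-g` (genome FASTA) are gffread's own.

```shell
# Hypothetical helper, not part of UGENE: prints a gffread command line.
#   $1 = genome FASTA        (the "genome" slot)
#   $2 = GFF/GTF annotations (the "transcripts" slot)
#   $3 = output FASTA        (the "url-out" parameter)
#   $4 = optional mode: "cds" selects spliced CDS (-x);
#        anything else selects spliced exons (-w)
gffread_cmd() {
  flag="-w"
  if [ "${4:-transcript}" = "cds" ]; then
    flag="-x"
  fi
  printf 'gffread %s %s -g %s %s\n' "$flag" "$3" "$1" "$2"
}

# Print the command for review instead of running it,
# so the sketch works even where gffread is not installed.
gffread_cmd genome.fa annotations.gtf transcripts.fa
```

The helper only prints the command, so the arguments can be inspected before anything runs. Paths containing spaces would need quoting before the printed line is executed.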