Call Variants with SAMtools Element

Calls SNPs and INDELS with SAMtools mpileup and bcftools.

Element type: call_variants

Parameters

ParameterDescriptionDefault valueParameter in Workflow FileType
Output variants fileThe URL to the file with the extracted variations.
ReferenceFile with the reference sequence for all NGS assemblies.string
Use reference fromFile (use same reference for all inputs) or Input port (different references via input port).Filestring
Illumina-1.3+ encodingAssume Illumina 1.3+ quality encoding (mpileup -6).Falseillumina13-encodingboolean
Count anomalous read pairsDo not skip anomalous read pairs (mpileup -A).Falseuse_orphanboolean
Disable BAQ computationDisable base alignment quality calculation (mpileup -B).Falsedisable_baqboolean
Mapping quality downgradeDowngrade mapping quality for mismatched reads (mpileup -C). Recommended for BWA: 50.0capq_thresnumeric
Max reads per BAMLimit number of reads per position (mpileup -d).250max_depthnumeric
Extended BAQ computationImproves MNP sensitivity (mpileup -E).Falseext_baqboolean
BED or position list fileRegions to generate pileup for (mpileup -l).bedstring
Pileup regionRegion to generate pileup (mpileup -r).regstring
Minimum mapping qualityMinimum mapping quality for alignments (mpileup -q).0min_mqnumeric
Minimum base qualityMinimum base quality (mpileup -Q).13min_baseqnumeric
Gap extension errorError probability for gap extension (mpileup -e).20extQnumeric
Homopolymer error coefficientError model coefficient for indels in homopolymers (mpileup -h).100tandemQnumeric
No INDELsDo not call INDELs (mpileup -I).Falseno_indelboolean
Max INDEL depthSkip INDELs above per-sample depth (mpileup -L).250max_indel_depthnumeric
Gap open errorGap open error probability (mpileup -o).40openQnumeric
Platform list for indelsComma-separated list of platforms for indels (mpileup -P).pl_liststring
Retain all alternatesKeep all alternate alleles (bcftools view -A).Falsekeepaltboolean
Indicate PLPL from older SAMtools versions (bcftools view -F).Falsefix_plboolean
No genotype informationSuppress genotypes (bcftools view -G).Falseno_genoboolean
A/C/G/T onlySkip sites where REF is not A/C/G/T (bcftools view -N).Falseacgt_onlyboolean
List of sitesLimit to listed sites (bcftools view -l).bcf_bedstring
QCALL likelihoodOutput QCALL format (bcftools view -Q).Falseqcallboolean
Sample listFile with sample names and ploidy (bcftools view -s).samplesstring
Min sample fractionSkip loci if < fraction of samples are covered (bcftools view -d).0min_smpl_fracnumeric
Per-sample genotypesCall genotypes (bcftools view -g).Truecall_gtboolean
INDEL-to-SNP ratioExpected mutation ratio (bcftools view -i).-1indel_fracnumeric
**Max P(refD)**Site considered variant if probability of reference is below (bcftools view).0.5pref
Allele frequency priorAllele frequency prior: full, cond2, flat, or previous run output (bcftools view -P).fullptypestring
Mutation rateMutation rate (bcftools view -t).0.001thetanumeric
Pair/trio callingEnable pair/trio calling (bcftools view -T).ccallstring
N group-1 samplesNumber of group-1 samples for association test (bcftools view -1).0n1numeric
N permutationsPermutations for association test (bcftools view -U).0n_permnumeric
Min P(chi²)Threshold for P(chi²) to perform permutations.0.01min_perm_pnumeric
Min RMS qualityMinimum RMS mapping quality for SNPs (varFilter -Q).10min-qualnumeric
Min read depthMinimum read depth (varFilter -d).2min-depnumeric
Max read depthMaximum read depth (varFilter -D).10000000max-depnumeric
Alternate basesMinimum alternate base count (varFilter -a).2min-alt-basesnumeric
Gap sizeSNPs within this distance of a gap are filtered (varFilter -w).3gap-sizenumeric
Window sizeFiltering window for adjacent gaps (varFilter -W).10windownumeric
Strand biasP-value threshold for strand bias (varFilter -1).0.0001min-strandnumeric
BaseQ biasMinimum P-value for baseQ bias (varFilter -2).1e-100min-baseQstring
MapQ biasMinimum P-value for mapping quality bias (varFilter -3).0min-mapQnumeric
End distance biasMinimum P-value for bias near sequence ends (varFilter -4).0.0001min-end-distancenumeric
HWEMinimum P-value for Hardy-Weinberg Equilibrium + F<0 (varFilter -e).0.0001min-hwenumeric
Log filteredPrint filtered variants to log (varFilter -p).Falseprint-filteredboolean

Input/Output Ports

The element has 2 input ports:

Name in GUI: Input assembly
Name in Workflow File: in-assembly

Slot In GUISlot in Workflow FileType
Dataset namedatasetstring
Source urlurlstring

Name in GUI: Input sequences
Name in Workflow File: in-sequence

Slot In GUISlot in Workflow FileType
Source urlurlstring

And 1 output port:

Name in GUI: Output variations
Name in Workflow File: out-variations

Slot In GUISlot in Workflow FileType
Variation trackvariation-trackvariation