Assembly Sequences with CAP3

CAP3 is a contig assembly program. It allows the assembly of long DNA reads (up to 1000 bp).

Binaries can be downloaded from http://seq.cs.iastate.edu/cap3.html
Huang, X. and Madan, A. (1999) CAP3: A DNA Sequence Assembly Program, Genome Research, 9: 868-877.

Element type: cap3

Parameters

ParameterDescriptionDefault valueParameter in Workflow FileType
Output fileWrite assembly results to this output file in ACE format.result.aceout-filestring
Quality cutoff for clippingBase quality cutoff for clipping (-c).12clipping-cutoffnumeric
Clipping rangeSet a number whose unit is base. It will get the refGenes in n bases from the peak center. (–distance).100clipping-rangenumeric
Quality cutoff for differencesBase quality cutoff for differences (-b).20diff-cutoffnumeric
Maximum difference scoreMax qscore sum at differences (-d). If an overlap contains numerous differences at bases of high quality, the overlap is removed.200diff-max-qscorenumeric
Match score factorAffects the similarity score of an overlap (-m).2match-score-factornumeric
Mismatch score factorAffects the similarity score of an overlap (-n).-5mismatch-score-factornumeric
Gap penalty factorAffects the similarity score of an overlap (-g).6gap-penalty-factornumeric
Overlap similarity score cutoffIf the similarity score of an overlap is less than this cutoff (-s), the overlap is removed.900overlap-sim-score-cutoffnumeric
Overlap length cutoffOverlaps must be at least this long in bp (-o).40overlap-length-cutoffnumeric
Overlap percent identity cutoffOverlaps must meet or exceed this percent identity (-p).90overlap-perc-id-cutoffnumeric
Max number of word matchesUpper limit for the number of word matches considered (-t). Controls the trade-off between accuracy and speed.300max-num-word-matchesnumeric
Band expansion sizeCAP3 expands alignment bands by this size (-a).20band-exp-sizenumeric
Max gap length in an overlapMaximum length of gap in overlaps allowed (-f).20max-gap-in-overlapnumeric
Assembly reverse readsWhether to consider reverse orientation reads in assembly (-r).Trueassembly-reverseboolean
CAP3 tool pathPath to the CAP3 executable.defaultpathstring
Temporary directoryDirectory for temporary files.defaulttmp-dirstring

Input/Output Ports

The element has 1 input port:

Name in GUI: Input sequences
Name in Workflow File: in-data
Slots:

Slot In GUISlot in Workflow FileType
Dataset namedatasetstring
Input URL(s)in.urlstring