Assembly Sequences with CAP3
CAP3 is a contig assembly program. It allows the assembly of long DNA reads (up to 1000 bp).
Binaries can be downloaded from http://seq.cs.iastate.edu/cap3.html
Huang, X. and Madan, A. (1999) CAP3: A DNA Sequence Assembly Program, Genome Research, 9: 868-877.
Element type: cap3
Parameters
| Parameter | Description | Default value | Parameter in Workflow File | Type | 
|---|---|---|---|---|
| Output file | Write assembly results to this output file in ACE format. | result.ace | out-file | string | 
| Quality cutoff for clipping | Base quality cutoff for clipping (-c). | 12 | clipping-cutoff | numeric | 
| Clipping range | Set a number whose unit is base. It will get the refGenes in n bases from the peak center. (–distance). | 100 | clipping-range | numeric | 
| Quality cutoff for differences | Base quality cutoff for differences (-b). | 20 | diff-cutoff | numeric | 
| Maximum difference score | Max qscore sum at differences (-d). If an overlap contains numerous differences at bases of high quality, the overlap is removed. | 200 | diff-max-qscore | numeric | 
| Match score factor | Affects the similarity score of an overlap (-m). | 2 | match-score-factor | numeric | 
| Mismatch score factor | Affects the similarity score of an overlap (-n). | -5 | mismatch-score-factor | numeric | 
| Gap penalty factor | Affects the similarity score of an overlap (-g). | 6 | gap-penalty-factor | numeric | 
| Overlap similarity score cutoff | If the similarity score of an overlap is less than this cutoff (-s), the overlap is removed. | 900 | overlap-sim-score-cutoff | numeric | 
| Overlap length cutoff | Overlaps must be at least this long in bp (-o). | 40 | overlap-length-cutoff | numeric | 
| Overlap percent identity cutoff | Overlaps must meet or exceed this percent identity (-p). | 90 | overlap-perc-id-cutoff | numeric | 
| Max number of word matches | Upper limit for the number of word matches considered (-t). Controls the trade-off between accuracy and speed. | 300 | max-num-word-matches | numeric | 
| Band expansion size | CAP3 expands alignment bands by this size (-a). | 20 | band-exp-size | numeric | 
| Max gap length in an overlap | Maximum length of gap in overlaps allowed (-f). | 20 | max-gap-in-overlap | numeric | 
| Assembly reverse reads | Whether to consider reverse orientation reads in assembly (-r). | True | assembly-reverse | boolean | 
| CAP3 tool path | Path to the CAP3 executable. | default | path | string | 
| Temporary directory | Directory for temporary files. | default | tmp-dir | string | 
Input/Output Ports
The element has 1 input port:
Name in GUI: Input sequences
Name in Workflow File: in-data
Slots:
| Slot In GUI | Slot in Workflow File | Type | 
|---|---|---|
| Dataset name | dataset | string | 
| Input URL(s) | in.url | string |