UGENE Forum
http://ugene.net/forum/YaBB.pl
http://ugene.net/forum/YaBB.pl?num=1558446444

Message started by Nikolas on May 21st, 2019 at 8:47pm

Title: De novo assembly with multiple sequences (CAP3)
Post by Nikolas on May 21st, 2019 at 8:47pm
Hello,

I have four reads from Sanger sequencing and would like to align them with CAP3. Two of these four reads were generated with the same (forward) primer, the other two with another (reverse) primer. Each primer was used twice on (theoretically) identical templates, hence four reads.

So I select Tools -> Sanger data analysis -> Reads de novo assembly (with CAP3)...
In the dialogue, I add all four .ab1 files and hit Run. The task finishes without error.

But the alignment then only shows two reads (the first two from the list). Moreover, if I instead only add the two sequences from the forward sequencing or the two from the reverse sequencing, the task aborts with the following error messages:

[ERROR][15:17] Subtask {Load document: 'PDS_1F.cap.ace'} is failed: File doesn't contain any msa objects
[ERROR][15:17] Task {CAP3 run and open result task} finished with error: Subtask {Loading documents} is failed: Subtask {Opening view for document: PDS_1F.cap.ace} is failed: Subtask {Load document: 'PDS_1F.cap.ace'} is failed: File doesn't contain any msa objects

Only read pairs with opposite direction seem to work. As far as I understand CAP3 (from reading not more than the abstract of the paper), it should be able to align any number of sequences.

I can align the sequences fine with MUSCLE (inside UGENE) using a FASTA as input, but then I lose the quality info from the chromatograms.

Am I doing something wrong? Any help would be appreciated.

Thanks in advance,
Nikolas

MacOS 10.14.4
UGENE 1.32.0 (64 bit)

Title: Re: De novo assembly with multiple sequences (CAP3)
Post by Olga Golosova on May 22nd, 2019 at 12:30pm
Hello Nikolas,

Could you please send us the reads, we'll try to investigate the problem.


Quote:
As far as I understand CAP3 (from reading not more than the abstract of the paper), it should be able to align any number of sequences.

In general, yes; however, this doesn't work with these exact reads. You might try to tweak the CAP3 parameters.


Quote:
I can align the sequences fine with MUSCLE (inside UGENE) using a FASTA as input, but then I lose the quality info from the chromatograms.

As I understand it, you have a contig sequence that you can use as a reference sequence. Try the second item in the menu, i.e. "Tools > Sanger data analysis > Map reads to reference".

Title: Re: De novo assembly with multiple sequences (CAP3)
Post by Nikolas on May 22nd, 2019 at 5:16pm
Hello Olga,

Thanks for your reply.

I have attached three of the read files to this post; the fourth I will attach to the next post, as it is only possible to attach three 250 KB files and each of them is 150 KB.

I did not have a reference sequence, and my idea was to assemble the reads without one. Of course, I can now use the consensus from the MUSCLE alignment as a reference for the reads. I have tried it as a workaround and it works fine.
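For readers wondering what "using the consensus" amounts to, here is a minimal Python sketch of a majority-vote consensus over sequences that are already aligned (same length, gaps as "-"). It is not the consensus algorithm used by UGENE or MUSCLE, only an illustration; the function name `majority_consensus` and the rule of writing "N" on ties are assumptions made for this example.

```python
from collections import Counter


def majority_consensus(aligned_reads, gap="-"):
    """Return a simple majority-vote consensus of equal-length aligned sequences."""
    if not aligned_reads:
        raise ValueError("need at least one aligned sequence")
    length = len(aligned_reads[0])
    if any(len(seq) != length for seq in aligned_reads):
        raise ValueError("aligned sequences must all have the same length")

    consensus = []
    for column in zip(*aligned_reads):
        # Ignore gaps so that a position covered by only one read is still kept.
        bases = [base.upper() for base in column if base != gap]
        if not bases:
            continue  # all-gap column: contributes nothing to the consensus
        counts = Counter(bases)
        best_base, best_count = counts.most_common(1)[0]
        # Assumption for this sketch: an exact tie is reported as "N".
        tied = [b for b, c in counts.items() if c == best_count]
        consensus.append(best_base if len(tied) == 1 else "N")
    return "".join(consensus)
```

Called on the rows of an exported alignment, this returns one ungapped sequence that could serve as a reference. Note that it ignores base qualities entirely, which is exactly the information a FASTA-based alignment no longer carries.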

It would still be more convenient to directly assemble the reads with CAP3. Which parameters would I have to tweak? As far as I can see, two of my sequences are just being ignored and I don't see any parameters in the Advanced tab that would influence this behavior. Except maybe for the "Assembly reverse reads" flag, which I have switched on and off without any effect on the outcome.

Cheers,
Nikolas

http://ugene.net/forum/YaBB.pl?action=downloadfile;file=PDS_1F_ab1.zip (148 KB | 53 )
http://ugene.net/forum/YaBB.pl?action=downloadfile;file=PDS_1R_ab1.zip (145 KB | 55 )
http://ugene.net/forum/YaBB.pl?action=downloadfile;file=PDS_2F_ab1.zip (146 KB | 53 )

Title: Re: De novo assembly with multiple sequences (CAP3)
Post by Nikolas on May 22nd, 2019 at 5:17pm
Read number four.

http://ugene.net/forum/YaBB.pl?action=downloadfile;file=PDS_2R_ab1.zip (146 KB | 54 )

Title: Re: De novo assembly with multiple sequences (CAP3)
Post by Olga Golosova on May 22nd, 2019 at 7:45pm
The reason the reads are not all assembled with CAP3 is that the reads have different poor-quality ends. If you, for example, modify the "Clipping for poor regions" parameters of the CAP3 tool as follows:

  • "Base quality cutoff for clipping (-c)" = "30"
  • "Clipping range" = "500"

then all four reads will be present in the alignment.
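Outside UGENE, the same settings could in principle be passed to a standalone CAP3 binary. The Python sketch below only builds the command line and does not run anything. It assumes that the binary is called `cap3`, that "Clipping range" corresponds to the `-y` option of standalone CAP3 (the `-c` flag is the one named in the dialog label above), and that the reads are in a hypothetical FASTA file `reads.fasta`. The helper name `build_cap3_command` is made up for this example.

```python
import shlex


def build_cap3_command(reads_fasta, quality_cutoff=30, clipping_range=500,
                       cap3_binary="cap3"):
    """Build (but do not run) a CAP3 command line with custom clipping settings."""
    return [
        cap3_binary,
        reads_fasta,                 # FASTA file with all reads
        "-c", str(quality_cutoff),   # base quality cutoff for clipping
        "-y", str(clipping_range),   # clipping range (assumed flag, see lead-in)
    ]


command = build_cap3_command("reads.fasta")
print(shlex.join(command))
```

To actually run it, the list could be handed to `subprocess.run(command, check=True)`. As far as I know, standalone CAP3 looks for base qualities in a file named like the reads file with `.qual` appended, so the chromatogram qualities would have to be exported to that file first.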

However, to be able to work with the chromatograms, in the current UGENE version you'll have to map the reads to the contig sequence anyway (i.e. use CAP3, export the contig sequence from the alignment, map reads to the contig sequence).


http://ugene.net/forum/YaBB.pl?action=downloadfile;file=Contig1.fa (1 KB | 67 )
CAP3_settings.png (67 KB | 83 )
reads_mapped_to_contig.png (238 KB | 70 )

Title: Re: De novo assembly with multiple sequences (CAP3)
Post by Nikolas on May 24th, 2019 at 2:32am
This worked, thank you!

I would have expected an error message of some sort, stating that not all of my reads could be aligned.
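One way to check this by hand is to look at the `.cap.ace` result file. In the ACE format, the `AS` line declares the number of contigs and reads, and each `AF` line names one read placed in a contig. The Python sketch below compares those names against the reads that were submitted. It assumes the read names in the ACE file match the names you pass in (for example `PDS_1F`), which may not hold for every export; the function names are hypothetical.

```python
def reads_in_ace(ace_text):
    """Return (declared contig count, set of read names) found in ACE-format text."""
    declared_contigs = 0
    read_names = set()
    for line in ace_text.splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "AS" and len(fields) >= 3:
            # AS <number of contigs> <total number of reads>
            declared_contigs = int(fields[1])
        elif fields[0] == "AF" and len(fields) >= 2:
            # AF <read name> <C or U> <padded start position>
            read_names.add(fields[1])
    return declared_contigs, read_names


def missing_reads(ace_text, expected_reads):
    """Return the expected read names that do not appear in any contig."""
    _, assembled = reads_in_ace(ace_text)
    return sorted(set(expected_reads) - assembled)
```

A non-empty result from `missing_reads` means some reads were left out of the assembly, and a declared contig count of zero would presumably correspond to the "File doesn't contain any msa objects" error from the first post. Standalone CAP3 also writes unassembled reads to a `.cap.singlets` file, which is another place to look.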

Cheers and thanks again,
Nikolas

Title: Re: De novo assembly with multiple sequences (CAP3)
Post by Olga Golosova on May 24th, 2019 at 1:34pm
Great! Thanks for informing me about the result!
