Welcome, Guest. Please Login or Register
UGENE Bulletin Board
  Welcome to our forum.
  HomeHelpSearchLoginRegister  
 
 
Page Index Toggle Pages: 1
Consider inclusion of CD-hit for selection of representative sequences out of large datasets (Read 137 times)
Aug 30th, 2017 at 3:42pm

Peacemaker   Offline
YaBB Newbies

Posts: 11
*
 
Thank you for the great improvements that ugene has seen over the last couple of years.
I would like to suggest to include the software cd-hit (http://cd-hit.org) as a plugin. This would help to reduce the complexity of large datasets of homologous sequences by automatically selecting representative sequences which feature identities below a certain identity threshold. I would suggest to consider including the two basic functions of the cd-hit suite, cd-hit for protein sequences and cd-hit-est for DNA sequences (http://weizhongli-lab.org/cdhit_suite/cgi-bin/index.cgi).
Thank you for your consideration.
 
IP Logged
 
Reply #1 - Aug 30th, 2017 at 8:34pm

Yuliya Algaer   Offline
Global Moderator

Posts: 110
*****
 
Dear Peacemaker,

Thanks for your request!

We will add this feature in future UGENE versions.

However, this feature is quite big and it now has middle priority for us. If you need the feature to be implemented as soon as possible, you might consider to use our commercial support services.

Anyway, thank you again for the request! We will keep it in mind, while planning the UGENE future version.
 
IP Logged
 
Page Index Toggle Pages: 1