Loading…

DySC: software for greedy clustering of 16S rRNA reads

Pyrosequencing technologies are frequently used for sequencing the 16S ribosomal RNA marker gene for profiling microbial communities. Clustering of the produced reads is an important but time-consuming task. We present Dynamic Seed-based Clustering (DySC), a new tool based on the greedy clustering a...

Full description

Saved in:
Bibliographic Details
Published in:Bioinformatics 2012-08, Vol.28 (16), p.2182-2183
Main Authors: ZEJUN ZHENG, KRAMER, Stefan, SCHMIDT, Bertil
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Pyrosequencing technologies are frequently used for sequencing the 16S ribosomal RNA marker gene for profiling microbial communities. Clustering of the produced reads is an important but time-consuming task. We present Dynamic Seed-based Clustering (DySC), a new tool based on the greedy clustering approach that uses a dynamic seeding strategy. Evaluations based on the normalized mutual information (NMI) criterion show that DySC produces higher quality clusters than UCLUST and CD-HIT at a comparable runtime. DySC, implemented in C, is available at http://code.google.com/p/dysc/ under GNU GPL license.
ISSN:1367-4803
1367-4811
1460-2059
DOI:10.1093/bioinformatics/bts355