Loading…

Accelerating k-medoid-based algorithms through metric access methods

Scalable data mining algorithms have become crucial to efficiently support KDD processes on large databases. In this paper, we address the task of scaling up k-medoid-based algorithms through the utilization of metric access methods, allowing clustering algorithms to be executed by database manageme...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of systems and software 2008-03, Vol.81 (3), p.343-355
Main Authors: Barioni, Maria Camila N., Razente, Humberto L., Traina, Agma J.M., Traina, Caetano
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Scalable data mining algorithms have become crucial to efficiently support KDD processes on large databases. In this paper, we address the task of scaling up k-medoid-based algorithms through the utilization of metric access methods, allowing clustering algorithms to be executed by database management systems in a fraction of the time usually required by the traditional approaches. We also present an optimization strategy that can be applied as an additional step of the proposed algorithm in order to achieve better clustering solutions. Experimental results based on several datasets, including synthetic and real ones, show that the proposed algorithm can reduce the number of distance calculations by a factor of more than three thousand times when compared to existing algorithms, while producing clusters of equivalent quality.
ISSN:0164-1212
1873-1228
DOI:10.1016/j.jss.2007.06.019