Loading…
Fast K-means algorithm based on a level histogram for image retrieval
•The level histogram is used with the K-means algorithm for clustering data.•The fast K-means algorithm was effectively applied to image database sets.•The fast K-means algorithm improved the efficiency of the traditional K-means algorithm.•The fast K-means algorithm processed images efficiently as...
Saved in:
Published in: | Expert systems with applications 2014-06, Vol.41 (7), p.3276-3283 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •The level histogram is used with the K-means algorithm for clustering data.•The fast K-means algorithm was effectively applied to image database sets.•The fast K-means algorithm improved the efficiency of the traditional K-means algorithm.•The fast K-means algorithm processed images efficiently as images increases.•The selection of the initial cluster centers affected the performance.
In image retrieval, the image feature is the main factor determining accuracy; the color feature is the most important feature and is most commonly used with a K-means algorithm. To create a fast K-means algorithm for this study, first a level histogram of statistics for the image database is made. The level histogram is used with the K-means algorithm for clustering data. A fast K-means algorithm not only shortens the length of time spent on training the image database cluster centers, but it also overcomes the cluster center re-training problem since large numbers of images are continuously added into the database. For the experiment, we use gray and color image database sets for performance comparisons and analyzes, respectively. The results show that the fast K-means algorithm is more effective, faster, and more convenient than the traditional K-means algorithm. Moreover, it overcomes the problem of spending excessive amounts of time on re-training caused by the continuous addition of images to the image database. Selection of initial cluster centers also affects the performance of cluster center training. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2013.11.017 |