Loading…

Comparison analysis of K-Means and K-Medoid with Ecluidience Distance Algorithm, Chanberra Distance, and Chebyshev Distance for Big Data Clustering

This study aims to analyze the comparison of object clustering results in big data using K-Means method and K-Medoid method. Combination testing using three algorithms namely Ecludiance Distance, Canberra Distance, and Chebyshev Distance respectively for both methods. The sample of the research is t...

Full description

Saved in:
Bibliographic Details
Published in:IOP conference series. Materials Science and Engineering 2018-10, Vol.420 (1), p.12092
Main Authors: Gultom, Syawal, Sriadhi, S., Martiano, M., Simarmata, Janner
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This study aims to analyze the comparison of object clustering results in big data using K-Means method and K-Medoid method. Combination testing using three algorithms namely Ecludiance Distance, Canberra Distance, and Chebyshev Distance respectively for both methods. The sample of the research is the data set of students of three classes with six variables with a total of 147,679 data. The test results found that K-Means method is more optimal in data clustering than K-Medoid method, both in Ecluid Distance, Chanberra Distance and Chebyshev Distance algorithms which in overall comparison of clustering process with 1: 110.7 rationality. The results also suggested not to useChanberra Distance algorithms for both K-Means and K-Medoid methods because the cluster quality index in the Davies Bouldin test was undefined (∞). The best level of accuracy and quality of cloning is to use K-Means with Chebyshev Distance method, which is to produce five clusters with 0.1 second processing time
ISSN:1757-8981
1757-899X
1757-899X
DOI:10.1088/1757-899X/420/1/012092