Loading…

Performance comparison of k nearest neighbor classifier with different distance functions

In the field of pattern recognition, K Nearest Neighbor is the classifier algorithm that use distance function to measure similarity between two samples. The well-known distance function used is the Euclidean distance which sees all samples including noisy or outliers with equal important. Euclidean...

Full description

Saved in:
Bibliographic Details
Main Author: Mukahar, Nordiana
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In the field of pattern recognition, K Nearest Neighbor is the classifier algorithm that use distance function to measure similarity between two samples. The well-known distance function used is the Euclidean distance which sees all samples including noisy or outliers with equal important. Euclidean distance is highly influenced by the noisy sample or outliers, and the value returned by similarity metrics may be affected which in turn it will deteriorate the classification performance. This paper conducts experimental comparisons of several distance functions in the KNN classification including Manhattan, Angular, Chebyshev, Cosine, Euclidean, Histogram, Kalmogorov, Mahalanobis, Match and Minkowski. Evaluation of the distance function are made on the 31 selected real-world datasets of different natures from UCI repository and the results show that Manhattan performs better over other distance functions by achieving classification accuracy at 84.63%.
ISSN:0094-243X
1551-7616
DOI:10.1063/5.0192229