Loading…

A hybrid approach to outlier detection based on boundary region

► We propose a hybrid approach to outlier detection. ► Our method combines the opinions from boundary-based and distance-based methods. ► Our method adopts different attitudes to objects from different parts of the data set. ► We define a hybrid outlier factor to indicate the degree of outlierness o...

Full description

Saved in:
Bibliographic Details
Published in:Pattern recognition letters 2011-10, Vol.32 (14), p.1860-1870
Main Authors: Jiang, Feng, Sui, Yuefei, Cao, Cungen
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:► We propose a hybrid approach to outlier detection. ► Our method combines the opinions from boundary-based and distance-based methods. ► Our method adopts different attitudes to objects from different parts of the data set. ► We define a hybrid outlier factor to indicate the degree of outlierness of an object. ► The effectiveness of our method is demonstrated on two publicly available data sets. In recent years, much attention has been given to the problem of outlier detection, whose aim is to detect outliers – objects who behave in an unexpected way or have abnormal properties. The identification of outliers is important for many applications such as intrusion detection, credit card fraud, criminal activities in electronic commerce, medical diagnosis and anti-terrorism, etc. In this paper, we propose a hybrid approach to outlier detection, which combines the opinions from boundary-based and distance-based methods for outlier detection ( Jiang et al., 2005, 2009; Knorr and Ng, 1998). We give a novel definition of outliers – BD ( boundary and distance)- based outliers, by virtue of the notion of boundary region in rough set theory and the definitions of distance-based outliers. An algorithm to find such outliers is also given. And the effectiveness of our method for outlier detection is demonstrated on two publicly available databases.
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2011.07.002