Loading…

A novel test-cost-sensitive attribute reduction approach using the binary bat algorithm

Attribute reductions are essential pre-processing steps in such as data mining, machine learning, pattern recognition and many other fields. Moreover, test-cost-sensitive attribute reductions are often used when we have to deal with cost-sensitive data. The main result of this paper is a new meta-he...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge-based systems 2019-12, Vol.186, p.104938, Article 104938
Main Authors: Xie, Xiaojun, Qin, Xiaolin, Zhou, Qian, Zhou, Yanghao, Zhang, Tong, Janicki, Ryszard, Zhao, Wei
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Attribute reductions are essential pre-processing steps in such as data mining, machine learning, pattern recognition and many other fields. Moreover, test-cost-sensitive attribute reductions are often used when we have to deal with cost-sensitive data. The main result of this paper is a new meta-heuristic optimization method for finding optimal test-cost-sensitive attribute reduction that is based on binary bat algorithm that originally was designed to model the echolocation behavior of bats when they search their prey. First we provide a 0-1 integer programming algorithm that can calculate optimal reduct but is inefficient for large data sets. We will use it to evaluate other algorithms. Next, a new fitness function that utilizes the pairs of inconsistent objects and does not have any uncertain parameter is design and an efficient algorithm for counting inconsistent pairs is provided. Then, an efficient test-cost-sensitive attribute reduction technique that uses binary bat algorithm is provided. Finally, a new evaluation model with four different evaluation metrics has been proposed and used to evaluate algorithms that only provide sub-optimal solutions. Several experiments were carried out on broadly used benchmark data sets and the results have shown the superiority of our new algorithm, in terms of various metrics, computational time, and classification accuracy, especially for high-dimensional data sets. •We design a new fitness function based on inconsistent object pair.•We propose a binary bat algorithm for MTRP.•We propose four different metrics for evaluating different algorithms.•Our approach has certain advantages compared with the state-of-the-art algorithms.
ISSN:0950-7051
1872-7409
DOI:10.1016/j.knosys.2019.104938