Loading…
A novel test-cost-sensitive attribute reduction approach using the binary bat algorithm
Attribute reductions are essential pre-processing steps in such as data mining, machine learning, pattern recognition and many other fields. Moreover, test-cost-sensitive attribute reductions are often used when we have to deal with cost-sensitive data. The main result of this paper is a new meta-he...
Saved in:
Published in: | Knowledge-based systems 2019-12, Vol.186, p.104938, Article 104938 |
---|---|
Main Authors: | , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Attribute reductions are essential pre-processing steps in such as data mining, machine learning, pattern recognition and many other fields. Moreover, test-cost-sensitive attribute reductions are often used when we have to deal with cost-sensitive data. The main result of this paper is a new meta-heuristic optimization method for finding optimal test-cost-sensitive attribute reduction that is based on binary bat algorithm that originally was designed to model the echolocation behavior of bats when they search their prey. First we provide a 0-1 integer programming algorithm that can calculate optimal reduct but is inefficient for large data sets. We will use it to evaluate other algorithms. Next, a new fitness function that utilizes the pairs of inconsistent objects and does not have any uncertain parameter is design and an efficient algorithm for counting inconsistent pairs is provided. Then, an efficient test-cost-sensitive attribute reduction technique that uses binary bat algorithm is provided. Finally, a new evaluation model with four different evaluation metrics has been proposed and used to evaluate algorithms that only provide sub-optimal solutions. Several experiments were carried out on broadly used benchmark data sets and the results have shown the superiority of our new algorithm, in terms of various metrics, computational time, and classification accuracy, especially for high-dimensional data sets.
•We design a new fitness function based on inconsistent object pair.•We propose a binary bat algorithm for MTRP.•We propose four different metrics for evaluating different algorithms.•Our approach has certain advantages compared with the state-of-the-art algorithms. |
---|---|
ISSN: | 0950-7051 1872-7409 |
DOI: | 10.1016/j.knosys.2019.104938 |