Loading…

A novel multi-label feature selection method based on knowledge consistency-independence index

Multi-label classification encounters the challenge of dealing with high dimensional data. In response to this challenge, numerous researchers have proposed various multi-label feature selection methods from different perspectives. However, existing methods overlook the consistency and independence...

Full description

Saved in:
Bibliographic Details
Published in:Information sciences 2024-08, Vol.677, Article 120870
Main Authors: Liu, Xiangbin, Zheng, Heming, Chen, Wenxiang, Xia, Liyun, Dai, Jianhua
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Multi-label classification encounters the challenge of dealing with high dimensional data. In response to this challenge, numerous researchers have proposed various multi-label feature selection methods from different perspectives. However, existing methods overlook the consistency and independence of knowledge granules, and thus fail to extract valuable and distinctive information from the knowledge granules that is relevant to the label space. To address this issue, we propose a novel multi-label feature selection method based on the knowledge consistency-independence index (CIMLFS). Firstly, we introduce the concepts of knowledge consistency granularity and knowledge independence granularity to explore valuable and distinctive information from the knowledge granule families. Secondly, based upon these concepts, we define the consistency coefficient, independence coefficient, and consistency gain for features, ultimately considering the three perspectives to achieve the knowledge consistency-independence index. Furthermore, we present a multi-label feature selection method utilizing the index. Finally, to assess the effectiveness of CIMLFS, we conduct comparative experiments with eight representative multi-label feature selection methods on twelve benchmark multi-label data sets and using four evaluation metrics. The final experimental results indicate that CIMLFS ranks the first on three metrics and the second on one metric.
ISSN:0020-0255
1872-6291
DOI:10.1016/j.ins.2024.120870