Loading…

Learning to Preselection: A Filter-Based Performance Predictor for Multiobjective Feature Selection in Classification

Minimizing the classification error rate and the number of selected features are the two major objectives of feature selection, and they are often in conflict with each other, which is a multiobjective problem. Evolutionary algorithms have been widely used for multiobjective feature selection proble...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on evolutionary computation 2024, p.1-1
Main Authors: Jiao, Ruwang, Xue, Bing, Zhang, Mengjie
Format: Article
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 1
container_issue
container_start_page 1
container_title IEEE transactions on evolutionary computation
container_volume
creator Jiao, Ruwang
Xue, Bing
Zhang, Mengjie
description Minimizing the classification error rate and the number of selected features are the two major objectives of feature selection, and they are often in conflict with each other, which is a multiobjective problem. Evolutionary algorithms have been widely used for multiobjective feature selection problems. Preselection in evolutionary algorithms is used to improve the sampling quality by selecting only potentially promising candidate solutions for fitness evaluations. However, traditional preselection methods struggle to effectively handle feature selection due to its large-scale combinatorial nature and intricate feature interactions. To alleviate this issue, this paper proposes a filter-based performance predictor to preselect feature subsets for subsequent classification fitness evaluations. It uses multiple filter measures to estimate the classification performance of a feature subset, which can explore complex feature interactions and is also insensitive to the dimensionality. Additionally, a correlation coefficient is used to measure the compatibility between the learned performance predictor and the classification performance. Based on the degree of compatibility, a preselection method that considers both the predicted classification performance and the feature subset diversity is proposed, which can preselect promising solutions from multiple candidate solutions and thus improve the feature subset search efficiency. The proposed method is verified experimentally on a total of 18 classification datasets spanning various domains, and the results reveal that it can find feature subsets with better classification performance and converge faster to competitive results compared to state-of-the-art methods.
doi_str_mv 10.1109/TEVC.2024.3373802
format article
fullrecord <record><control><sourceid>crossref_CHZPO</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TEVC_2024_3373802</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10461078</ieee_id><sourcerecordid>10_1109_TEVC_2024_3373802</sourcerecordid><originalsourceid>FETCH-LOGICAL-c133t-20a74f984bd80a1f1537e663d5767907e6b6f9caba1191cc6dc4bdad999f0103</originalsourceid><addsrcrecordid>eNpNkNFKwzAUhoMoOKcPIHiRF-g8p-mSxrtZNhUmDhziXUnTE8noWkk6wbe3xQlenZ_D__0XH2PXCDNE0Lfb5VsxSyHNZkIokUN6wiaoM0wAUnk6ZMh1olT-fs4uYtwBYDZHPWGHNZnQ-vaD9x3fBIrUkO19197xBV_5pqeQ3JtINd9QcF3Ym9bSWKy97bvAhxd_PjQDUe1G8Iv4ikx_CMRf_6a4b3nRmBi989aMn0t25kwT6ep4p2y7Wm6Lx2T98vBULNaJRSH6JAWjMqfzrKpzMOhwLhRJKeq5kkrDkCvptDWVQdRoraztUDW11toBgpgy_J21oYsxkCs_g9-b8F0ilKO2ctRWjtrKo7aBufllPBH962cSQeXiB1x-a6w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Learning to Preselection: A Filter-Based Performance Predictor for Multiobjective Feature Selection in Classification</title><source>IEEE Xplore All Conference Series</source><creator>Jiao, Ruwang ; Xue, Bing ; Zhang, Mengjie</creator><creatorcontrib>Jiao, Ruwang ; Xue, Bing ; Zhang, Mengjie</creatorcontrib><description>Minimizing the classification error rate and the number of selected features are the two major objectives of feature selection, and they are often in conflict with each other, which is a multiobjective problem. Evolutionary algorithms have been widely used for multiobjective feature selection problems. Preselection in evolutionary algorithms is used to improve the sampling quality by selecting only potentially promising candidate solutions for fitness evaluations. However, traditional preselection methods struggle to effectively handle feature selection due to its large-scale combinatorial nature and intricate feature interactions. To alleviate this issue, this paper proposes a filter-based performance predictor to preselect feature subsets for subsequent classification fitness evaluations. It uses multiple filter measures to estimate the classification performance of a feature subset, which can explore complex feature interactions and is also insensitive to the dimensionality. Additionally, a correlation coefficient is used to measure the compatibility between the learned performance predictor and the classification performance. Based on the degree of compatibility, a preselection method that considers both the predicted classification performance and the feature subset diversity is proposed, which can preselect promising solutions from multiple candidate solutions and thus improve the feature subset search efficiency. The proposed method is verified experimentally on a total of 18 classification datasets spanning various domains, and the results reveal that it can find feature subsets with better classification performance and converge faster to competitive results compared to state-of-the-art methods.</description><identifier>ISSN: 1089-778X</identifier><identifier>EISSN: 1941-0026</identifier><identifier>DOI: 10.1109/TEVC.2024.3373802</identifier><identifier>CODEN: ITEVF5</identifier><language>eng</language><publisher>IEEE</publisher><subject>classification ; Evolutionary feature selection ; multiobjective learning ; preselection</subject><ispartof>IEEE transactions on evolutionary computation, 2024, p.1-1</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0003-4463-9538 ; 0000-0003-0780-1110 ; 0000-0002-4865-8026</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10461078$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,4024,27923,27924,27925,54555,54796,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10461078$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Jiao, Ruwang</creatorcontrib><creatorcontrib>Xue, Bing</creatorcontrib><creatorcontrib>Zhang, Mengjie</creatorcontrib><title>Learning to Preselection: A Filter-Based Performance Predictor for Multiobjective Feature Selection in Classification</title><title>IEEE transactions on evolutionary computation</title><addtitle>TEVC</addtitle><description>Minimizing the classification error rate and the number of selected features are the two major objectives of feature selection, and they are often in conflict with each other, which is a multiobjective problem. Evolutionary algorithms have been widely used for multiobjective feature selection problems. Preselection in evolutionary algorithms is used to improve the sampling quality by selecting only potentially promising candidate solutions for fitness evaluations. However, traditional preselection methods struggle to effectively handle feature selection due to its large-scale combinatorial nature and intricate feature interactions. To alleviate this issue, this paper proposes a filter-based performance predictor to preselect feature subsets for subsequent classification fitness evaluations. It uses multiple filter measures to estimate the classification performance of a feature subset, which can explore complex feature interactions and is also insensitive to the dimensionality. Additionally, a correlation coefficient is used to measure the compatibility between the learned performance predictor and the classification performance. Based on the degree of compatibility, a preselection method that considers both the predicted classification performance and the feature subset diversity is proposed, which can preselect promising solutions from multiple candidate solutions and thus improve the feature subset search efficiency. The proposed method is verified experimentally on a total of 18 classification datasets spanning various domains, and the results reveal that it can find feature subsets with better classification performance and converge faster to competitive results compared to state-of-the-art methods.</description><subject>classification</subject><subject>Evolutionary feature selection</subject><subject>multiobjective learning</subject><subject>preselection</subject><issn>1089-778X</issn><issn>1941-0026</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkNFKwzAUhoMoOKcPIHiRF-g8p-mSxrtZNhUmDhziXUnTE8noWkk6wbe3xQlenZ_D__0XH2PXCDNE0Lfb5VsxSyHNZkIokUN6wiaoM0wAUnk6ZMh1olT-fs4uYtwBYDZHPWGHNZnQ-vaD9x3fBIrUkO19197xBV_5pqeQ3JtINd9QcF3Ym9bSWKy97bvAhxd_PjQDUe1G8Iv4ikx_CMRf_6a4b3nRmBi989aMn0t25kwT6ep4p2y7Wm6Lx2T98vBULNaJRSH6JAWjMqfzrKpzMOhwLhRJKeq5kkrDkCvptDWVQdRoraztUDW11toBgpgy_J21oYsxkCs_g9-b8F0ilKO2ctRWjtrKo7aBufllPBH962cSQeXiB1x-a6w</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Jiao, Ruwang</creator><creator>Xue, Bing</creator><creator>Zhang, Mengjie</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0003-4463-9538</orcidid><orcidid>https://orcid.org/0000-0003-0780-1110</orcidid><orcidid>https://orcid.org/0000-0002-4865-8026</orcidid></search><sort><creationdate>2024</creationdate><title>Learning to Preselection: A Filter-Based Performance Predictor for Multiobjective Feature Selection in Classification</title><author>Jiao, Ruwang ; Xue, Bing ; Zhang, Mengjie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c133t-20a74f984bd80a1f1537e663d5767907e6b6f9caba1191cc6dc4bdad999f0103</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>classification</topic><topic>Evolutionary feature selection</topic><topic>multiobjective learning</topic><topic>preselection</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jiao, Ruwang</creatorcontrib><creatorcontrib>Xue, Bing</creatorcontrib><creatorcontrib>Zhang, Mengjie</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Xplore</collection><collection>CrossRef</collection><jtitle>IEEE transactions on evolutionary computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jiao, Ruwang</au><au>Xue, Bing</au><au>Zhang, Mengjie</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Learning to Preselection: A Filter-Based Performance Predictor for Multiobjective Feature Selection in Classification</atitle><jtitle>IEEE transactions on evolutionary computation</jtitle><stitle>TEVC</stitle><date>2024</date><risdate>2024</risdate><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>1089-778X</issn><eissn>1941-0026</eissn><coden>ITEVF5</coden><abstract>Minimizing the classification error rate and the number of selected features are the two major objectives of feature selection, and they are often in conflict with each other, which is a multiobjective problem. Evolutionary algorithms have been widely used for multiobjective feature selection problems. Preselection in evolutionary algorithms is used to improve the sampling quality by selecting only potentially promising candidate solutions for fitness evaluations. However, traditional preselection methods struggle to effectively handle feature selection due to its large-scale combinatorial nature and intricate feature interactions. To alleviate this issue, this paper proposes a filter-based performance predictor to preselect feature subsets for subsequent classification fitness evaluations. It uses multiple filter measures to estimate the classification performance of a feature subset, which can explore complex feature interactions and is also insensitive to the dimensionality. Additionally, a correlation coefficient is used to measure the compatibility between the learned performance predictor and the classification performance. Based on the degree of compatibility, a preselection method that considers both the predicted classification performance and the feature subset diversity is proposed, which can preselect promising solutions from multiple candidate solutions and thus improve the feature subset search efficiency. The proposed method is verified experimentally on a total of 18 classification datasets spanning various domains, and the results reveal that it can find feature subsets with better classification performance and converge faster to competitive results compared to state-of-the-art methods.</abstract><pub>IEEE</pub><doi>10.1109/TEVC.2024.3373802</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0003-4463-9538</orcidid><orcidid>https://orcid.org/0000-0003-0780-1110</orcidid><orcidid>https://orcid.org/0000-0002-4865-8026</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1089-778X
ispartof IEEE transactions on evolutionary computation, 2024, p.1-1
issn 1089-778X
1941-0026
language eng
recordid cdi_crossref_primary_10_1109_TEVC_2024_3373802
source IEEE Xplore All Conference Series
subjects classification
Evolutionary feature selection
multiobjective learning
preselection
title Learning to Preselection: A Filter-Based Performance Predictor for Multiobjective Feature Selection in Classification
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T03%3A58%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Learning%20to%20Preselection:%20A%20Filter-Based%20Performance%20Predictor%20for%20Multiobjective%20Feature%20Selection%20in%20Classification&rft.jtitle=IEEE%20transactions%20on%20evolutionary%20computation&rft.au=Jiao,%20Ruwang&rft.date=2024&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=1089-778X&rft.eissn=1941-0026&rft.coden=ITEVF5&rft_id=info:doi/10.1109/TEVC.2024.3373802&rft_dat=%3Ccrossref_CHZPO%3E10_1109_TEVC_2024_3373802%3C/crossref_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c133t-20a74f984bd80a1f1537e663d5767907e6b6f9caba1191cc6dc4bdad999f0103%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10461078&rfr_iscdi=true