Loading…

A systematic evaluation of filter Unsupervised Feature Selection methods

•A systematic evaluation of filter Unsupervised Feature Selection methods is presented.•The most popular and recent filter UFS methods are included in our study.•The evaluation of the filter UFS methods followed the standards in the literature.•A general discussion based on the results of the evalua...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2020-12, Vol.162, p.113745, Article 113745
Main Authors: Solorio-Fernández, Saúl, Ariel Carrasco-Ochoa, J., Martínez-Trinidad, José Fco
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•A systematic evaluation of filter Unsupervised Feature Selection methods is presented.•The most popular and recent filter UFS methods are included in our study.•The evaluation of the filter UFS methods followed the standards in the literature.•A general discussion based on the results of the evaluated methods is provided.•Some guidelines for the use of the evaluated filter UFS methods is also provided. Unsupervised Feature Selection (UFS) has aroused great interest in the last years because of its practical significance and application on a large variety of problems in expert and intelligent systems where unlabeled data appear. Specifically, Unsupervised Feature Selection methods based on the filter approach have received more attention due to their efficiency, scalability, and simplicity. However, in the literature, there are no comprehensive studies for assessing such UFS methods when they are applied, under the same conditions, to a wide variety of real-world data. To fill this gap, in this paper, we present a comprehensive empirical and systematic evaluation of the most popular and recent filter UFS methods, evaluating their performance in terms of clustering, classification, and runtime. The filter methods used in our study were applied on 50 datasets from the UCI Machine Learning Repository and 25 high dimensional datasets from the ASU Feature Selection Repository. To evaluate if the outcomes obtained by the assessed methods are statistically significant, the Friedman test and Holm post hoc procedure were applied in the clustering and classification results. From our experiments, we provide some practical guidelines and insights for the use of the filter UFS methods analyzed in our study.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2020.113745