A systematic evaluation of filter Unsupervised Feature Selection methods
Published in: | Expert systems with applications 2020-12, Vol.162, p.113745, Article 113745 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | • A systematic evaluation of filter Unsupervised Feature Selection methods is presented. • The most popular and recent filter UFS methods are included in our study. • The evaluation of the filter UFS methods followed the standards in the literature. • A general discussion based on the results of the evaluated methods is provided. • Some guidelines for the use of the evaluated filter UFS methods are also provided.
Unsupervised Feature Selection (UFS) has attracted great interest in recent years because of its practical significance and its application to a wide variety of problems in expert and intelligent systems where unlabeled data appear. In particular, Unsupervised Feature Selection methods based on the filter approach have received more attention because of their efficiency, scalability, and simplicity. However, the literature contains no comprehensive studies that assess such UFS methods when they are applied, under the same conditions, to a wide variety of real-world data. To fill this gap, we present a comprehensive, systematic empirical evaluation of the most popular and recent filter UFS methods, assessing their performance in terms of clustering, classification, and runtime. The filter methods in our study were applied to 50 datasets from the UCI Machine Learning Repository and 25 high-dimensional datasets from the ASU Feature Selection Repository. To evaluate whether the outcomes obtained by the assessed methods are statistically significant, the Friedman test and the Holm post hoc procedure were applied to the clustering and classification results. Based on our experiments, we provide practical guidelines and insights for the use of the filter UFS methods analyzed in our study. |
---|---|
ISSN: | 0957-4174, 1873-6793 |
DOI: | 10.1016/j.eswa.2020.113745 |
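
The summary mentions that the Friedman test and the Holm post hoc procedure were applied to the clustering and classification results. The following is a minimal sketch of how such a comparison is commonly set up, not the authors' implementation. It assumes a score matrix with one row per dataset and one column per method (higher is better), compares every method against the best-ranked one using the usual rank-based z statistic, and applies Holm's step-down correction. The function name `friedman_holm`, the parameter names, and the example scores are all hypothetical.

```python
import numpy as np
from scipy import stats


def friedman_holm(scores, alpha=0.05):
    """Friedman test plus Holm-corrected comparisons against the best method.

    scores: array of shape (n_datasets, n_methods), higher is better.
    Illustrative sketch only; names and layout are assumptions.
    """
    scores = np.asarray(scores, dtype=float)
    n, k = scores.shape

    # Friedman test: one sample (column) per method, paired across datasets.
    stat, p_value = stats.friedmanchisquare(*scores.T)

    # Rank methods within each dataset (rank 1 = best) and average the ranks.
    ranks = stats.rankdata(-scores, axis=1)
    avg_ranks = ranks.mean(axis=0)

    # The method with the lowest average rank acts as the control.
    control = int(np.argmin(avg_ranks))
    others = [j for j in range(k) if j != control]

    # z statistic for each control-vs-other comparison, two-sided p-values.
    se = np.sqrt(k * (k + 1) / (6.0 * n))
    z = (avg_ranks[others] - avg_ranks[control]) / se
    p_raw = 2 * stats.norm.sf(np.abs(z))

    # Holm step-down: test p-values in ascending order against
    # alpha / (m - step) and stop rejecting at the first failure.
    m = len(others)
    rejected = {}
    still_rejecting = True
    for step, idx in enumerate(np.argsort(p_raw)):
        threshold = alpha / (m - step)
        still_rejecting = still_rejecting and p_raw[idx] <= threshold
        rejected[others[idx]] = bool(still_rejecting)

    return stat, p_value, avg_ranks, control, rejected


# Made-up placeholder scores for 20 datasets and 4 methods (not real results).
rng = np.random.default_rng(0)
example_scores = rng.uniform(0.5, 0.9, size=(20, 4))
stat, p_value, avg_ranks, control, rejected = friedman_holm(example_scores)
print("Friedman statistic:", stat, "p-value:", p_value)
print("Average ranks:", avg_ranks, "control method index:", control)
print("Holm rejections vs. control:", rejected)
```

In this sketch the post hoc step is only meaningful when the Friedman test itself rejects the null hypothesis that all methods perform equally; a caller would typically check `p_value` against `alpha` before interpreting `rejected`.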