Loading…

Ensemble dimension reduction based on spectral disturbance for subspace clustering

The feature distribution of high dimension, small sample size (HDSS) data is sparse, resulting in unsatisfactory clustering results. Dimension reduction methods play an inevitable role in analyzing and visualizing high-dimensional data. It is likely to cause the matrix singularity for subspace clust...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge-based systems 2021-09, Vol.227, p.107182, Article 107182
Main Authors: Chen, Xiaoyun, Wang, Qiaoping, Zhuang, Shanshan
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The feature distribution of high dimension, small sample size (HDSS) data is sparse, resulting in unsatisfactory clustering results. Dimension reduction methods play an inevitable role in analyzing and visualizing high-dimensional data. It is likely to cause the matrix singularity for subspace clustering when directly reduce the dimension of HDSS dataset. Therefore, we construct multiple data subsets from the original HDSS dataset for ensemble dimension reduction. Projection least square regression subspace clustering (PLSR) which combines projection technique with least-square regression is used as a base dimension reducer for ensemble dimension reduction, called EPLSR. Considering the spectral properties of spectral clustering, we propose the ensemble dimension reduction for subspace clustering based on spectral disturbance (SD-EPLSR) method. According to the theory of spectral disturbance, the weight coefficients are learned according to two principles: 1. The clustering results on each data subset should be close to the consensus clustering result. 2. Data subsets with similar clustering results should have approximate weights. Experiments on eight HDSS datasets demonstrate that our method is effective.
ISSN:0950-7051
1872-7409
DOI:10.1016/j.knosys.2021.107182