Loading…

Categorical Feature GAN for Imbalanced Intelligent Fault Diagnosis of Rotating Machinery

The problem of imbalanced data amounts between the samples of healthy and faulty conditions degrades the performance of traditional intelligent diagnosis methods for rotating machinery. This article explores a data generation approach to improve the diagnostic performance of key components in the ro...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on instrumentation and measurement 2023, Vol.72, p.1-12
Main Authors: Dai, Jun, Wang, Jun, Yao, Linquan, Huang, Weiguo, Zhu, Zhongkui
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The problem of imbalanced data amounts between the samples of healthy and faulty conditions degrades the performance of traditional intelligent diagnosis methods for rotating machinery. This article explores a data generation approach to improve the diagnostic performance of key components in the rotating machinery. A new generative adversarial network (GAN) model, called categorical feature GAN (CFGAN), is proposed to generate sufficient synthetic samples for the minority faulty conditions so as to rebalance the datasets. The main contribution is that the quality of the synthetic samples is ensured by considering both consistency and diversity with the real samples, which conduces to machinery fault diagnosis. Specifically, latent categorical features are first learned from the real fault samples via the proposed CFGAN, which is an integration model of autoencoder (AE) and auxiliary classifier GAN (ACGAN). Then, synthetic categorical features are produced by interpolation among the learned categorical features followed by slight noise addition. Finally, synthetic samples are generated using the synthetic categorical features through the generator of the CFGAN. In addition, Wasserstein distance (WD) is applied to the CFGAN to avoid mode collapse in training. Because categorical features rather than random noise are used to generate samples and the interpolation is performed in the feature space instead of the noisy data space, the consistency of the synthetic samples is improved as compared to the traditional methods. The diversity is ensured by the use of diverse synthetic categorical features and the WD. Two machinery datasets under varying imbalance ratios confirmed that the proposed CFGAN model can synthesize high-quality samples and has better generalization capability over the existing data generation methods in machinery fault diagnosis.
ISSN:0018-9456
1557-9662
DOI:10.1109/TIM.2023.3298425