Loading…

Semi-supervised prototype network based on compact-uniform-sparse representation for rotating machinery few-shot class incremental fault diagnosis

Distinct from the ideal ‘closed-set’ assumption of comprehensive fault types and arbitrary data access, the entangled challenges of labeled instance paucity and fault increasing with operation are difficult to circumvent in industrial diagnostics. However, conventional deep learning-based diagnostic...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2024-12, Vol.255, p.124660, Article 124660
Main Authors: Zhang, Yu, Han, Dongying, Shi, Peiming
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Distinct from the ideal ‘closed-set’ assumption of comprehensive fault types and arbitrary data access, the entangled challenges of labeled instance paucity and fault increasing with operation are difficult to circumvent in industrial diagnostics. However, conventional deep learning-based diagnostic models lack the capacity for dynamic and incremental acceptance of new faults under limited labeled instances. When confronted with this more industrially realistic diagnostic task, their performance is constrained by overfitting, imbalance, and catastrophic forgetting, thus diminish the feasibility of engineering deployment. In this research, we define this task as a few-shot class incremental fault diagnosis (FS-CIFD) problem and develop a semi-supervised prototype network based on compact-uniform-sparse representation (CUS-SSPN) for this purpose. Structurally, CUS-SSPN incorporates the designed compact-uniform-sparse (CUS) composite loss, optimal transport (OT)-based semi-supervised strategy, and multi-level knowledge distillation (KD) strategy into the prototype network. With the CUS composite loss and OT-based semi-supervised strategy, CUS-SSPN can assign representative prototypes for each incremental fault and form an intra-class compact and inter-class uniform and sparse representation space. This allows the model to incrementally learn new faults while effectively suppressing the overfitting triggered by instance paucity and the biased diagnosis triggered by unbalanced labeling. Combining the designed multi-level KD strategy to convey category centroid information and classification decision knowledge from both prototype and logits levels, catastrophic forgetting of known faults can be effectively mitigated. Extensive FS-CIFD tasks are constructed on both publicly available dataset and practical wind turbine dataset to validate the effectiveness of CUS-SSPN and the feasibility of engineering diagnostics.
ISSN:0957-4174
DOI:10.1016/j.eswa.2024.124660