Loading…

A relative entropy based feature selection framework for asset data in predictive maintenance

•High-dimensional asset data limit the performance of machine learning algorithms.•Features that measure an asset’s condition are characterized per ability to capture fault implications.•General feature engineering methods should not consistently adequate for raw asset data.•Feature selection method...

Full description

Saved in:
Bibliographic Details
Published in:Computers & industrial engineering 2020-07, Vol.145, p.106536, Article 106536
Main Authors: Aremu, Oluseun Omotola, Cody, Roya Allison, Hyland-Wood, David, McAree, Peter Ross
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•High-dimensional asset data limit the performance of machine learning algorithms.•Features that measure an asset’s condition are characterized per ability to capture fault implications.•General feature engineering methods should not consistently adequate for raw asset data.•Feature selection method applied to C-MAPSS dataset. Predictive maintenance (PdM) is applied to monitor a system’s life cycle to provide current diagnostics, prognostics and provide information capable of guiding maintenance related decisions. Often, an asset’s life cycle is monitored using multiple measurements which translate to high-dimensional (multivariate) data. The large volume of data used to describe an asset’s life cycle has led to current state-of-the-art data-driven PdM relying on machine learning (ML). As research shows, high-dimensional data diminish ML algorithm performance. Generally, high-dimensionality is managed by feature engineering, except asset data characteristics differ from characteristics managed in typical feature engineering problems. In data-driven PdM, information regarding observed faults in an asset is important. Such information is often misinterpreted or lost when general feature engineering is performed on asset data. This work proposes a correlation and relative entropy (C-RE) feature engineering framework specific to asset data. C-RE, applies correlation based hierarchical clustering and relative entropy through the measure of Kullback–Leibler divergence to generate a lower-dimensional feature subset of the original data. The resulting feature subset has minimal redundancies and the highest content of domain-specific information relating to the influence of faults observed during an asset’s life cycle. The utility of C-RE is demonstrated on the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) dataset which describes the run-to-failure life cycles of multiple aircraft engines.
ISSN:0360-8352
1879-0550
DOI:10.1016/j.cie.2020.106536