Loading…

An automated deep learning pipeline based on advanced optimisations for leveraging spectral classification modelling

In deep learning (DL) modelling for spectral data, a major challenge is related to the choice of DL network architecture and the selection of the best hyperparameters. Often, slight changes to the neural architecture or its hyperparameter can have a direct influence on the model's performance,...

Full description

Saved in:
Bibliographic Details
Published in:Chemometrics and intelligent laboratory systems 2021-08, Vol.215, p.104354, Article 104354
Main Authors: Passos, Dário, Mishra, Puneet
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In deep learning (DL) modelling for spectral data, a major challenge is related to the choice of DL network architecture and the selection of the best hyperparameters. Often, slight changes to the neural architecture or its hyperparameter can have a direct influence on the model's performance, making its robustness questionable. To deal with it, this study presents an automated deep learning modelling based on advanced optimisation techniques involving Hyperband and Bayesian optimisation, to automatically find optimal neural architecture and its hyperparameters to reach robust DL models. The optimisation requires a base neural architecture to be initialized, however, later it automatically adjusts the neural architecture and the hyperparameters to reach the optimal model. Furthermore, to support the interpretation of the DL models, a wavelength weighing schema based on gradient-weighted class activation mapping (Grad-CAM) was implemented. The potential of the approach was showed on a real case of wheat variety classification with near-infrared spectral data. The performance of the classification was compared with that previously reported on the same dataset with different DL and chemometric approaches. The results showed that with the proposed approach a classification accuracy of 94.9% was reached, which was better than the best reported accuracy on the same data set i.e., 93%. Furthermore, the better performance was obtained with a simpler neural architecture compared to what was used in earlier studies. The automated deep learning based on advanced optimisation can support DL modelling of spectral data. •An automated deep learning pipeline is proposed.•With advanced optimisation optimal DL model were identified.•Ensemble of spectral pre-processing boosted DL model performance.•The proposed optimisation is 3 times faster than random optimisation.
ISSN:0169-7439
1873-3239
DOI:10.1016/j.chemolab.2021.104354