Loading…

Development and application of an optimised Bayesian shrinkage prior for spectroscopic biomedical diagnostics

•Alternative analysis methods, such as neural networks, require large datasets.•Biological spectra data can be significantly noisy, thus difficult to classify.•Introduction of a new shrinkage prior, LN–CASS, for Bayesian analysis.•Enables accurate and flexible modelling of vibrational spectra data.•...

Full description

Saved in:
Bibliographic Details
Published in:Computer methods and programs in biomedicine 2024-03, Vol.245, p.108014-108014, Article 108014
Main Authors: Chu, Hin On, Buchan, Emma, Smith, David, Goldberg Oppenheimer, Pola
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Alternative analysis methods, such as neural networks, require large datasets.•Biological spectra data can be significantly noisy, thus difficult to classify.•Introduction of a new shrinkage prior, LN–CASS, for Bayesian analysis.•Enables accurate and flexible modelling of vibrational spectra data.•Classification for small datasets, as seen with the saliva dataset with 100 % accuracy. Classification of vibrational spectra is often challenging for biological substances containing similar molecular bonds, interfering with spectral outputs. To address this, various approaches are widely studied. However, whilst providing powerful estimations, these techniques are computationally extensive and frequently overfit the data. Shrinkage priors, which favour models with relatively few predictor variables, are often applied in Bayesian penalisation techniques to avoid overfitting. Using the logit-normal continuous analogue of the spike-and-slab (LN–CASS) as the shrinkage prior and modelling, we have established classification for accurate analysis, with the established system found to be faster than conventional least absolute shrinkage and selection operator, horseshoe or spike-and-slab. These were examined versus coefficient data based on a linear regression model and vibrational spectra produced via density functional theory calculations. Then applied to Raman spectra from saliva to classify the sample sex. Subsequently applied to the acquired spectra from saliva, the evaluated models exhibited high accuracy (AUC>90 %) even when number of parameters was higher than the number of observations. Analyses of spectra for all Bayesian models yielded high-classification accuracy upon cross-validation. Further, for saliva sensing, LN–CASS was found to be the only classifier with 100 %-accuracy in predicting the output based on a leave-one-out cross validation. With potential applications in aiding diagnosis from small spectroscopic datasets and are compatible with a range of spectroscopic data formats. As seen with the classification of IR and Raman spectra. These results are highly promising for emerging developments of spectroscopic platforms for biomedical diagnostic sensing systems.
ISSN:0169-2607
1872-7565
DOI:10.1016/j.cmpb.2024.108014