Loading…

Evaluation of pre-processing and variable selection on energy dispersive X-ray fluorescence spectral data with partial least square regression: A case of study for soil organic carbon prediction

Most studies which have reported soil fertility attributes employing Energy Dispersive X-ray Fluorescence (EDXRF) combined with multivariate calibration make use of elemental concentration data. This combination may cause relevant information loss contained in EDXRF spectra. However, a well-establis...

Full description

Saved in:
Bibliographic Details
Published in:Spectrochimica acta. Part B: Atomic spectroscopy 2021-01, Vol.175, p.106016, Article 106016
Main Authors: dos Santos, Felipe Rodrigues, de Oliveira, José Francirlei, Bona, Evandro, Barbosa, Graziela M.C., Melquiades, Fábio Luiz
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Most studies which have reported soil fertility attributes employing Energy Dispersive X-ray Fluorescence (EDXRF) combined with multivariate calibration make use of elemental concentration data. This combination may cause relevant information loss contained in EDXRF spectra. However, a well-established soil EDXRF spectra data treatment procedure for multivariate calibration is not currently available. The objective of this study was to evaluate the influence of different pre-processing and variable selection methods in partial least square regression models using EDXRF spectral data. Measurements were obtained under two experimental conditions (15 kV and 50 kV at tube) for soil organic carbon determination. Poisson scaling + mean center proved to be the most suitable pre-processing for this data set. The variable selection by successive projection algorithm for interval selection in partial least squares improved the performance of all tested pre-processing (or at least kept constant in terms of the errors). The 15 kV condition models with Pareto scaling and Poisson scaling + mean center were the most accurate and precise. The ratio to performance of deviation values for these models was of 2.2. The figures of merit demonstrated the soil organic carbon determination feasibility using EDXRF spectral data with these pre-processing since the accuracy, precision and limits of detection were consistent with previous reports. Thus, this study contributes toward the establishment of an approach for soil EDXRF spectral data treatment for multivariate calibration. It also contributes to a better EDXRF variables interpretation which impacts soil organic carbon modeling, demonstrating the proposed methodology potential. [Display omitted] •Poisson scaling + mean center was suitable for pre-processing EDXRF spectral data.•Variable selection highlighted the spectra interval which influenced the models.•SOC was determined with accuracy and precision using EDXRF spectral data.
ISSN:0584-8547
1873-3565
DOI:10.1016/j.sab.2020.106016