Loading…

Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction

In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (...

Full description

Saved in:
Bibliographic Details
Published in:Sensors (Basel, Switzerland) Switzerland), 2023-02, Vol.23 (5), p.2586
Main Authors: Darra, Nicoleta, Espejo-Garcia, Borja, Kasimati, Aikaterini, Kriezi, Olga, Psomiadis, Emmanouil, Fountas, Spyros
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (April to September) at 5-day intervals. Actual recorded yields were collected across 108 fields, corresponding to a total area of 410.10 ha of processing tomato in central Greece, to assess the performance of Vis at different temporal scales. In addition, VIs were connected with the crop phenology to establish the annual dynamics of the crop. The highest Pearson coefficient (r) values occurred during a period of 80 to 90 days, indicating the strong relationship between the VIs and the yield. Specifically, RVI presented the highest correlation values of the growing season at 80 (r = 0.72) and 90 days (r = 0.75), while NDVI performed better at 85 days (r = 0.72). This output was confirmed by the AutoML technique, which also indicated the highest performance of the VIs during the same period, with the values of the adjusted R ranging from 0.60 to 0.72. The most precise results were obtained with the combination of ARD regression and SVR, which was the most successful combination for building an ensemble (adj. R = 0.67 ± 0.02).
ISSN:1424-8220
1424-8220
DOI:10.3390/s23052586