Loading…

An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data

•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification p...

Full description

Saved in:
Bibliographic Details
Published in:Computers and electronics in agriculture 2021-03, Vol.182, p.105992, Article 105992
Main Authors: Kar, Soumyashree, Purbey, Vikram Kumar, Suradhaniwar, Saurabh, Korbu, Lijalem Balcha, Kholová, Jana, Durbha, Surya S., Adinarayana, J., Vadez, Vincent
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3
cites cdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3
container_end_page
container_issue
container_start_page 105992
container_title Computers and electronics in agriculture
container_volume 182
creator Kar, Soumyashree
Purbey, Vikram Kumar
Suradhaniwar, Saurabh
Korbu, Lijalem Balcha
Kholová, Jana
Durbha, Surya S.
Adinarayana, J.
Vadez, Vincent
description •High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss. Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes.
doi_str_mv 10.1016/j.compag.2021.105992
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2516238709</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0168169921000107</els_id><sourcerecordid>2516238709</sourcerecordid><originalsourceid>FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</originalsourceid><addsrcrecordid>eNp9UU2LFDEUDKLguPoPPAQ895j0VzoXYVl0XVjYi57DS-f1dIbJh0l6Yf-LP9aM7dnTo4qqehRFyEfOjpzx8fP5OAcX4XRsWcsrNUjZviIHPom2EZyJ1-RQZVPDRynfknc5n1nFchIH8vvWU_QZnb4gdTCv1iO9ICRv_YlCjClUki4hUYMFk7Meig2ehoWWFWmIxbrN0QwuXq6WCvGvHJ8hhpLA52jT7oGcMWeHvtAlBUdXe1qbsqawnda4FRpX9KG8xGuOgQLvyZsFLhk__Ls35Oe3rz_uvjePT_cPd7ePzdxNrDTctNx04zKw0Rg5zqCNFsIMCDhqKYBLrqXuWd9pY8zQih4YDkzqsec906a7IZ_23Nr214a5qHPYkq8vVTvwse0mwWRV9btqTiHnhIuKyTpIL4ozdd1BndW-g7ruoPYdqu3LbsPa4NliUnm26Gc0NuFclAn2_wF_AEhdmCg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2516238709</pqid></control><display><type>article</type><title>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</title><source>Elsevier</source><creator>Kar, Soumyashree ; Purbey, Vikram Kumar ; Suradhaniwar, Saurabh ; Korbu, Lijalem Balcha ; Kholová, Jana ; Durbha, Surya S. ; Adinarayana, J. ; Vadez, Vincent</creator><creatorcontrib>Kar, Soumyashree ; Purbey, Vikram Kumar ; Suradhaniwar, Saurabh ; Korbu, Lijalem Balcha ; Kholová, Jana ; Durbha, Surya S. ; Adinarayana, J. ; Vadez, Vincent</creatorcontrib><description>•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss. Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes.</description><identifier>ISSN: 0168-1699</identifier><identifier>EISSN: 1872-7107</identifier><identifier>DOI: 10.1016/j.compag.2021.105992</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Algorithms ; Artificial neural networks ; Classification ; Ensemble machine learning ; Evapotranspiration ; Forecasting ; Gravimetry ; High throughput phenotyping ; Learning theory ; Load cells ; Machine learning ; Noise monitoring ; Optimization ; Performance measurement ; Redundancy ; Resistance ; Sampling ; Sampling time optimization ; Support vector machines ; Time series ; Time series classification ; Time series forecasting ; Very high frequencies</subject><ispartof>Computers and electronics in agriculture, 2021-03, Vol.182, p.105992, Article 105992</ispartof><rights>2021 Elsevier B.V.</rights><rights>Copyright Elsevier BV Mar 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</citedby><cites>FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Kar, Soumyashree</creatorcontrib><creatorcontrib>Purbey, Vikram Kumar</creatorcontrib><creatorcontrib>Suradhaniwar, Saurabh</creatorcontrib><creatorcontrib>Korbu, Lijalem Balcha</creatorcontrib><creatorcontrib>Kholová, Jana</creatorcontrib><creatorcontrib>Durbha, Surya S.</creatorcontrib><creatorcontrib>Adinarayana, J.</creatorcontrib><creatorcontrib>Vadez, Vincent</creatorcontrib><title>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</title><title>Computers and electronics in agriculture</title><description>•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss. Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes.</description><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>Classification</subject><subject>Ensemble machine learning</subject><subject>Evapotranspiration</subject><subject>Forecasting</subject><subject>Gravimetry</subject><subject>High throughput phenotyping</subject><subject>Learning theory</subject><subject>Load cells</subject><subject>Machine learning</subject><subject>Noise monitoring</subject><subject>Optimization</subject><subject>Performance measurement</subject><subject>Redundancy</subject><subject>Resistance</subject><subject>Sampling</subject><subject>Sampling time optimization</subject><subject>Support vector machines</subject><subject>Time series</subject><subject>Time series classification</subject><subject>Time series forecasting</subject><subject>Very high frequencies</subject><issn>0168-1699</issn><issn>1872-7107</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9UU2LFDEUDKLguPoPPAQ895j0VzoXYVl0XVjYi57DS-f1dIbJh0l6Yf-LP9aM7dnTo4qqehRFyEfOjpzx8fP5OAcX4XRsWcsrNUjZviIHPom2EZyJ1-RQZVPDRynfknc5n1nFchIH8vvWU_QZnb4gdTCv1iO9ICRv_YlCjClUki4hUYMFk7Meig2ehoWWFWmIxbrN0QwuXq6WCvGvHJ8hhpLA52jT7oGcMWeHvtAlBUdXe1qbsqawnda4FRpX9KG8xGuOgQLvyZsFLhk__Ls35Oe3rz_uvjePT_cPd7ePzdxNrDTctNx04zKw0Rg5zqCNFsIMCDhqKYBLrqXuWd9pY8zQih4YDkzqsec906a7IZ_23Nr214a5qHPYkq8vVTvwse0mwWRV9btqTiHnhIuKyTpIL4ozdd1BndW-g7ruoPYdqu3LbsPa4NliUnm26Gc0NuFclAn2_wF_AEhdmCg</recordid><startdate>202103</startdate><enddate>202103</enddate><creator>Kar, Soumyashree</creator><creator>Purbey, Vikram Kumar</creator><creator>Suradhaniwar, Saurabh</creator><creator>Korbu, Lijalem Balcha</creator><creator>Kholová, Jana</creator><creator>Durbha, Surya S.</creator><creator>Adinarayana, J.</creator><creator>Vadez, Vincent</creator><general>Elsevier B.V</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202103</creationdate><title>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</title><author>Kar, Soumyashree ; Purbey, Vikram Kumar ; Suradhaniwar, Saurabh ; Korbu, Lijalem Balcha ; Kholová, Jana ; Durbha, Surya S. ; Adinarayana, J. ; Vadez, Vincent</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>Classification</topic><topic>Ensemble machine learning</topic><topic>Evapotranspiration</topic><topic>Forecasting</topic><topic>Gravimetry</topic><topic>High throughput phenotyping</topic><topic>Learning theory</topic><topic>Load cells</topic><topic>Machine learning</topic><topic>Noise monitoring</topic><topic>Optimization</topic><topic>Performance measurement</topic><topic>Redundancy</topic><topic>Resistance</topic><topic>Sampling</topic><topic>Sampling time optimization</topic><topic>Support vector machines</topic><topic>Time series</topic><topic>Time series classification</topic><topic>Time series forecasting</topic><topic>Very high frequencies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kar, Soumyashree</creatorcontrib><creatorcontrib>Purbey, Vikram Kumar</creatorcontrib><creatorcontrib>Suradhaniwar, Saurabh</creatorcontrib><creatorcontrib>Korbu, Lijalem Balcha</creatorcontrib><creatorcontrib>Kholová, Jana</creatorcontrib><creatorcontrib>Durbha, Surya S.</creatorcontrib><creatorcontrib>Adinarayana, J.</creatorcontrib><creatorcontrib>Vadez, Vincent</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computers and electronics in agriculture</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kar, Soumyashree</au><au>Purbey, Vikram Kumar</au><au>Suradhaniwar, Saurabh</au><au>Korbu, Lijalem Balcha</au><au>Kholová, Jana</au><au>Durbha, Surya S.</au><au>Adinarayana, J.</au><au>Vadez, Vincent</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</atitle><jtitle>Computers and electronics in agriculture</jtitle><date>2021-03</date><risdate>2021</risdate><volume>182</volume><spage>105992</spage><pages>105992-</pages><artnum>105992</artnum><issn>0168-1699</issn><eissn>1872-7107</eissn><abstract>•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss. Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.compag.2021.105992</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0168-1699
ispartof Computers and electronics in agriculture, 2021-03, Vol.182, p.105992, Article 105992
issn 0168-1699
1872-7107
language eng
recordid cdi_proquest_journals_2516238709
source Elsevier
subjects Algorithms
Artificial neural networks
Classification
Ensemble machine learning
Evapotranspiration
Forecasting
Gravimetry
High throughput phenotyping
Learning theory
Load cells
Machine learning
Noise monitoring
Optimization
Performance measurement
Redundancy
Resistance
Sampling
Sampling time optimization
Support vector machines
Time series
Time series classification
Time series forecasting
Very high frequencies
title An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T04%3A12%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20ensemble%20machine%20learning%20approach%20for%20determination%20of%20the%20optimum%20sampling%20time%20for%20evapotranspiration%20assessment%20from%20high-throughput%20phenotyping%20data&rft.jtitle=Computers%20and%20electronics%20in%20agriculture&rft.au=Kar,%20Soumyashree&rft.date=2021-03&rft.volume=182&rft.spage=105992&rft.pages=105992-&rft.artnum=105992&rft.issn=0168-1699&rft.eissn=1872-7107&rft_id=info:doi/10.1016/j.compag.2021.105992&rft_dat=%3Cproquest_cross%3E2516238709%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2516238709&rft_id=info:pmid/&rfr_iscdi=true