Loading…
An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data
•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification p...
Saved in:
Published in: | Computers and electronics in agriculture 2021-03, Vol.182, p.105992, Article 105992 |
---|---|
Main Authors: | , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3 |
---|---|
cites | cdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3 |
container_end_page | |
container_issue | |
container_start_page | 105992 |
container_title | Computers and electronics in agriculture |
container_volume | 182 |
creator | Kar, Soumyashree Purbey, Vikram Kumar Suradhaniwar, Saurabh Korbu, Lijalem Balcha Kholová, Jana Durbha, Surya S. Adinarayana, J. Vadez, Vincent |
description | •High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss.
Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes. |
doi_str_mv | 10.1016/j.compag.2021.105992 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2516238709</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0168169921000107</els_id><sourcerecordid>2516238709</sourcerecordid><originalsourceid>FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</originalsourceid><addsrcrecordid>eNp9UU2LFDEUDKLguPoPPAQ895j0VzoXYVl0XVjYi57DS-f1dIbJh0l6Yf-LP9aM7dnTo4qqehRFyEfOjpzx8fP5OAcX4XRsWcsrNUjZviIHPom2EZyJ1-RQZVPDRynfknc5n1nFchIH8vvWU_QZnb4gdTCv1iO9ICRv_YlCjClUki4hUYMFk7Meig2ehoWWFWmIxbrN0QwuXq6WCvGvHJ8hhpLA52jT7oGcMWeHvtAlBUdXe1qbsqawnda4FRpX9KG8xGuOgQLvyZsFLhk__Ls35Oe3rz_uvjePT_cPd7ePzdxNrDTctNx04zKw0Rg5zqCNFsIMCDhqKYBLrqXuWd9pY8zQih4YDkzqsec906a7IZ_23Nr214a5qHPYkq8vVTvwse0mwWRV9btqTiHnhIuKyTpIL4ozdd1BndW-g7ruoPYdqu3LbsPa4NliUnm26Gc0NuFclAn2_wF_AEhdmCg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2516238709</pqid></control><display><type>article</type><title>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</title><source>Elsevier</source><creator>Kar, Soumyashree ; Purbey, Vikram Kumar ; Suradhaniwar, Saurabh ; Korbu, Lijalem Balcha ; Kholová, Jana ; Durbha, Surya S. ; Adinarayana, J. ; Vadez, Vincent</creator><creatorcontrib>Kar, Soumyashree ; Purbey, Vikram Kumar ; Suradhaniwar, Saurabh ; Korbu, Lijalem Balcha ; Kholová, Jana ; Durbha, Surya S. ; Adinarayana, J. ; Vadez, Vincent</creatorcontrib><description>•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss.
Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes.</description><identifier>ISSN: 0168-1699</identifier><identifier>EISSN: 1872-7107</identifier><identifier>DOI: 10.1016/j.compag.2021.105992</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Algorithms ; Artificial neural networks ; Classification ; Ensemble machine learning ; Evapotranspiration ; Forecasting ; Gravimetry ; High throughput phenotyping ; Learning theory ; Load cells ; Machine learning ; Noise monitoring ; Optimization ; Performance measurement ; Redundancy ; Resistance ; Sampling ; Sampling time optimization ; Support vector machines ; Time series ; Time series classification ; Time series forecasting ; Very high frequencies</subject><ispartof>Computers and electronics in agriculture, 2021-03, Vol.182, p.105992, Article 105992</ispartof><rights>2021 Elsevier B.V.</rights><rights>Copyright Elsevier BV Mar 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</citedby><cites>FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Kar, Soumyashree</creatorcontrib><creatorcontrib>Purbey, Vikram Kumar</creatorcontrib><creatorcontrib>Suradhaniwar, Saurabh</creatorcontrib><creatorcontrib>Korbu, Lijalem Balcha</creatorcontrib><creatorcontrib>Kholová, Jana</creatorcontrib><creatorcontrib>Durbha, Surya S.</creatorcontrib><creatorcontrib>Adinarayana, J.</creatorcontrib><creatorcontrib>Vadez, Vincent</creatorcontrib><title>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</title><title>Computers and electronics in agriculture</title><description>•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss.
Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes.</description><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>Classification</subject><subject>Ensemble machine learning</subject><subject>Evapotranspiration</subject><subject>Forecasting</subject><subject>Gravimetry</subject><subject>High throughput phenotyping</subject><subject>Learning theory</subject><subject>Load cells</subject><subject>Machine learning</subject><subject>Noise monitoring</subject><subject>Optimization</subject><subject>Performance measurement</subject><subject>Redundancy</subject><subject>Resistance</subject><subject>Sampling</subject><subject>Sampling time optimization</subject><subject>Support vector machines</subject><subject>Time series</subject><subject>Time series classification</subject><subject>Time series forecasting</subject><subject>Very high frequencies</subject><issn>0168-1699</issn><issn>1872-7107</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9UU2LFDEUDKLguPoPPAQ895j0VzoXYVl0XVjYi57DS-f1dIbJh0l6Yf-LP9aM7dnTo4qqehRFyEfOjpzx8fP5OAcX4XRsWcsrNUjZviIHPom2EZyJ1-RQZVPDRynfknc5n1nFchIH8vvWU_QZnb4gdTCv1iO9ICRv_YlCjClUki4hUYMFk7Meig2ehoWWFWmIxbrN0QwuXq6WCvGvHJ8hhpLA52jT7oGcMWeHvtAlBUdXe1qbsqawnda4FRpX9KG8xGuOgQLvyZsFLhk__Ls35Oe3rz_uvjePT_cPd7ePzdxNrDTctNx04zKw0Rg5zqCNFsIMCDhqKYBLrqXuWd9pY8zQih4YDkzqsec906a7IZ_23Nr214a5qHPYkq8vVTvwse0mwWRV9btqTiHnhIuKyTpIL4ozdd1BndW-g7ruoPYdqu3LbsPa4NliUnm26Gc0NuFclAn2_wF_AEhdmCg</recordid><startdate>202103</startdate><enddate>202103</enddate><creator>Kar, Soumyashree</creator><creator>Purbey, Vikram Kumar</creator><creator>Suradhaniwar, Saurabh</creator><creator>Korbu, Lijalem Balcha</creator><creator>Kholová, Jana</creator><creator>Durbha, Surya S.</creator><creator>Adinarayana, J.</creator><creator>Vadez, Vincent</creator><general>Elsevier B.V</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202103</creationdate><title>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</title><author>Kar, Soumyashree ; Purbey, Vikram Kumar ; Suradhaniwar, Saurabh ; Korbu, Lijalem Balcha ; Kholová, Jana ; Durbha, Surya S. ; Adinarayana, J. ; Vadez, Vincent</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>Classification</topic><topic>Ensemble machine learning</topic><topic>Evapotranspiration</topic><topic>Forecasting</topic><topic>Gravimetry</topic><topic>High throughput phenotyping</topic><topic>Learning theory</topic><topic>Load cells</topic><topic>Machine learning</topic><topic>Noise monitoring</topic><topic>Optimization</topic><topic>Performance measurement</topic><topic>Redundancy</topic><topic>Resistance</topic><topic>Sampling</topic><topic>Sampling time optimization</topic><topic>Support vector machines</topic><topic>Time series</topic><topic>Time series classification</topic><topic>Time series forecasting</topic><topic>Very high frequencies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kar, Soumyashree</creatorcontrib><creatorcontrib>Purbey, Vikram Kumar</creatorcontrib><creatorcontrib>Suradhaniwar, Saurabh</creatorcontrib><creatorcontrib>Korbu, Lijalem Balcha</creatorcontrib><creatorcontrib>Kholová, Jana</creatorcontrib><creatorcontrib>Durbha, Surya S.</creatorcontrib><creatorcontrib>Adinarayana, J.</creatorcontrib><creatorcontrib>Vadez, Vincent</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computers and electronics in agriculture</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kar, Soumyashree</au><au>Purbey, Vikram Kumar</au><au>Suradhaniwar, Saurabh</au><au>Korbu, Lijalem Balcha</au><au>Kholová, Jana</au><au>Durbha, Surya S.</au><au>Adinarayana, J.</au><au>Vadez, Vincent</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data</atitle><jtitle>Computers and electronics in agriculture</jtitle><date>2021-03</date><risdate>2021</risdate><volume>182</volume><spage>105992</spage><pages>105992-</pages><artnum>105992</artnum><issn>0168-1699</issn><eissn>1872-7107</eissn><abstract>•High-frequency evapotranspiration (ET) phenotyping is key to drought-stress analyses.•Optimum ET sampling frequency identified by downscaling 15-min interval ET profiles.•Time series forecasting of ET was performed using ensemble machine-learning models.•ET forecasting and genotype classification performance was compared at each scale.•60-min interval of ET was found optimum with minimum redundancy and information loss.
Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.compag.2021.105992</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0168-1699 |
ispartof | Computers and electronics in agriculture, 2021-03, Vol.182, p.105992, Article 105992 |
issn | 0168-1699 1872-7107 |
language | eng |
recordid | cdi_proquest_journals_2516238709 |
source | Elsevier |
subjects | Algorithms Artificial neural networks Classification Ensemble machine learning Evapotranspiration Forecasting Gravimetry High throughput phenotyping Learning theory Load cells Machine learning Noise monitoring Optimization Performance measurement Redundancy Resistance Sampling Sampling time optimization Support vector machines Time series Time series classification Time series forecasting Very high frequencies |
title | An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T04%3A12%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20ensemble%20machine%20learning%20approach%20for%20determination%20of%20the%20optimum%20sampling%20time%20for%20evapotranspiration%20assessment%20from%20high-throughput%20phenotyping%20data&rft.jtitle=Computers%20and%20electronics%20in%20agriculture&rft.au=Kar,%20Soumyashree&rft.date=2021-03&rft.volume=182&rft.spage=105992&rft.pages=105992-&rft.artnum=105992&rft.issn=0168-1699&rft.eissn=1872-7107&rft_id=info:doi/10.1016/j.compag.2021.105992&rft_dat=%3Cproquest_cross%3E2516238709%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c380t-1d21d36f506dd96cabdb77d5eae6b97a191b9b4043bddd5274a0e509b64140bd3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2516238709&rft_id=info:pmid/&rfr_iscdi=true |