Loading…

Model selection and timing of acquisition date impacts classification accuracy: A case study using hyperspectral imaging to detect white pine blister rust over time

•AUC is a better indicator of model fitness for extrapolation than predicted accuracy.•Heterogeneous ensembles are recommended for extrapolation over time.•Detecting infection in future dates is easier than for past dates. Hyperspectral imaging is useful in identifying plant stress over large areas...

Full description

Saved in:
Bibliographic Details
Published in:Computers and electronics in agriculture 2021-12, Vol.191, p.106555, Article 106555
Main Authors: Haagsma, Marja, Page, Gerald F.M., Johnson, Jeremy S., Still, Christopher, Waring, Kristen M., Sniezko, Richard A., Selker, John S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c380t-ad1bdf15ab98f2366bf74e37bb8a6d078777317c90b8078be6b19eba5e6ecd003
cites cdi_FETCH-LOGICAL-c380t-ad1bdf15ab98f2366bf74e37bb8a6d078777317c90b8078be6b19eba5e6ecd003
container_end_page
container_issue
container_start_page 106555
container_title Computers and electronics in agriculture
container_volume 191
creator Haagsma, Marja
Page, Gerald F.M.
Johnson, Jeremy S.
Still, Christopher
Waring, Kristen M.
Sniezko, Richard A.
Selker, John S.
description •AUC is a better indicator of model fitness for extrapolation than predicted accuracy.•Heterogeneous ensembles are recommended for extrapolation over time.•Detecting infection in future dates is easier than for past dates. Hyperspectral imaging is useful in identifying plant stress over large areas or with large numbers of individuals. The vast data sets make machine learning indispensable, but the choice of machine-learning model, the accuracy of models in extrapolation over time (dynamic data), and timing of measurements require further elucidation. We assessed two metrics of performance for selection of classification model: the predicted accuracy (PA); and the area under the receiver-operating characteristic curve (AUC), both from a 10-fold cross-validation. These metrics were calculated for 22 models that were trained to track white pine blister rust disease in seedlings of southwestern white pine (Pinus strobiformis) on 16 dates. In static data (training and testing data are randomly picked from all dates) PA was comparable with AUC at ranking the models for tested accuracy (Spearman’s rank correlation coefficient, hereafter referred to as Spearman’s ρ, were 0.58 and 0.54, respectively). However, for dynamic data (training and testing data came from different dates) AUC was more successful at ranking the models for tested accuracy compared to PA (Spearman’s ρ were 0.37 and 0.31, respectively). Classification accuracies were 74.3 % and 75.8 % for the top PA and AUC models when applied to dynamic data. However, using a heterogeneous ensemble output, the accuracies increased to 77.3% (PA) and 77.6% (AUC). In comparison, if we selected the models based on the tested accuracies (which would not be possible in a real-life application), the best accuracy was 77.7% for a support-vector machine with a linear kernel. Classification accuracy was affected by the size of the time gap between training and testing dates as well as the timing of training and test date. The decline in accuracy with time lag was asymmetric, being more pronounced in classifying retrospectively, i.e., when the testing date came before the training date, than vice versa. Thus, for this system training a model on an early date resulted in higher accuracies than training a model on a later date. As for the timing, the highest average accuracies were obtained with a classifier trained on data acquired during the onset of the disease, which in this study was on DOY 116.
doi_str_mv 10.1016/j.compag.2021.106555
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2619667087</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S016816992100572X</els_id><sourcerecordid>2619667087</sourcerecordid><originalsourceid>FETCH-LOGICAL-c380t-ad1bdf15ab98f2366bf74e37bb8a6d078777317c90b8078be6b19eba5e6ecd003</originalsourceid><addsrcrecordid>eNp9UU1v1DAQtRBILIV_wMES5yx20tgOB6SqolCpiAucLX9Mtl6lcepxivb_8EOZJZx7suf5zXszfoy9l2IvhVQfj_uQHxZ32LeilQSpvu9fsJ00um20FPol2xHNNFINw2v2BvEoqB6M3rE_33OEiSNMEGrKM3dz5DU9pPnA88hdeFwTpn8v0VXgiXxCRR4mh5jGFNzWFcJaXDh94lc8OASOdY0nvuJZ5_60QMGFDIqbSMEdzmjNPEIlkP--T6S8pBm4nxJWKLysWHl-ohvNAm_Zq9FNCO_-nxfs182Xn9ffmrsfX2-vr-6a0BlRGxelj6PsnR_M2HZK-VFfQqe9N05FoY3WupM6DMIbqjwoLwfwrgcFIQrRXbAPm-5S8uMKWO0xr2UmS9sqOSilhdHEutxYoWTEAqNdCi1VTlYKe87DHu2Whz3nYbc8qO3z1ga0wVOCYjEkmAPEVOgXbMzpeYG_QYuZ-w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2619667087</pqid></control><display><type>article</type><title>Model selection and timing of acquisition date impacts classification accuracy: A case study using hyperspectral imaging to detect white pine blister rust over time</title><source>ScienceDirect Freedom Collection</source><creator>Haagsma, Marja ; Page, Gerald F.M. ; Johnson, Jeremy S. ; Still, Christopher ; Waring, Kristen M. ; Sniezko, Richard A. ; Selker, John S.</creator><creatorcontrib>Haagsma, Marja ; Page, Gerald F.M. ; Johnson, Jeremy S. ; Still, Christopher ; Waring, Kristen M. ; Sniezko, Richard A. ; Selker, John S.</creatorcontrib><description>•AUC is a better indicator of model fitness for extrapolation than predicted accuracy.•Heterogeneous ensembles are recommended for extrapolation over time.•Detecting infection in future dates is easier than for past dates. Hyperspectral imaging is useful in identifying plant stress over large areas or with large numbers of individuals. The vast data sets make machine learning indispensable, but the choice of machine-learning model, the accuracy of models in extrapolation over time (dynamic data), and timing of measurements require further elucidation. We assessed two metrics of performance for selection of classification model: the predicted accuracy (PA); and the area under the receiver-operating characteristic curve (AUC), both from a 10-fold cross-validation. These metrics were calculated for 22 models that were trained to track white pine blister rust disease in seedlings of southwestern white pine (Pinus strobiformis) on 16 dates. In static data (training and testing data are randomly picked from all dates) PA was comparable with AUC at ranking the models for tested accuracy (Spearman’s rank correlation coefficient, hereafter referred to as Spearman’s ρ, were 0.58 and 0.54, respectively). However, for dynamic data (training and testing data came from different dates) AUC was more successful at ranking the models for tested accuracy compared to PA (Spearman’s ρ were 0.37 and 0.31, respectively). Classification accuracies were 74.3 % and 75.8 % for the top PA and AUC models when applied to dynamic data. However, using a heterogeneous ensemble output, the accuracies increased to 77.3% (PA) and 77.6% (AUC). In comparison, if we selected the models based on the tested accuracies (which would not be possible in a real-life application), the best accuracy was 77.7% for a support-vector machine with a linear kernel. Classification accuracy was affected by the size of the time gap between training and testing dates as well as the timing of training and test date. The decline in accuracy with time lag was asymmetric, being more pronounced in classifying retrospectively, i.e., when the testing date came before the training date, than vice versa. Thus, for this system training a model on an early date resulted in higher accuracies than training a model on a later date. As for the timing, the highest average accuracies were obtained with a classifier trained on data acquired during the onset of the disease, which in this study was on DOY 116.</description><identifier>ISSN: 0168-1699</identifier><identifier>EISSN: 1872-7107</identifier><identifier>DOI: 10.1016/j.compag.2021.106555</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Accuracy ; Blistering ; Classification ; Correlation coefficients ; Data acquisition ; Digital phenotyping ; Heterogeneous ensemble ; Hyperspectral imaging ; Kernel functions ; Machine learning ; Model accuracy ; Model selection ; Phenological change ; Plant stress ; Ranking ; Support vector machines ; Time lag ; Time measurement ; Training</subject><ispartof>Computers and electronics in agriculture, 2021-12, Vol.191, p.106555, Article 106555</ispartof><rights>2021 Elsevier B.V.</rights><rights>Copyright Elsevier BV Dec 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c380t-ad1bdf15ab98f2366bf74e37bb8a6d078777317c90b8078be6b19eba5e6ecd003</citedby><cites>FETCH-LOGICAL-c380t-ad1bdf15ab98f2366bf74e37bb8a6d078777317c90b8078be6b19eba5e6ecd003</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Haagsma, Marja</creatorcontrib><creatorcontrib>Page, Gerald F.M.</creatorcontrib><creatorcontrib>Johnson, Jeremy S.</creatorcontrib><creatorcontrib>Still, Christopher</creatorcontrib><creatorcontrib>Waring, Kristen M.</creatorcontrib><creatorcontrib>Sniezko, Richard A.</creatorcontrib><creatorcontrib>Selker, John S.</creatorcontrib><title>Model selection and timing of acquisition date impacts classification accuracy: A case study using hyperspectral imaging to detect white pine blister rust over time</title><title>Computers and electronics in agriculture</title><description>•AUC is a better indicator of model fitness for extrapolation than predicted accuracy.•Heterogeneous ensembles are recommended for extrapolation over time.•Detecting infection in future dates is easier than for past dates. Hyperspectral imaging is useful in identifying plant stress over large areas or with large numbers of individuals. The vast data sets make machine learning indispensable, but the choice of machine-learning model, the accuracy of models in extrapolation over time (dynamic data), and timing of measurements require further elucidation. We assessed two metrics of performance for selection of classification model: the predicted accuracy (PA); and the area under the receiver-operating characteristic curve (AUC), both from a 10-fold cross-validation. These metrics were calculated for 22 models that were trained to track white pine blister rust disease in seedlings of southwestern white pine (Pinus strobiformis) on 16 dates. In static data (training and testing data are randomly picked from all dates) PA was comparable with AUC at ranking the models for tested accuracy (Spearman’s rank correlation coefficient, hereafter referred to as Spearman’s ρ, were 0.58 and 0.54, respectively). However, for dynamic data (training and testing data came from different dates) AUC was more successful at ranking the models for tested accuracy compared to PA (Spearman’s ρ were 0.37 and 0.31, respectively). Classification accuracies were 74.3 % and 75.8 % for the top PA and AUC models when applied to dynamic data. However, using a heterogeneous ensemble output, the accuracies increased to 77.3% (PA) and 77.6% (AUC). In comparison, if we selected the models based on the tested accuracies (which would not be possible in a real-life application), the best accuracy was 77.7% for a support-vector machine with a linear kernel. Classification accuracy was affected by the size of the time gap between training and testing dates as well as the timing of training and test date. The decline in accuracy with time lag was asymmetric, being more pronounced in classifying retrospectively, i.e., when the testing date came before the training date, than vice versa. Thus, for this system training a model on an early date resulted in higher accuracies than training a model on a later date. As for the timing, the highest average accuracies were obtained with a classifier trained on data acquired during the onset of the disease, which in this study was on DOY 116.</description><subject>Accuracy</subject><subject>Blistering</subject><subject>Classification</subject><subject>Correlation coefficients</subject><subject>Data acquisition</subject><subject>Digital phenotyping</subject><subject>Heterogeneous ensemble</subject><subject>Hyperspectral imaging</subject><subject>Kernel functions</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Model selection</subject><subject>Phenological change</subject><subject>Plant stress</subject><subject>Ranking</subject><subject>Support vector machines</subject><subject>Time lag</subject><subject>Time measurement</subject><subject>Training</subject><issn>0168-1699</issn><issn>1872-7107</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9UU1v1DAQtRBILIV_wMES5yx20tgOB6SqolCpiAucLX9Mtl6lcepxivb_8EOZJZx7suf5zXszfoy9l2IvhVQfj_uQHxZ32LeilQSpvu9fsJ00um20FPol2xHNNFINw2v2BvEoqB6M3rE_33OEiSNMEGrKM3dz5DU9pPnA88hdeFwTpn8v0VXgiXxCRR4mh5jGFNzWFcJaXDh94lc8OASOdY0nvuJZ5_60QMGFDIqbSMEdzmjNPEIlkP--T6S8pBm4nxJWKLysWHl-ohvNAm_Zq9FNCO_-nxfs182Xn9ffmrsfX2-vr-6a0BlRGxelj6PsnR_M2HZK-VFfQqe9N05FoY3WupM6DMIbqjwoLwfwrgcFIQrRXbAPm-5S8uMKWO0xr2UmS9sqOSilhdHEutxYoWTEAqNdCi1VTlYKe87DHu2Whz3nYbc8qO3z1ga0wVOCYjEkmAPEVOgXbMzpeYG_QYuZ-w</recordid><startdate>202112</startdate><enddate>202112</enddate><creator>Haagsma, Marja</creator><creator>Page, Gerald F.M.</creator><creator>Johnson, Jeremy S.</creator><creator>Still, Christopher</creator><creator>Waring, Kristen M.</creator><creator>Sniezko, Richard A.</creator><creator>Selker, John S.</creator><general>Elsevier B.V</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>KR7</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>202112</creationdate><title>Model selection and timing of acquisition date impacts classification accuracy: A case study using hyperspectral imaging to detect white pine blister rust over time</title><author>Haagsma, Marja ; Page, Gerald F.M. ; Johnson, Jeremy S. ; Still, Christopher ; Waring, Kristen M. ; Sniezko, Richard A. ; Selker, John S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c380t-ad1bdf15ab98f2366bf74e37bb8a6d078777317c90b8078be6b19eba5e6ecd003</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Blistering</topic><topic>Classification</topic><topic>Correlation coefficients</topic><topic>Data acquisition</topic><topic>Digital phenotyping</topic><topic>Heterogeneous ensemble</topic><topic>Hyperspectral imaging</topic><topic>Kernel functions</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Model selection</topic><topic>Phenological change</topic><topic>Plant stress</topic><topic>Ranking</topic><topic>Support vector machines</topic><topic>Time lag</topic><topic>Time measurement</topic><topic>Training</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Haagsma, Marja</creatorcontrib><creatorcontrib>Page, Gerald F.M.</creatorcontrib><creatorcontrib>Johnson, Jeremy S.</creatorcontrib><creatorcontrib>Still, Christopher</creatorcontrib><creatorcontrib>Waring, Kristen M.</creatorcontrib><creatorcontrib>Sniezko, Richard A.</creatorcontrib><creatorcontrib>Selker, John S.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computers and electronics in agriculture</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Haagsma, Marja</au><au>Page, Gerald F.M.</au><au>Johnson, Jeremy S.</au><au>Still, Christopher</au><au>Waring, Kristen M.</au><au>Sniezko, Richard A.</au><au>Selker, John S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Model selection and timing of acquisition date impacts classification accuracy: A case study using hyperspectral imaging to detect white pine blister rust over time</atitle><jtitle>Computers and electronics in agriculture</jtitle><date>2021-12</date><risdate>2021</risdate><volume>191</volume><spage>106555</spage><pages>106555-</pages><artnum>106555</artnum><issn>0168-1699</issn><eissn>1872-7107</eissn><abstract>•AUC is a better indicator of model fitness for extrapolation than predicted accuracy.•Heterogeneous ensembles are recommended for extrapolation over time.•Detecting infection in future dates is easier than for past dates. Hyperspectral imaging is useful in identifying plant stress over large areas or with large numbers of individuals. The vast data sets make machine learning indispensable, but the choice of machine-learning model, the accuracy of models in extrapolation over time (dynamic data), and timing of measurements require further elucidation. We assessed two metrics of performance for selection of classification model: the predicted accuracy (PA); and the area under the receiver-operating characteristic curve (AUC), both from a 10-fold cross-validation. These metrics were calculated for 22 models that were trained to track white pine blister rust disease in seedlings of southwestern white pine (Pinus strobiformis) on 16 dates. In static data (training and testing data are randomly picked from all dates) PA was comparable with AUC at ranking the models for tested accuracy (Spearman’s rank correlation coefficient, hereafter referred to as Spearman’s ρ, were 0.58 and 0.54, respectively). However, for dynamic data (training and testing data came from different dates) AUC was more successful at ranking the models for tested accuracy compared to PA (Spearman’s ρ were 0.37 and 0.31, respectively). Classification accuracies were 74.3 % and 75.8 % for the top PA and AUC models when applied to dynamic data. However, using a heterogeneous ensemble output, the accuracies increased to 77.3% (PA) and 77.6% (AUC). In comparison, if we selected the models based on the tested accuracies (which would not be possible in a real-life application), the best accuracy was 77.7% for a support-vector machine with a linear kernel. Classification accuracy was affected by the size of the time gap between training and testing dates as well as the timing of training and test date. The decline in accuracy with time lag was asymmetric, being more pronounced in classifying retrospectively, i.e., when the testing date came before the training date, than vice versa. Thus, for this system training a model on an early date resulted in higher accuracies than training a model on a later date. As for the timing, the highest average accuracies were obtained with a classifier trained on data acquired during the onset of the disease, which in this study was on DOY 116.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.compag.2021.106555</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0168-1699
ispartof Computers and electronics in agriculture, 2021-12, Vol.191, p.106555, Article 106555
issn 0168-1699
1872-7107
language eng
recordid cdi_proquest_journals_2619667087
source ScienceDirect Freedom Collection
subjects Accuracy
Blistering
Classification
Correlation coefficients
Data acquisition
Digital phenotyping
Heterogeneous ensemble
Hyperspectral imaging
Kernel functions
Machine learning
Model accuracy
Model selection
Phenological change
Plant stress
Ranking
Support vector machines
Time lag
Time measurement
Training
title Model selection and timing of acquisition date impacts classification accuracy: A case study using hyperspectral imaging to detect white pine blister rust over time
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T10%3A24%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Model%20selection%20and%20timing%20of%20acquisition%20date%20impacts%20classification%20accuracy:%20A%20case%20study%20using%20hyperspectral%20imaging%20to%20detect%20white%20pine%20blister%20rust%20over%20time&rft.jtitle=Computers%20and%20electronics%20in%20agriculture&rft.au=Haagsma,%20Marja&rft.date=2021-12&rft.volume=191&rft.spage=106555&rft.pages=106555-&rft.artnum=106555&rft.issn=0168-1699&rft.eissn=1872-7107&rft_id=info:doi/10.1016/j.compag.2021.106555&rft_dat=%3Cproquest_cross%3E2619667087%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c380t-ad1bdf15ab98f2366bf74e37bb8a6d078777317c90b8078be6b19eba5e6ecd003%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2619667087&rft_id=info:pmid/&rfr_iscdi=true