Loading…

Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy

Computerized cognitive screening tools, such as the self-administered Computerized Assessment of Memory Cognitive Impairment (CAMCI), require little training and ensure standardized administration and could be an ideal test for primary care settings. We conducted a secondary analysis of a data set i...

Full description

Saved in:

Bibliographic Details
Published in:	Psychological assessment 2019-11, Vol.31 (11), p.1377-1382
Main Authors:	Ursenbach, Jake, O'Connell, Megan E., Neiser, Jennafer, Tierney, Mary C., Morgan, Debra, Kosteniuk, Julie, Spiteri, Raymond J.
Format:	Article
Language:	English
Subjects:	Aged Aged, 80 and over Algorithms Cognitive ability Cognitive Assessment Cognitive Dysfunction - diagnosis Cognitive Dysfunction - psychology Cognitive Impairment Computer Assisted Instruction Computers Diagnosis, Computer-Assisted - methods Female Human Humans Logistic Regression Machine Learning Male Medical screening Mental disorders Middle Aged Mild Cognitive Impairment Neuropsychological Tests - standards Reproducibility of Results Scoring (Testing) Screening Tests Sensitivity and Specificity
Citations:	Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-a381t-d9217f8c8f4d6079d6957b65a56be38da0a7d1bb4a6d9df837f3587327920be63
cites
container_end_page	1382
container_issue	11
container_start_page	1377
container_title	Psychological assessment
container_volume	31
creator	Ursenbach, Jake O'Connell, Megan E. Neiser, Jennafer Tierney, Mary C. Morgan, Debra Kosteniuk, Julie Spiteri, Raymond J.
description	Computerized cognitive screening tools, such as the self-administered Computerized Assessment of Memory Cognitive Impairment (CAMCI), require little training and ensure standardized administration and could be an ideal test for primary care settings. We conducted a secondary analysis of a data set including 887 older adults (M age = 72.7 years, SD = 7.1 years; 32.1% male; M years education = 13.4, SD = 2.7 years) with CAMCI scores and independent diagnoses of mild cognitive impairment (MCI). A study by the CAMCI developers used a portion of this data set with a machine learning decision tree model and suggested that the CAMCI had high classification accuracy for MCI (sensitivity = 0.86, specificity = 0.94). We found similar support for accuracy (sensitivity = 0.94, specificity = 0.94) by overfitting a decision tree model, but we found evidence of lower accuracy in a cross-validation sample (sensitivity = 0.62, specificity = 0.66). A logistic regression model, however, discriminated modestly in both training (sensitivity = 0.72, specificity = 0.80) and cross-validation data sets (sensitivity = 0.69, specificity = 0.74). Evidence for strong accuracy when overfitting a decision tree model and substantially reduced accuracy in cross-validation samples was replicated across 500 bootstrapped samples. In contrast, the evidence for accuracy of the logistic regression model was similar in the training and cross-validation samples. The logistic regression model produced accuracy estimates consistent with other published CAMCI studies, suggesting evidence for classification accuracy of the CAMCI for MCI is likely modest. This case study illustrates the general need for cross-validation and careful evaluation of the generalizability of machine learning models. Public Significance Statement Using an archival database, a decision tree machine learning method demonstrated overfitting and had substantially reduced evidence for classification accuracy measured in cross-validation, but the statistical method results in similar evidence of classification in training and cross-validation samples. The evidence for classification accuracy of the Computerized Assessment of Mild Cognitive Impairment for cognitive impairment is modest, and this has clinical implications.
doi_str_mv	10.1037/pas0000764
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2273756281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2312226464</sourcerecordid><originalsourceid>FETCH-LOGICAL-a381t-d9217f8c8f4d6079d6957b65a56be38da0a7d1bb4a6d9df837f3587327920be63</originalsourceid><addsrcrecordid>eNp9kVGL1DAQx4so3nn64geQgC8iVJMmbVLfesuqCyv3cCf4VqbpZDdH29QkPdzP5pcze3sq-GAIZJL5zT_D_LPsJaPvGOXy_QyBpiUr8Sg7ZzWvc8bFt8cppoLmvKzpWfYshFtKmeCqfJqdcSaYUCU_z35ea-fttCPNsEtB3I-BGOcJkJUb5yWizy8hYJ-uu8lGe4fkWnvE6Vhz49zwgTQT2QzDEqKH-_z6B4zzgMQZcnWH3tgYj_AX0Hs7Idki-PvqZp69S48YCEw9iXskm3EGHYmbyDpEO0JMuSSzGiAEa6xOH6Rco_XiQR-eZ08MDAFfPJwX2deP65vV53x79WmzarY5cMVi3tcFk0ZpZURfUVn3VV3KriqhrDrkqgcKsmddJ6Dq694oLg0vleSFrAvaYcUvsjcn3dTv9wVDbEcbNA4DTOiW0BaF5LKsCsUS-vof9NYtfkrdtQVnRVFUohL_pZIWl4pKmqi3J0p7F4JH084-zcQfWkbbo_HtX-MT_OpBculG7P-gv51OQH4CYIZ2DgcNPlo9YEiz9DjFo1iiW5Y2l5L_Al6mulI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2273378070</pqid></control><display><type>article</type><title>Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy</title><source>PsycARTICLES</source><creator>Ursenbach, Jake ; O'Connell, Megan E. ; Neiser, Jennafer ; Tierney, Mary C. ; Morgan, Debra ; Kosteniuk, Julie ; Spiteri, Raymond J.</creator><contributor>Ben-Porath, Yossef S</contributor><creatorcontrib>Ursenbach, Jake ; O'Connell, Megan E. ; Neiser, Jennafer ; Tierney, Mary C. ; Morgan, Debra ; Kosteniuk, Julie ; Spiteri, Raymond J. ; Ben-Porath, Yossef S</creatorcontrib><description>Computerized cognitive screening tools, such as the self-administered Computerized Assessment of Memory Cognitive Impairment (CAMCI), require little training and ensure standardized administration and could be an ideal test for primary care settings. We conducted a secondary analysis of a data set including 887 older adults (M age = 72.7 years, SD = 7.1 years; 32.1% male; M years education = 13.4, SD = 2.7 years) with CAMCI scores and independent diagnoses of mild cognitive impairment (MCI). A study by the CAMCI developers used a portion of this data set with a machine learning decision tree model and suggested that the CAMCI had high classification accuracy for MCI (sensitivity = 0.86, specificity = 0.94). We found similar support for accuracy (sensitivity = 0.94, specificity = 0.94) by overfitting a decision tree model, but we found evidence of lower accuracy in a cross-validation sample (sensitivity = 0.62, specificity = 0.66). A logistic regression model, however, discriminated modestly in both training (sensitivity = 0.72, specificity = 0.80) and cross-validation data sets (sensitivity = 0.69, specificity = 0.74). Evidence for strong accuracy when overfitting a decision tree model and substantially reduced accuracy in cross-validation samples was replicated across 500 bootstrapped samples. In contrast, the evidence for accuracy of the logistic regression model was similar in the training and cross-validation samples. The logistic regression model produced accuracy estimates consistent with other published CAMCI studies, suggesting evidence for classification accuracy of the CAMCI for MCI is likely modest. This case study illustrates the general need for cross-validation and careful evaluation of the generalizability of machine learning models. Public Significance Statement Using an archival database, a decision tree machine learning method demonstrated overfitting and had substantially reduced evidence for classification accuracy measured in cross-validation, but the statistical method results in similar evidence of classification in training and cross-validation samples. The evidence for classification accuracy of the Computerized Assessment of Mild Cognitive Impairment for cognitive impairment is modest, and this has clinical implications.</description><identifier>ISSN: 1040-3590</identifier><identifier>EISSN: 1939-134X</identifier><identifier>DOI: 10.1037/pas0000764</identifier><identifier>PMID: 31414853</identifier><language>eng</language><publisher>United States: American Psychological Association</publisher><subject>Aged ; Aged, 80 and over ; Algorithms ; Cognitive ability ; Cognitive Assessment ; Cognitive Dysfunction - diagnosis ; Cognitive Dysfunction - psychology ; Cognitive Impairment ; Computer Assisted Instruction ; Computers ; Diagnosis, Computer-Assisted - methods ; Female ; Human ; Humans ; Logistic Regression ; Machine Learning ; Male ; Medical screening ; Mental disorders ; Middle Aged ; Mild Cognitive Impairment ; Neuropsychological Tests - standards ; Reproducibility of Results ; Scoring (Testing) ; Screening Tests ; Sensitivity and Specificity</subject><ispartof>Psychological assessment, 2019-11, Vol.31 (11), p.1377-1382</ispartof><rights>2019 American Psychological Association</rights><rights>2019, American Psychological Association</rights><rights>Copyright American Psychological Association Nov 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a381t-d9217f8c8f4d6079d6957b65a56be38da0a7d1bb4a6d9df837f3587327920be63</citedby><orcidid>0000-0001-6159-4322</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31414853$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Ben-Porath, Yossef S</contributor><creatorcontrib>Ursenbach, Jake</creatorcontrib><creatorcontrib>O'Connell, Megan E.</creatorcontrib><creatorcontrib>Neiser, Jennafer</creatorcontrib><creatorcontrib>Tierney, Mary C.</creatorcontrib><creatorcontrib>Morgan, Debra</creatorcontrib><creatorcontrib>Kosteniuk, Julie</creatorcontrib><creatorcontrib>Spiteri, Raymond J.</creatorcontrib><title>Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy</title><title>Psychological assessment</title><addtitle>Psychol Assess</addtitle><description>Computerized cognitive screening tools, such as the self-administered Computerized Assessment of Memory Cognitive Impairment (CAMCI), require little training and ensure standardized administration and could be an ideal test for primary care settings. We conducted a secondary analysis of a data set including 887 older adults (M age = 72.7 years, SD = 7.1 years; 32.1% male; M years education = 13.4, SD = 2.7 years) with CAMCI scores and independent diagnoses of mild cognitive impairment (MCI). A study by the CAMCI developers used a portion of this data set with a machine learning decision tree model and suggested that the CAMCI had high classification accuracy for MCI (sensitivity = 0.86, specificity = 0.94). We found similar support for accuracy (sensitivity = 0.94, specificity = 0.94) by overfitting a decision tree model, but we found evidence of lower accuracy in a cross-validation sample (sensitivity = 0.62, specificity = 0.66). A logistic regression model, however, discriminated modestly in both training (sensitivity = 0.72, specificity = 0.80) and cross-validation data sets (sensitivity = 0.69, specificity = 0.74). Evidence for strong accuracy when overfitting a decision tree model and substantially reduced accuracy in cross-validation samples was replicated across 500 bootstrapped samples. In contrast, the evidence for accuracy of the logistic regression model was similar in the training and cross-validation samples. The logistic regression model produced accuracy estimates consistent with other published CAMCI studies, suggesting evidence for classification accuracy of the CAMCI for MCI is likely modest. This case study illustrates the general need for cross-validation and careful evaluation of the generalizability of machine learning models. Public Significance Statement Using an archival database, a decision tree machine learning method demonstrated overfitting and had substantially reduced evidence for classification accuracy measured in cross-validation, but the statistical method results in similar evidence of classification in training and cross-validation samples. The evidence for classification accuracy of the Computerized Assessment of Mild Cognitive Impairment for cognitive impairment is modest, and this has clinical implications.</description><subject>Aged</subject><subject>Aged, 80 and over</subject><subject>Algorithms</subject><subject>Cognitive ability</subject><subject>Cognitive Assessment</subject><subject>Cognitive Dysfunction - diagnosis</subject><subject>Cognitive Dysfunction - psychology</subject><subject>Cognitive Impairment</subject><subject>Computer Assisted Instruction</subject><subject>Computers</subject><subject>Diagnosis, Computer-Assisted - methods</subject><subject>Female</subject><subject>Human</subject><subject>Humans</subject><subject>Logistic Regression</subject><subject>Machine Learning</subject><subject>Male</subject><subject>Medical screening</subject><subject>Mental disorders</subject><subject>Middle Aged</subject><subject>Mild Cognitive Impairment</subject><subject>Neuropsychological Tests - standards</subject><subject>Reproducibility of Results</subject><subject>Scoring (Testing)</subject><subject>Screening Tests</subject><subject>Sensitivity and Specificity</subject><issn>1040-3590</issn><issn>1939-134X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9kVGL1DAQx4so3nn64geQgC8iVJMmbVLfesuqCyv3cCf4VqbpZDdH29QkPdzP5pcze3sq-GAIZJL5zT_D_LPsJaPvGOXy_QyBpiUr8Sg7ZzWvc8bFt8cppoLmvKzpWfYshFtKmeCqfJqdcSaYUCU_z35ea-fttCPNsEtB3I-BGOcJkJUb5yWizy8hYJ-uu8lGe4fkWnvE6Vhz49zwgTQT2QzDEqKH-_z6B4zzgMQZcnWH3tgYj_AX0Hs7Idki-PvqZp69S48YCEw9iXskm3EGHYmbyDpEO0JMuSSzGiAEa6xOH6Rco_XiQR-eZ08MDAFfPJwX2deP65vV53x79WmzarY5cMVi3tcFk0ZpZURfUVn3VV3KriqhrDrkqgcKsmddJ6Dq694oLg0vleSFrAvaYcUvsjcn3dTv9wVDbEcbNA4DTOiW0BaF5LKsCsUS-vof9NYtfkrdtQVnRVFUohL_pZIWl4pKmqi3J0p7F4JH084-zcQfWkbbo_HtX-MT_OpBculG7P-gv51OQH4CYIZ2DgcNPlo9YEiz9DjFo1iiW5Y2l5L_Al6mulI</recordid><startdate>20191101</startdate><enddate>20191101</enddate><creator>Ursenbach, Jake</creator><creator>O'Connell, Megan E.</creator><creator>Neiser, Jennafer</creator><creator>Tierney, Mary C.</creator><creator>Morgan, Debra</creator><creator>Kosteniuk, Julie</creator><creator>Spiteri, Raymond J.</creator><general>American Psychological Association</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7RZ</scope><scope>PSYQQ</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-6159-4322</orcidid></search><sort><creationdate>20191101</creationdate><title>Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy</title><author>Ursenbach, Jake ; O'Connell, Megan E. ; Neiser, Jennafer ; Tierney, Mary C. ; Morgan, Debra ; Kosteniuk, Julie ; Spiteri, Raymond J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a381t-d9217f8c8f4d6079d6957b65a56be38da0a7d1bb4a6d9df837f3587327920be63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Aged</topic><topic>Aged, 80 and over</topic><topic>Algorithms</topic><topic>Cognitive ability</topic><topic>Cognitive Assessment</topic><topic>Cognitive Dysfunction - diagnosis</topic><topic>Cognitive Dysfunction - psychology</topic><topic>Cognitive Impairment</topic><topic>Computer Assisted Instruction</topic><topic>Computers</topic><topic>Diagnosis, Computer-Assisted - methods</topic><topic>Female</topic><topic>Human</topic><topic>Humans</topic><topic>Logistic Regression</topic><topic>Machine Learning</topic><topic>Male</topic><topic>Medical screening</topic><topic>Mental disorders</topic><topic>Middle Aged</topic><topic>Mild Cognitive Impairment</topic><topic>Neuropsychological Tests - standards</topic><topic>Reproducibility of Results</topic><topic>Scoring (Testing)</topic><topic>Screening Tests</topic><topic>Sensitivity and Specificity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ursenbach, Jake</creatorcontrib><creatorcontrib>O'Connell, Megan E.</creatorcontrib><creatorcontrib>Neiser, Jennafer</creatorcontrib><creatorcontrib>Tierney, Mary C.</creatorcontrib><creatorcontrib>Morgan, Debra</creatorcontrib><creatorcontrib>Kosteniuk, Julie</creatorcontrib><creatorcontrib>Spiteri, Raymond J.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>APA PsycArticles®</collection><collection>ProQuest One Psychology</collection><collection>MEDLINE - Academic</collection><jtitle>Psychological assessment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ursenbach, Jake</au><au>O'Connell, Megan E.</au><au>Neiser, Jennafer</au><au>Tierney, Mary C.</au><au>Morgan, Debra</au><au>Kosteniuk, Julie</au><au>Spiteri, Raymond J.</au><au>Ben-Porath, Yossef S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy</atitle><jtitle>Psychological assessment</jtitle><addtitle>Psychol Assess</addtitle><date>2019-11-01</date><risdate>2019</risdate><volume>31</volume><issue>11</issue><spage>1377</spage><epage>1382</epage><pages>1377-1382</pages><issn>1040-3590</issn><eissn>1939-134X</eissn><abstract>Computerized cognitive screening tools, such as the self-administered Computerized Assessment of Memory Cognitive Impairment (CAMCI), require little training and ensure standardized administration and could be an ideal test for primary care settings. We conducted a secondary analysis of a data set including 887 older adults (M age = 72.7 years, SD = 7.1 years; 32.1% male; M years education = 13.4, SD = 2.7 years) with CAMCI scores and independent diagnoses of mild cognitive impairment (MCI). A study by the CAMCI developers used a portion of this data set with a machine learning decision tree model and suggested that the CAMCI had high classification accuracy for MCI (sensitivity = 0.86, specificity = 0.94). We found similar support for accuracy (sensitivity = 0.94, specificity = 0.94) by overfitting a decision tree model, but we found evidence of lower accuracy in a cross-validation sample (sensitivity = 0.62, specificity = 0.66). A logistic regression model, however, discriminated modestly in both training (sensitivity = 0.72, specificity = 0.80) and cross-validation data sets (sensitivity = 0.69, specificity = 0.74). Evidence for strong accuracy when overfitting a decision tree model and substantially reduced accuracy in cross-validation samples was replicated across 500 bootstrapped samples. In contrast, the evidence for accuracy of the logistic regression model was similar in the training and cross-validation samples. The logistic regression model produced accuracy estimates consistent with other published CAMCI studies, suggesting evidence for classification accuracy of the CAMCI for MCI is likely modest. This case study illustrates the general need for cross-validation and careful evaluation of the generalizability of machine learning models. Public Significance Statement Using an archival database, a decision tree machine learning method demonstrated overfitting and had substantially reduced evidence for classification accuracy measured in cross-validation, but the statistical method results in similar evidence of classification in training and cross-validation samples. The evidence for classification accuracy of the Computerized Assessment of Mild Cognitive Impairment for cognitive impairment is modest, and this has clinical implications.</abstract><cop>United States</cop><pub>American Psychological Association</pub><pmid>31414853</pmid><doi>10.1037/pas0000764</doi><tpages>6</tpages><orcidid>https://orcid.org/0000-0001-6159-4322</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 1040-3590
ispartof	Psychological assessment, 2019-11, Vol.31 (11), p.1377-1382
issn	1040-3590 1939-134X
language	eng
recordid	cdi_proquest_miscellaneous_2273756281
source	PsycARTICLES
subjects	Aged Aged, 80 and over Algorithms Cognitive ability Cognitive Assessment Cognitive Dysfunction - diagnosis Cognitive Dysfunction - psychology Cognitive Impairment Computer Assisted Instruction Computers Diagnosis, Computer-Assisted - methods Female Human Humans Logistic Regression Machine Learning Male Medical screening Mental disorders Middle Aged Mild Cognitive Impairment Neuropsychological Tests - standards Reproducibility of Results Scoring (Testing) Screening Tests Sensitivity and Specificity
title	Scoring Algorithms for a Computer-Based Cognitive Screening Tool: An Illustrative Example of Overfitting Machine Learning Approaches and the Impact on Estimates of Classification Accuracy
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T03%3A55%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Scoring%20Algorithms%20for%20a%20Computer-Based%20Cognitive%20Screening%20Tool:%20An%20Illustrative%20Example%20of%20Overfitting%20Machine%20Learning%20Approaches%20and%20the%20Impact%20on%20Estimates%20of%20Classification%20Accuracy&rft.jtitle=Psychological%20assessment&rft.au=Ursenbach,%20Jake&rft.date=2019-11-01&rft.volume=31&rft.issue=11&rft.spage=1377&rft.epage=1382&rft.pages=1377-1382&rft.issn=1040-3590&rft.eissn=1939-134X&rft_id=info:doi/10.1037/pas0000764&rft_dat=%3Cproquest_cross%3E2312226464%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a381t-d9217f8c8f4d6079d6957b65a56be38da0a7d1bb4a6d9df837f3587327920be63%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2273378070&rft_id=info:pmid/31414853&rfr_iscdi=true