Loading…

Methods to quantify variable importance: implications for the analysis of noisy ecological data

Determining the importance of independent variables is of practical relevance to ecologists and managers concerned with allocating limited resources to the management of natural systems. Although techniques that identify explanatory variables having the largest influence on the response variable are...

Full description

Saved in:
Bibliographic Details
Published in:Ecology (Durham) 2009-02, Vol.90 (2), p.348-355
Main Authors: Murray, Kim, Mary M. Conner
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c4978-b2b849b9d409a8e7e438397d3e43ceacc130ff5196989290fadf30a2548d257c3
cites cdi_FETCH-LOGICAL-c4978-b2b849b9d409a8e7e438397d3e43ceacc130ff5196989290fadf30a2548d257c3
container_end_page 355
container_issue 2
container_start_page 348
container_title Ecology (Durham)
container_volume 90
creator Murray, Kim
Mary M. Conner
description Determining the importance of independent variables is of practical relevance to ecologists and managers concerned with allocating limited resources to the management of natural systems. Although techniques that identify explanatory variables having the largest influence on the response variable are needed to design management actions effectively, the use of various indices to evaluate variable importance is poorly understood. Using Monte Carlo simulations, we compared six different indices commonly used to evaluate variable importance; zero‐order correlations, partial correlations, semipartial correlations, standardized regression coefficients, Akaike weights, and independent effects. We simulated four scenarios to evaluate the indices under progressively more complex circumstances that included correlation between explanatory variables, as well as a spurious variable that was correlated with other explanatory variables, but not with the dependent variable. No index performed perfectly under all circumstances, but partial correlations and Akaike weights performed poorly in all cases. Zero‐order correlations was the only measure that detected the presence of a spurious variable, whereas only independent effects assigned overlap areas correctly once the spurious variable was removed. We therefore recommend using zero‐order correlations to eliminate predictor variables with correlations near zero, followed by the use of independent effects to assign overlap areas and rank variable importance.
doi_str_mv 10.1890/07-1929.1
format article
fullrecord <record><control><sourceid>jstor_proqu</sourceid><recordid>TN_cdi_proquest_miscellaneous_67075167</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>27650989</jstor_id><sourcerecordid>27650989</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4978-b2b849b9d409a8e7e438397d3e43ceacc130ff5196989290fadf30a2548d257c3</originalsourceid><addsrcrecordid>eNqFksGKFDEQhoMo7rh68AHUICh46LWSdHcSbzLsqrDiQffgKWTSyW6Gns5s0q3021tDD7sgiLmkoL78VfVXCHnO4IwpDe9BVkxzfcYekBXTQleaSXhIVgCMV7pt1Al5UsoW8LBaPSYnCHHBmVoR89WPN6krdEz0drLDGMNMf9kc7ab3NO72KY92cP7DIe6js2NMQ6EhZTreeGoH288lFpoCHVIsM_Uu9ekawZ52drRPyaNg--KfHe9TcnVx_mP9ubr89unL-uNl5WotVbXhG1Xrje5q0FZ56WuhhJadwMB56xwTEELDdKsVTgrBdkGA5U2tOt5IJ07J20V3n9Pt5MtodrE43_d28GkqppUgG9bK_4IchG5VoxF8_Re4TVPGeZFhGngjoEXo3QK5nErJPph9jjubZ8PAHHZjQJrDbgxD9uVRcNrsfHdPHpeBwJsjYAv6FzIaH8sdxxkKSuDI1Qv3O_Z-_ndFc77-yQE0Nivqg_yL5dm2jCnfy8q2AXQV86-WfLDJ2OuMpa--c2At_hq0pG7EH8mUt0A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>219025306</pqid></control><display><type>article</type><title>Methods to quantify variable importance: implications for the analysis of noisy ecological data</title><source>Wiley</source><source>JSTOR</source><creator>Murray, Kim ; Mary M. Conner</creator><creatorcontrib>Murray, Kim ; Mary M. Conner</creatorcontrib><description>Determining the importance of independent variables is of practical relevance to ecologists and managers concerned with allocating limited resources to the management of natural systems. Although techniques that identify explanatory variables having the largest influence on the response variable are needed to design management actions effectively, the use of various indices to evaluate variable importance is poorly understood. Using Monte Carlo simulations, we compared six different indices commonly used to evaluate variable importance; zero‐order correlations, partial correlations, semipartial correlations, standardized regression coefficients, Akaike weights, and independent effects. We simulated four scenarios to evaluate the indices under progressively more complex circumstances that included correlation between explanatory variables, as well as a spurious variable that was correlated with other explanatory variables, but not with the dependent variable. No index performed perfectly under all circumstances, but partial correlations and Akaike weights performed poorly in all cases. Zero‐order correlations was the only measure that detected the presence of a spurious variable, whereas only independent effects assigned overlap areas correctly once the spurious variable was removed. We therefore recommend using zero‐order correlations to eliminate predictor variables with correlations near zero, followed by the use of independent effects to assign overlap areas and rank variable importance.</description><identifier>ISSN: 0012-9658</identifier><identifier>EISSN: 1939-9170</identifier><identifier>DOI: 10.1890/07-1929.1</identifier><identifier>PMID: 19323218</identifier><identifier>CODEN: ECGYAQ</identifier><language>eng</language><publisher>Washington, DC: Ecological Society of America</publisher><subject>Akaike weights ; Analysis ; Animal and plant ecology ; Animal, plant and microbial ecology ; Applied ecology ; beta coefficients ; beta weights ; Biological and medical sciences ; Computer Simulation ; Conservation biology ; Conservation of Natural Resources - methods ; Correlation coefficients ; Correlations ; data analysis ; dominance analysis ; Ecological modeling ; ecologists ; Ecology ; Ecosystem ; Fundamental and applied biological sciences. Psychology ; General aspects ; hierarchical partitioning ; independent effects ; managers ; Methods ; Modeling ; Models, Biological ; Monte Carlo Method ; Monte Carlo simulation ; Natural resource management ; partial correlation coefficients ; Regression coefficients ; relative weights ; Riparian ecology ; standardized regression coefficients ; Wildlife ecology ; Wildlife management</subject><ispartof>Ecology (Durham), 2009-02, Vol.90 (2), p.348-355</ispartof><rights>Copyright 2009 Ecological Society of America</rights><rights>2009 by the Ecological Society of America</rights><rights>2009 INIST-CNRS</rights><rights>Copyright Ecological Society of America Feb 2009</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4978-b2b849b9d409a8e7e438397d3e43ceacc130ff5196989290fadf30a2548d257c3</citedby><cites>FETCH-LOGICAL-c4978-b2b849b9d409a8e7e438397d3e43ceacc130ff5196989290fadf30a2548d257c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/27650989$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/27650989$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,58237,58470</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=21189702$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/19323218$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Murray, Kim</creatorcontrib><creatorcontrib>Mary M. Conner</creatorcontrib><title>Methods to quantify variable importance: implications for the analysis of noisy ecological data</title><title>Ecology (Durham)</title><addtitle>Ecology</addtitle><description>Determining the importance of independent variables is of practical relevance to ecologists and managers concerned with allocating limited resources to the management of natural systems. Although techniques that identify explanatory variables having the largest influence on the response variable are needed to design management actions effectively, the use of various indices to evaluate variable importance is poorly understood. Using Monte Carlo simulations, we compared six different indices commonly used to evaluate variable importance; zero‐order correlations, partial correlations, semipartial correlations, standardized regression coefficients, Akaike weights, and independent effects. We simulated four scenarios to evaluate the indices under progressively more complex circumstances that included correlation between explanatory variables, as well as a spurious variable that was correlated with other explanatory variables, but not with the dependent variable. No index performed perfectly under all circumstances, but partial correlations and Akaike weights performed poorly in all cases. Zero‐order correlations was the only measure that detected the presence of a spurious variable, whereas only independent effects assigned overlap areas correctly once the spurious variable was removed. We therefore recommend using zero‐order correlations to eliminate predictor variables with correlations near zero, followed by the use of independent effects to assign overlap areas and rank variable importance.</description><subject>Akaike weights</subject><subject>Analysis</subject><subject>Animal and plant ecology</subject><subject>Animal, plant and microbial ecology</subject><subject>Applied ecology</subject><subject>beta coefficients</subject><subject>beta weights</subject><subject>Biological and medical sciences</subject><subject>Computer Simulation</subject><subject>Conservation biology</subject><subject>Conservation of Natural Resources - methods</subject><subject>Correlation coefficients</subject><subject>Correlations</subject><subject>data analysis</subject><subject>dominance analysis</subject><subject>Ecological modeling</subject><subject>ecologists</subject><subject>Ecology</subject><subject>Ecosystem</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>hierarchical partitioning</subject><subject>independent effects</subject><subject>managers</subject><subject>Methods</subject><subject>Modeling</subject><subject>Models, Biological</subject><subject>Monte Carlo Method</subject><subject>Monte Carlo simulation</subject><subject>Natural resource management</subject><subject>partial correlation coefficients</subject><subject>Regression coefficients</subject><subject>relative weights</subject><subject>Riparian ecology</subject><subject>standardized regression coefficients</subject><subject>Wildlife ecology</subject><subject>Wildlife management</subject><issn>0012-9658</issn><issn>1939-9170</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><recordid>eNqFksGKFDEQhoMo7rh68AHUICh46LWSdHcSbzLsqrDiQffgKWTSyW6Gns5s0q3021tDD7sgiLmkoL78VfVXCHnO4IwpDe9BVkxzfcYekBXTQleaSXhIVgCMV7pt1Al5UsoW8LBaPSYnCHHBmVoR89WPN6krdEz0drLDGMNMf9kc7ab3NO72KY92cP7DIe6js2NMQ6EhZTreeGoH288lFpoCHVIsM_Uu9ekawZ52drRPyaNg--KfHe9TcnVx_mP9ubr89unL-uNl5WotVbXhG1Xrje5q0FZ56WuhhJadwMB56xwTEELDdKsVTgrBdkGA5U2tOt5IJ07J20V3n9Pt5MtodrE43_d28GkqppUgG9bK_4IchG5VoxF8_Re4TVPGeZFhGngjoEXo3QK5nErJPph9jjubZ8PAHHZjQJrDbgxD9uVRcNrsfHdPHpeBwJsjYAv6FzIaH8sdxxkKSuDI1Qv3O_Z-_ndFc77-yQE0Nivqg_yL5dm2jCnfy8q2AXQV86-WfLDJ2OuMpa--c2At_hq0pG7EH8mUt0A</recordid><startdate>200902</startdate><enddate>200902</enddate><creator>Murray, Kim</creator><creator>Mary M. Conner</creator><general>Ecological Society of America</general><scope>FBQ</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7SN</scope><scope>7SS</scope><scope>7ST</scope><scope>7T7</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>K9.</scope><scope>P64</scope><scope>RC3</scope><scope>SOI</scope><scope>7X8</scope></search><sort><creationdate>200902</creationdate><title>Methods to quantify variable importance: implications for the analysis of noisy ecological data</title><author>Murray, Kim ; Mary M. Conner</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4978-b2b849b9d409a8e7e438397d3e43ceacc130ff5196989290fadf30a2548d257c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Akaike weights</topic><topic>Analysis</topic><topic>Animal and plant ecology</topic><topic>Animal, plant and microbial ecology</topic><topic>Applied ecology</topic><topic>beta coefficients</topic><topic>beta weights</topic><topic>Biological and medical sciences</topic><topic>Computer Simulation</topic><topic>Conservation biology</topic><topic>Conservation of Natural Resources - methods</topic><topic>Correlation coefficients</topic><topic>Correlations</topic><topic>data analysis</topic><topic>dominance analysis</topic><topic>Ecological modeling</topic><topic>ecologists</topic><topic>Ecology</topic><topic>Ecosystem</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>hierarchical partitioning</topic><topic>independent effects</topic><topic>managers</topic><topic>Methods</topic><topic>Modeling</topic><topic>Models, Biological</topic><topic>Monte Carlo Method</topic><topic>Monte Carlo simulation</topic><topic>Natural resource management</topic><topic>partial correlation coefficients</topic><topic>Regression coefficients</topic><topic>relative weights</topic><topic>Riparian ecology</topic><topic>standardized regression coefficients</topic><topic>Wildlife ecology</topic><topic>Wildlife management</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Murray, Kim</creatorcontrib><creatorcontrib>Mary M. Conner</creatorcontrib><collection>AGRIS</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Environment Abstracts</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>Environment Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Ecology (Durham)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Murray, Kim</au><au>Mary M. Conner</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Methods to quantify variable importance: implications for the analysis of noisy ecological data</atitle><jtitle>Ecology (Durham)</jtitle><addtitle>Ecology</addtitle><date>2009-02</date><risdate>2009</risdate><volume>90</volume><issue>2</issue><spage>348</spage><epage>355</epage><pages>348-355</pages><issn>0012-9658</issn><eissn>1939-9170</eissn><coden>ECGYAQ</coden><abstract>Determining the importance of independent variables is of practical relevance to ecologists and managers concerned with allocating limited resources to the management of natural systems. Although techniques that identify explanatory variables having the largest influence on the response variable are needed to design management actions effectively, the use of various indices to evaluate variable importance is poorly understood. Using Monte Carlo simulations, we compared six different indices commonly used to evaluate variable importance; zero‐order correlations, partial correlations, semipartial correlations, standardized regression coefficients, Akaike weights, and independent effects. We simulated four scenarios to evaluate the indices under progressively more complex circumstances that included correlation between explanatory variables, as well as a spurious variable that was correlated with other explanatory variables, but not with the dependent variable. No index performed perfectly under all circumstances, but partial correlations and Akaike weights performed poorly in all cases. Zero‐order correlations was the only measure that detected the presence of a spurious variable, whereas only independent effects assigned overlap areas correctly once the spurious variable was removed. We therefore recommend using zero‐order correlations to eliminate predictor variables with correlations near zero, followed by the use of independent effects to assign overlap areas and rank variable importance.</abstract><cop>Washington, DC</cop><pub>Ecological Society of America</pub><pmid>19323218</pmid><doi>10.1890/07-1929.1</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0012-9658
ispartof Ecology (Durham), 2009-02, Vol.90 (2), p.348-355
issn 0012-9658
1939-9170
language eng
recordid cdi_proquest_miscellaneous_67075167
source Wiley; JSTOR
subjects Akaike weights
Analysis
Animal and plant ecology
Animal, plant and microbial ecology
Applied ecology
beta coefficients
beta weights
Biological and medical sciences
Computer Simulation
Conservation biology
Conservation of Natural Resources - methods
Correlation coefficients
Correlations
data analysis
dominance analysis
Ecological modeling
ecologists
Ecology
Ecosystem
Fundamental and applied biological sciences. Psychology
General aspects
hierarchical partitioning
independent effects
managers
Methods
Modeling
Models, Biological
Monte Carlo Method
Monte Carlo simulation
Natural resource management
partial correlation coefficients
Regression coefficients
relative weights
Riparian ecology
standardized regression coefficients
Wildlife ecology
Wildlife management
title Methods to quantify variable importance: implications for the analysis of noisy ecological data
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T12%3A45%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Methods%20to%20quantify%20variable%20importance:%20implications%20for%20the%20analysis%20of%20noisy%20ecological%20data&rft.jtitle=Ecology%20(Durham)&rft.au=Murray,%20Kim&rft.date=2009-02&rft.volume=90&rft.issue=2&rft.spage=348&rft.epage=355&rft.pages=348-355&rft.issn=0012-9658&rft.eissn=1939-9170&rft.coden=ECGYAQ&rft_id=info:doi/10.1890/07-1929.1&rft_dat=%3Cjstor_proqu%3E27650989%3C/jstor_proqu%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c4978-b2b849b9d409a8e7e438397d3e43ceacc130ff5196989290fadf30a2548d257c3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=219025306&rft_id=info:pmid/19323218&rft_jstor_id=27650989&rfr_iscdi=true