Loading…

A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD

Aim The idea of combining predictions from different models into an ensemble has gained considerable popularity in species distribution modelling, partly due to free and comprehensive software such as the R package BIOMOD. However, despite proliferation of ensemble models, we lack oversight of how a...

Full description

Saved in:
Bibliographic Details
Published in:Diversity & distributions 2019-05, Vol.25 (5), p.839-852
Main Authors: Hao, Tianxiao, Elith, Jane, Guillera-Arroita, Gurutzeta, Lahoz-Monfort, José J.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c3542-37e005bb9f5a949ed4dee6888977a773e9b618f0b5ef44fddcb9fe7078f7ff9d3
cites cdi_FETCH-LOGICAL-c3542-37e005bb9f5a949ed4dee6888977a773e9b618f0b5ef44fddcb9fe7078f7ff9d3
container_end_page 852
container_issue 5
container_start_page 839
container_title Diversity & distributions
container_volume 25
creator Hao, Tianxiao
Elith, Jane
Guillera-Arroita, Gurutzeta
Lahoz-Monfort, José J.
description Aim The idea of combining predictions from different models into an ensemble has gained considerable popularity in species distribution modelling, partly due to free and comprehensive software such as the R package BIOMOD. However, despite proliferation of ensemble models, we lack oversight of how and where they are used for modelling distributions, and how well they perform. Here, we present such an overview. Location Global. Methods Since BIOMOD is freely available and widely used by ensemble species distribution modellers, we focused on articles that apply BIOMOD, filtering the initial 852 papers identified in our structured literature search to a relevant final subset of 224 eligible peer‐reviewed journal articles. Results BIOMOD‐based ensembles are used across many taxa and locations, with terrestrial plants being the most represented group of species (n = 72) and Europe being the most represented continent (n = 106). These studies often focus on forecasting distributions in the future (n = 109), and commonly use presence‐only species data (n = 139) and climatic environmental predictors (n = 219). An average of six models are used in ensembles, and approximately half of ensembles weight contributions of models by their cross‐validation performance. However, discussion about choices made in the modelling process and unambiguous information on the performance of ensemble models versus individual models are limited. The use of independent data to validate model performance is particularly uncommon. Main conclusions We document the breadth of ensemble applications, but could not draw strong quantitative conclusions about the predictive performance of ensemble models, due to lack of unambiguous information reported. Understanding how and where ensembles are best used when modelling species distributions is important for enabling best choices for different applications. To enable this objective to be achieved, we provide recommendations for thorough reporting practices in a BIOMOD‐based ensemble workflow.
doi_str_mv 10.1111/ddi.12892
format article
fullrecord <record><control><sourceid>jstor_JFNAL</sourceid><recordid>TN_cdi_proquest_journals_2213013904</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>26618567</jstor_id><sourcerecordid>26618567</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3542-37e005bb9f5a949ed4dee6888977a773e9b618f0b5ef44fddcb9fe7078f7ff9d3</originalsourceid><addsrcrecordid>eNp1kMtOwzAQRSMEEqWw4AOQLLFikdavxPGyUB6VirqBdZTEY-SSxMFOqPr3uATYMZu50pw7M7pRdEnwjISaK2VmhGaSHkUTwgWNecrpcdAsTWOZkPQ0OvN-izFmLKGTSC-Qg08DO2Q1CkJBWwEqSjv0aPBBtQp14LR1TXGYBMp3UBnwSBnfO1MOvbEtaqyCujbtG4LWQ1PWAajNO6Db1eZ5szyPTnRRe7j46dPo9eH-5e4pXm8eV3eLdVyxhNOYCcA4KUupk0JyCYorgDTLMilEIQQDWaYk07hMQHOulaoCCgKLTAutpWLT6Hrc2zn7MYDv860dXBtO5pQShgmTmAfqZqQqZ713oPPOmaZw-5zg_BBjHmLMv2MM7Hxkd6aG_f9gvlyufh1Xo2Pre-v-HDQNryepYF_Kkn5L</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2213013904</pqid></control><display><type>article</type><title>A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD</title><source>Jstor Journals Open Access</source><creator>Hao, Tianxiao ; Elith, Jane ; Guillera-Arroita, Gurutzeta ; Lahoz-Monfort, José J.</creator><contributor>Serra‐Diaz, Josep</contributor><creatorcontrib>Hao, Tianxiao ; Elith, Jane ; Guillera-Arroita, Gurutzeta ; Lahoz-Monfort, José J. ; Serra‐Diaz, Josep</creatorcontrib><description>Aim The idea of combining predictions from different models into an ensemble has gained considerable popularity in species distribution modelling, partly due to free and comprehensive software such as the R package BIOMOD. However, despite proliferation of ensemble models, we lack oversight of how and where they are used for modelling distributions, and how well they perform. Here, we present such an overview. Location Global. Methods Since BIOMOD is freely available and widely used by ensemble species distribution modellers, we focused on articles that apply BIOMOD, filtering the initial 852 papers identified in our structured literature search to a relevant final subset of 224 eligible peer‐reviewed journal articles. Results BIOMOD‐based ensembles are used across many taxa and locations, with terrestrial plants being the most represented group of species (n = 72) and Europe being the most represented continent (n = 106). These studies often focus on forecasting distributions in the future (n = 109), and commonly use presence‐only species data (n = 139) and climatic environmental predictors (n = 219). An average of six models are used in ensembles, and approximately half of ensembles weight contributions of models by their cross‐validation performance. However, discussion about choices made in the modelling process and unambiguous information on the performance of ensemble models versus individual models are limited. The use of independent data to validate model performance is particularly uncommon. Main conclusions We document the breadth of ensemble applications, but could not draw strong quantitative conclusions about the predictive performance of ensemble models, due to lack of unambiguous information reported. Understanding how and where ensembles are best used when modelling species distributions is important for enabling best choices for different applications. To enable this objective to be achieved, we provide recommendations for thorough reporting practices in a BIOMOD‐based ensemble workflow.</description><identifier>ISSN: 1366-9516</identifier><identifier>EISSN: 1472-4642</identifier><identifier>DOI: 10.1111/ddi.12892</identifier><language>eng</language><publisher>Oxford: Wiley</publisher><subject>BIODIVERSITY REVIEW ; BIOMOD ; Climate change ; consensus forecast ; Discriminant analysis ; ecological niche models ; ensemble ; Generalized linear models ; habitat suitability models ; Information processing ; Mathematical models ; Methods ; Modelling ; Performance prediction ; Species ; species distribution model ; Terrestrial environments ; Workflow</subject><ispartof>Diversity &amp; distributions, 2019-05, Vol.25 (5), p.839-852</ispartof><rights>2019 The Authors</rights><rights>2019 The Authors. Published by John Wiley &amp; Sons Ltd</rights><rights>2019. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3542-37e005bb9f5a949ed4dee6888977a773e9b618f0b5ef44fddcb9fe7078f7ff9d3</citedby><cites>FETCH-LOGICAL-c3542-37e005bb9f5a949ed4dee6888977a773e9b618f0b5ef44fddcb9fe7078f7ff9d3</cites><orcidid>0000-0003-4363-1956 ; 0000-0002-8706-0326 ; 0000-0002-8387-5739 ; 0000-0002-0845-7035</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2213013904/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2213013904?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,11545,25337,25736,27907,27908,36995,44573,46035,46459,54507,54513,58221,58454,74877</link.rule.ids><linktorsrc>$$Uhttps://www.jstor.org/stable/26618567$$EView_record_in_JSTOR$$FView_record_in_$$GJSTOR</linktorsrc></links><search><contributor>Serra‐Diaz, Josep</contributor><creatorcontrib>Hao, Tianxiao</creatorcontrib><creatorcontrib>Elith, Jane</creatorcontrib><creatorcontrib>Guillera-Arroita, Gurutzeta</creatorcontrib><creatorcontrib>Lahoz-Monfort, José J.</creatorcontrib><title>A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD</title><title>Diversity &amp; distributions</title><description>Aim The idea of combining predictions from different models into an ensemble has gained considerable popularity in species distribution modelling, partly due to free and comprehensive software such as the R package BIOMOD. However, despite proliferation of ensemble models, we lack oversight of how and where they are used for modelling distributions, and how well they perform. Here, we present such an overview. Location Global. Methods Since BIOMOD is freely available and widely used by ensemble species distribution modellers, we focused on articles that apply BIOMOD, filtering the initial 852 papers identified in our structured literature search to a relevant final subset of 224 eligible peer‐reviewed journal articles. Results BIOMOD‐based ensembles are used across many taxa and locations, with terrestrial plants being the most represented group of species (n = 72) and Europe being the most represented continent (n = 106). These studies often focus on forecasting distributions in the future (n = 109), and commonly use presence‐only species data (n = 139) and climatic environmental predictors (n = 219). An average of six models are used in ensembles, and approximately half of ensembles weight contributions of models by their cross‐validation performance. However, discussion about choices made in the modelling process and unambiguous information on the performance of ensemble models versus individual models are limited. The use of independent data to validate model performance is particularly uncommon. Main conclusions We document the breadth of ensemble applications, but could not draw strong quantitative conclusions about the predictive performance of ensemble models, due to lack of unambiguous information reported. Understanding how and where ensembles are best used when modelling species distributions is important for enabling best choices for different applications. To enable this objective to be achieved, we provide recommendations for thorough reporting practices in a BIOMOD‐based ensemble workflow.</description><subject>BIODIVERSITY REVIEW</subject><subject>BIOMOD</subject><subject>Climate change</subject><subject>consensus forecast</subject><subject>Discriminant analysis</subject><subject>ecological niche models</subject><subject>ensemble</subject><subject>Generalized linear models</subject><subject>habitat suitability models</subject><subject>Information processing</subject><subject>Mathematical models</subject><subject>Methods</subject><subject>Modelling</subject><subject>Performance prediction</subject><subject>Species</subject><subject>species distribution model</subject><subject>Terrestrial environments</subject><subject>Workflow</subject><issn>1366-9516</issn><issn>1472-4642</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>PIMPY</sourceid><recordid>eNp1kMtOwzAQRSMEEqWw4AOQLLFikdavxPGyUB6VirqBdZTEY-SSxMFOqPr3uATYMZu50pw7M7pRdEnwjISaK2VmhGaSHkUTwgWNecrpcdAsTWOZkPQ0OvN-izFmLKGTSC-Qg08DO2Q1CkJBWwEqSjv0aPBBtQp14LR1TXGYBMp3UBnwSBnfO1MOvbEtaqyCujbtG4LWQ1PWAajNO6Db1eZ5szyPTnRRe7j46dPo9eH-5e4pXm8eV3eLdVyxhNOYCcA4KUupk0JyCYorgDTLMilEIQQDWaYk07hMQHOulaoCCgKLTAutpWLT6Hrc2zn7MYDv860dXBtO5pQShgmTmAfqZqQqZ713oPPOmaZw-5zg_BBjHmLMv2MM7Hxkd6aG_f9gvlyufh1Xo2Pre-v-HDQNryepYF_Kkn5L</recordid><startdate>20190501</startdate><enddate>20190501</enddate><creator>Hao, Tianxiao</creator><creator>Elith, Jane</creator><creator>Guillera-Arroita, Gurutzeta</creator><creator>Lahoz-Monfort, José J.</creator><general>Wiley</general><general>John Wiley &amp; Sons, Inc</general><scope>24P</scope><scope>WIN</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SN</scope><scope>7XB</scope><scope>8FE</scope><scope>8FH</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>LK8</scope><scope>M2O</scope><scope>M7N</scope><scope>M7P</scope><scope>MBDVC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0003-4363-1956</orcidid><orcidid>https://orcid.org/0000-0002-8706-0326</orcidid><orcidid>https://orcid.org/0000-0002-8387-5739</orcidid><orcidid>https://orcid.org/0000-0002-0845-7035</orcidid></search><sort><creationdate>20190501</creationdate><title>A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD</title><author>Hao, Tianxiao ; Elith, Jane ; Guillera-Arroita, Gurutzeta ; Lahoz-Monfort, José J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3542-37e005bb9f5a949ed4dee6888977a773e9b618f0b5ef44fddcb9fe7078f7ff9d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>BIODIVERSITY REVIEW</topic><topic>BIOMOD</topic><topic>Climate change</topic><topic>consensus forecast</topic><topic>Discriminant analysis</topic><topic>ecological niche models</topic><topic>ensemble</topic><topic>Generalized linear models</topic><topic>habitat suitability models</topic><topic>Information processing</topic><topic>Mathematical models</topic><topic>Methods</topic><topic>Modelling</topic><topic>Performance prediction</topic><topic>Species</topic><topic>species distribution model</topic><topic>Terrestrial environments</topic><topic>Workflow</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hao, Tianxiao</creatorcontrib><creatorcontrib>Elith, Jane</creatorcontrib><creatorcontrib>Guillera-Arroita, Gurutzeta</creatorcontrib><creatorcontrib>Lahoz-Monfort, José J.</creatorcontrib><collection>Wiley-Blackwell Open Access Collection</collection><collection>Wiley Online Library Open Access</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Ecology Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Research Library</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Research Library (Corporate)</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Diversity &amp; distributions</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hao, Tianxiao</au><au>Elith, Jane</au><au>Guillera-Arroita, Gurutzeta</au><au>Lahoz-Monfort, José J.</au><au>Serra‐Diaz, Josep</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD</atitle><jtitle>Diversity &amp; distributions</jtitle><date>2019-05-01</date><risdate>2019</risdate><volume>25</volume><issue>5</issue><spage>839</spage><epage>852</epage><pages>839-852</pages><issn>1366-9516</issn><eissn>1472-4642</eissn><abstract>Aim The idea of combining predictions from different models into an ensemble has gained considerable popularity in species distribution modelling, partly due to free and comprehensive software such as the R package BIOMOD. However, despite proliferation of ensemble models, we lack oversight of how and where they are used for modelling distributions, and how well they perform. Here, we present such an overview. Location Global. Methods Since BIOMOD is freely available and widely used by ensemble species distribution modellers, we focused on articles that apply BIOMOD, filtering the initial 852 papers identified in our structured literature search to a relevant final subset of 224 eligible peer‐reviewed journal articles. Results BIOMOD‐based ensembles are used across many taxa and locations, with terrestrial plants being the most represented group of species (n = 72) and Europe being the most represented continent (n = 106). These studies often focus on forecasting distributions in the future (n = 109), and commonly use presence‐only species data (n = 139) and climatic environmental predictors (n = 219). An average of six models are used in ensembles, and approximately half of ensembles weight contributions of models by their cross‐validation performance. However, discussion about choices made in the modelling process and unambiguous information on the performance of ensemble models versus individual models are limited. The use of independent data to validate model performance is particularly uncommon. Main conclusions We document the breadth of ensemble applications, but could not draw strong quantitative conclusions about the predictive performance of ensemble models, due to lack of unambiguous information reported. Understanding how and where ensembles are best used when modelling species distributions is important for enabling best choices for different applications. To enable this objective to be achieved, we provide recommendations for thorough reporting practices in a BIOMOD‐based ensemble workflow.</abstract><cop>Oxford</cop><pub>Wiley</pub><doi>10.1111/ddi.12892</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0003-4363-1956</orcidid><orcidid>https://orcid.org/0000-0002-8706-0326</orcidid><orcidid>https://orcid.org/0000-0002-8387-5739</orcidid><orcidid>https://orcid.org/0000-0002-0845-7035</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1366-9516
ispartof Diversity & distributions, 2019-05, Vol.25 (5), p.839-852
issn 1366-9516
1472-4642
language eng
recordid cdi_proquest_journals_2213013904
source Jstor Journals Open Access
subjects BIODIVERSITY REVIEW
BIOMOD
Climate change
consensus forecast
Discriminant analysis
ecological niche models
ensemble
Generalized linear models
habitat suitability models
Information processing
Mathematical models
Methods
Modelling
Performance prediction
Species
species distribution model
Terrestrial environments
Workflow
title A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T21%3A38%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_JFNAL&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20review%20of%20evidence%20about%20use%20and%20performance%20of%20species%20distribution%20modelling%20ensembles%20like%20BIOMOD&rft.jtitle=Diversity%20&%20distributions&rft.au=Hao,%20Tianxiao&rft.date=2019-05-01&rft.volume=25&rft.issue=5&rft.spage=839&rft.epage=852&rft.pages=839-852&rft.issn=1366-9516&rft.eissn=1472-4642&rft_id=info:doi/10.1111/ddi.12892&rft_dat=%3Cjstor_JFNAL%3E26618567%3C/jstor_JFNAL%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3542-37e005bb9f5a949ed4dee6888977a773e9b618f0b5ef44fddcb9fe7078f7ff9d3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2213013904&rft_id=info:pmid/&rft_jstor_id=26618567&rfr_iscdi=true