Loading…

Evaluating complex relationships between ecological indicators and environmental factors in the Baltic Sea: A machine learning approach

•Bayesian network classifiers enable analyzing probabilistic dependencies in data.•We provide a protocol for assessing indicator responses to environmental factors.•IEMD discretization enables identifying relevant factors and threshold values.•Entropy reduction analysis aids finding the most robust...

Full description

Saved in:
Bibliographic Details
Published in:Ecological indicators 2019-06, Vol.101, p.117-125
Main Authors: Lehikoinen, Annukka, Olsson, Jens, Bergström, Lena, Bergström, Ulf, Bryhn, Andreas, Fredriksson, Ronny, Uusitalo, Laura
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Bayesian network classifiers enable analyzing probabilistic dependencies in data.•We provide a protocol for assessing indicator responses to environmental factors.•IEMD discretization enables identifying relevant factors and threshold values.•Entropy reduction analysis aids finding the most robust predictors for the indicator.•A case example on the Baltic Sea coastal fish indicators is presented. The state of marine ecosystems is increasingly evaluated using indicators. The indicator assessment results need to be understood in the context of the whole ecosystem in order to understand the key factors determining the status of these environmental components. Data available from the system’s different components are, however, often heterogeneous: they may represent different spatial and temporal scales, and different parameters can be measured with different accuracy. This makes it difficult to evaluate the relationship between these variables and status of the environment using indicators. We studied whether probabilistic, machine learning-based classifiers could provide for assessing the relationships between multiple environmental factors and ecological indicators. This paper demonstrates the use of Bayesian network classifiers (Tree-augmented Naive Bayes classifier, TAN as the specific case example), used together with structural learning from data and Entropy Minimization Discretization (IEMD) algorithm to study environment-indicator relationships within coastal fish communities in the Baltic Sea. By using two Baltic-wide indicators of coastal fish community status and a heterogeneous set of potentially influential natural and anthropogenic variables, we explore and discuss the potential of the approach. Given pre-defined cutting points for the indicators, such as the classification thresholds of the indicator, the method enables identifying relevant variables and estimating their relative importance. This information could be used in environmental management to demonstrate at which threshold value the state of an indicator is likely to respond to a pressure or a combination of pressures. In contrast to many other multivariate statistical methodologies, the presented approach can handle missing data as well as data of varying types, from fully quantitative to presence-absence, in the same analysis.
ISSN:1470-160X
1872-7034
1872-7034
DOI:10.1016/j.ecolind.2018.12.053