Loading…
Evaluating complex relationships between ecological indicators and environmental factors in the Baltic Sea: A machine learning approach
•Bayesian network classifiers enable analyzing probabilistic dependencies in data.•We provide a protocol for assessing indicator responses to environmental factors.•IEMD discretization enables identifying relevant factors and threshold values.•Entropy reduction analysis aids finding the most robust...
Saved in:
Published in: | Ecological indicators 2019-06, Vol.101, p.117-125 |
---|---|
Main Authors: | , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Bayesian network classifiers enable analyzing probabilistic dependencies in data.•We provide a protocol for assessing indicator responses to environmental factors.•IEMD discretization enables identifying relevant factors and threshold values.•Entropy reduction analysis aids finding the most robust predictors for the indicator.•A case example on the Baltic Sea coastal fish indicators is presented.
The state of marine ecosystems is increasingly evaluated using indicators. The indicator assessment results need to be understood in the context of the whole ecosystem in order to understand the key factors determining the status of these environmental components. Data available from the system’s different components are, however, often heterogeneous: they may represent different spatial and temporal scales, and different parameters can be measured with different accuracy. This makes it difficult to evaluate the relationship between these variables and status of the environment using indicators. We studied whether probabilistic, machine learning-based classifiers could provide for assessing the relationships between multiple environmental factors and ecological indicators. This paper demonstrates the use of Bayesian network classifiers (Tree-augmented Naive Bayes classifier, TAN as the specific case example), used together with structural learning from data and Entropy Minimization Discretization (IEMD) algorithm to study environment-indicator relationships within coastal fish communities in the Baltic Sea. By using two Baltic-wide indicators of coastal fish community status and a heterogeneous set of potentially influential natural and anthropogenic variables, we explore and discuss the potential of the approach. Given pre-defined cutting points for the indicators, such as the classification thresholds of the indicator, the method enables identifying relevant variables and estimating their relative importance. This information could be used in environmental management to demonstrate at which threshold value the state of an indicator is likely to respond to a pressure or a combination of pressures. In contrast to many other multivariate statistical methodologies, the presented approach can handle missing data as well as data of varying types, from fully quantitative to presence-absence, in the same analysis. |
---|---|
ISSN: | 1470-160X 1872-7034 1872-7034 |
DOI: | 10.1016/j.ecolind.2018.12.053 |