Loading…
Dealing with data conflicts in statistical inference of population assessment models that integrate information from multiple diverse data sets
•Contemporary fisheries stock assessment models often use multiple diverse data.•Structure, variation, and sampling must be modelled appropriately to minimize bias.•Even the basic processes are misspecified.•Misspecified processes result in data conflicts.•External estimation of sampling variance, i...
Saved in:
Published in: | Fisheries research 2017-08, Vol.192, p.16-27 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Contemporary fisheries stock assessment models often use multiple diverse data.•Structure, variation, and sampling must be modelled appropriately to minimize bias.•Even the basic processes are misspecified.•Misspecified processes result in data conflicts.•External estimation of sampling variance, internal estimation of process variance.
Contemporary fisheries stock assessments often use multiple diverse data sets to extract as much information as possible about biological and fishery processes. However, models are, by definition, simplifications of reality and, therefore, misspecified. Model misspecification can cause degradation of results when multiple data sets are analyzed simultaneously. The process, observation, and sampling components of the model must all be, at least, approximately correct to minimize bias. Unfortunately, even the basic processes that are usually considered well understood (e.g., growth and selectivity) are misspecified in most, if not all, stock assessments. These misspecified processes, in combination with use of composition data, result in biased estimates of absolute abundance and abundance trends, which are often evident as “data conflicts.” This is compounded by over-weighting of composition data in many assessments owing to misuse of data-weighting approaches. The ‘law of conflicting data’ states that since data are facts, conflicting data implies model misspecification, but must be interpreted in the context of random sampling error. Down-weighting (or dropping) conflicting data is not necessarily appropriate because it may not resolve the model misspecification. Model misspecification and process variation can be accounted for in the variance parameters of the likelihoods (sampling error), but it is unclear when, or even if, this is appropriate. The appropriate method to deal with data conflicts depends on whether it is caused by random sampling error, process variation, observation model misspecification, or misspecification of the system (dynamics) model. Diagnostic approaches are urgently needed to evaluate goodness of fit and to identify model misspecification. We recommend external estimation of the sampling error variance in likelihood functions, modelling process variation in integrated models, and internal estimation of the standard deviation of the process variation. The required statistical framework is computationally intensive, but practical approximations are available, computational algorithms are being imp |
---|---|
ISSN: | 0165-7836 1872-6763 |
DOI: | 10.1016/j.fishres.2016.04.022 |