Loading…

Snow water equivalent prediction in a mountainous area using hybrid bagging machine learning approaches

Snow Water Equivalent (SWE) is one of the most critical variables in mountainous watersheds and needs to be considered in water resources management plans. As direct measurement of SWE is difficult and empirical equations are highly uncertain, the present study aimed to obtain accurate predictions o...

Full description

Saved in:
Bibliographic Details
Published in:Acta geophysica 2023-04, Vol.71 (2), p.1015-1031
Main Authors: Khosravi, Khabat, Golkarian, Ali, Omidvar, Ebrahim, Hatamiafkoueieh, Javad, Shirali, Masoud
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Snow Water Equivalent (SWE) is one of the most critical variables in mountainous watersheds and needs to be considered in water resources management plans. As direct measurement of SWE is difficult and empirical equations are highly uncertain, the present study aimed to obtain accurate predictions of SWE using machine learning methods. Five standalone algorithms of tree-based [M5P and random tree (RT)], rule-based [M5Rules (M5R)] and lazy-based learner (IBK and Kstar) and five novel hybrid bagging-based algorithms (BA) with standalone models (i.e., BA-M5P, BA-RT, BA-IBK, BA-Kstar and BA-M5R) were developed. A total of 2550 snow measurements were collected from 62 snow and rain-gauge stations located in 13 mountainous provinces in Iran. Data including ice beneath the snow (IBS), fresh snow depth (FSD), length of snow sample (LSS), snow density (SDN), snow depth (SD) and time of falling (TS) were measured. Based on the Pearson correlation between inputs (IBS, FSD, LSS, SDN, SD and TS) and output (SWE), six different input combinations were constructed. The dataset was separated into two groups (70% and 30% of the data) by a cross-validation technique for model construction (training dataset) and model evaluation (testing dataset), respectively. Different visual and quantitative metrics (e.g., Nash–Sutcliffe efficiency (NSE)) were used for evaluating model accuracy. It was found that SD had the highest correlation with SWE in Iran ( r  = 0.73). In general, the bootstrap aggregation (i.e., bagging) hybrid machine learning methods (BA-M5P, BA-RT, BA-IBK, BA-Kstar and BA-M5R) increased prediction accuracy when compared to each standalone method. While BA-M5R had the highest prediction accuracy (NSE = 0.83) (considering all six input variables), BA-IBK could predict SWE with high accuracy (NSE = 0.71) using only two input variables (SD and LSS). Our findings demonstrate that SWE can be accurately predicted through a variety of machine learning methods using easily measurable variables and may be useful for applications in other mountainous regions across the globe.
ISSN:1895-7455
1895-6572
1895-7455
DOI:10.1007/s11600-022-00934-0