Loading…
Streamflow forecasting using extreme gradient boosting model coupled with Gaussian mixture model
•A new model is developed for streamflow forecasting based on the principle of modular model.•Extreme gradient boosting (XGBoost)-based models produced satisfactory forecasts.•Gaussian mixture model was coupled with XGBoost, and the results indicated the hybrid model was a superior alternative. The...
Saved in:
Published in: | Journal of hydrology (Amsterdam) 2020-07, Vol.586, p.124901, Article 124901 |
---|---|
Main Authors: | , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •A new model is developed for streamflow forecasting based on the principle of modular model.•Extreme gradient boosting (XGBoost)-based models produced satisfactory forecasts.•Gaussian mixture model was coupled with XGBoost, and the results indicated the hybrid model was a superior alternative.
The establishment of an accurate and reliable forecasting model is important for water resource planning and management. In this study, we developed a hybrid model (namely GMM-XGBoost), coupling extreme gradient boosting (XGBoost) with Gaussian mixture model (GMM), for monthly streamflow forecasting. The proposed model is based on the principle of modular model, where a complex problem is divided into several simple ones. GMM was applied to cluster streamflow into several groups, using the features selected by a tree-based method. Then, each group was used to fit several single XGBoosts. And the prediction is a weighted average of the single models. Monthly streamflow data at Cuntan and Hankou stations on Yangtze River Basin were used to evaluate the performance of the proposed model. To compare the forecasting efficiency, support vector machine (SVM) and standalone XGBoost were selected as the benchmark models. The results indicated that although all three models yielded quite good performance on one-month ahead forecasting with high Nash-Sutclitte efficiency coefficient (NSE) and low root mean squared error (RMSE), GMM-XGBoost provided the best accuracy with significant improvement of forecasting accuracy. It can be inferred from the results that (1) XGBoost is applicable for streamflow forecasting, and in general, performs better than SVM; (2) the cluster analysis-based modular model is helpful in improving accuracy and capturing the complicated patterns of hydrological process; (3) the proposed GMM-XGBoost model is a superior alternative, which can provide accurate and reliable predictions for optimal water resources management. |
---|---|
ISSN: | 0022-1694 1879-2707 |
DOI: | 10.1016/j.jhydrol.2020.124901 |