Loading…

Development of novel hybrid machine learning models for monthly thunderstorm frequency prediction over Bangladesh

Thunderstorm frequency (TSF) prediction with higher accuracy is of great significance under climate extremes for reducing potential damages. However, TSF prediction has received little attention because a thunderstorm event is a combination of intricate and unique weather scenarios with high instabi...

Full description

Saved in:
Bibliographic Details
Published in:Natural hazards (Dordrecht) 2021-08, Vol.108 (1), p.1109-1135
Main Authors: Azad, Md. Abul Kalam, Islam, Abu Reza Md. Towfiqul, Rahman, Md. Siddiqur, Ayen, Kurratul
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Thunderstorm frequency (TSF) prediction with higher accuracy is of great significance under climate extremes for reducing potential damages. However, TSF prediction has received little attention because a thunderstorm event is a combination of intricate and unique weather scenarios with high instability, making it difficult to predict. To close this gap, we proposed two novel hybrid machine learning models through hybridization of data pre-processing ensemble empirical mode decomposition (EEMD) with two state-of-arts models, namely artificial neural network (ANN), support vector machine for TSF prediction at three categories over Bangladesh. We have demarcated the yearly TSF datasets into three categories for the period 1981–2016 recorded at 28 sites; high (March–June), moderate (July–October), and low (November–February) TSF months. The performance of the proposed EEMD-ANN and EEMD-SVM hybrid models was compared with classical ANN, SVM, and autoregressive integrated moving average. EEMD-ANN and EEMD-SVM hybrid models showed 8.02–22.48% higher performance precision in terms of root mean square error compared to other models at high-, moderate-, and low-frequency categories. Eleven out of 21 input parameters were selected based on the random forest variable importance analysis. The sensitivity analysis results showed that each input parameter was positively contributed to building the best model of each category, and thunderstorm days are the most contributing parameters influencing TSF prediction. The proposed hybrid models outperformed the conventional models where EEMD-ANN is the most skillful for high TSF prediction, and EEMD-SVM is for moderate and low TSF prediction. The findings indicate the potential of hybridization of EEMD with the conventional models for improving prediction precision. The hybrid models developed in this work can be adopted for TSF prediction in Bangladesh as well as different parts of the world.
ISSN:0921-030X
1573-0840
DOI:10.1007/s11069-021-04722-9