Loading…

Comparison of feature selection methods using ANNs in MCP-wind speed methods. A case study

•An analysis is carried out of the benefits of feature selection in MCP methods which use ANNs.•The wrapper approach (WA) generated lower mean errors than the filter approach (FA).•No significant statistical difference was observed between the WA and the FA in certain cases.•The FA generated models...

Full description

Saved in:
Bibliographic Details
Published in:Applied energy 2015-11, Vol.158, p.490-507
Main Authors: Carta, José A., Cabrera, Pedro, Matías, José M., Castellano, Fernando
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•An analysis is carried out of the benefits of feature selection in MCP methods which use ANNs.•The wrapper approach (WA) generated lower mean errors than the filter approach (FA).•No significant statistical difference was observed between the WA and the FA in certain cases.•The FA generated models somewhat simpler and more interpretable than the WA.•The WA displayed better predictive capacity than the FA, but is more computationally intensive. Recent studies in the field of renewable energies, and specifically in wind resource prediction, have shown growing interest in proposals for Measure–Correlate–Predict (MCP) methods which simultaneously use data recorded at various reference weather stations. In this context, the use of a high number of reference stations may result in overspecification with its associated negative effects. These include, amongst others, an increase in the estimation error and/or overfitting which could be detrimental to the generalisation capacity of the model when handling new data (prediction). This paper analyses the benefits of feature selection for use with Artificial Neural Network (ANN) techniques with a multilayer perceptron (MLP) structure when the ANNs are used as MCP methods to predict mean hourly wind speeds at a target site. The features considered in this study were the mean hourly wind speeds and directions recorded in 2003 and 2004 at five weather stations in the Canary Archipelago (Spain). The two feature selection techniques considered in the analysis were the Correlation Feature Selection (CFS), which is a correlation-based filter approach (FA), and an MLP-based wrapper approach (WA). The metrics used to compare the results were the mean absolute error (MAE), the mean absolute percentage error (MAPE) and the index of agreement (IoA). Evaluation of the mean errors obtained in the 10-fold cross-validation tests for the year used to represent the short-term wind data period resulted in several conclusions. These included, notably, that the WA gave lower mean errors than the FA in 100% of the cases analysed independently of the metric employed. However, the FA resulted in a significant reduction in computational load and considerable enhancement of model interpretability. When very good correlation coefficients were obtained between the target and reference stations, no significant statistical difference was observed at 5% level between the three models (FA, WA and the models constructed with all the variables) in mo
ISSN:0306-2619
1872-9118
DOI:10.1016/j.apenergy.2015.08.102