Comparison of feature selection methods using ANNs in MCP-wind speed methods. A case study
Published in: | Applied energy 2015-11, Vol.158, p.490-507 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | • An analysis is carried out of the benefits of feature selection in MCP methods which use ANNs.
• The wrapper approach (WA) generated lower mean errors than the filter approach (FA).
• No significant statistical difference was observed between the WA and the FA in certain cases.
• The FA generated models somewhat simpler and more interpretable than the WA.
• The WA displayed better predictive capacity than the FA, but is more computationally intensive.
Recent studies in the field of renewable energies, and specifically in wind resource prediction, have shown growing interest in proposals for Measure–Correlate–Predict (MCP) methods which simultaneously use data recorded at various reference weather stations. In this context, the use of a high number of reference stations may result in overspecification with its associated negative effects. These include, amongst others, an increase in the estimation error and/or overfitting which could be detrimental to the generalisation capacity of the model when handling new data (prediction).
This paper analyses the benefits of feature selection for use with Artificial Neural Network (ANN) techniques with a multilayer perceptron (MLP) structure when the ANNs are used as MCP methods to predict mean hourly wind speeds at a target site. The features considered in this study were the mean hourly wind speeds and directions recorded in 2003 and 2004 at five weather stations in the Canary Archipelago (Spain).
The two feature selection techniques considered in the analysis were the Correlation Feature Selection (CFS), which is a correlation-based filter approach (FA), and an MLP-based wrapper approach (WA). The metrics used to compare the results were the mean absolute error (MAE), the mean absolute percentage error (MAPE) and the index of agreement (IoA).
Evaluation of the mean errors obtained in the 10-fold cross-validation tests for the year used to represent the short-term wind data period led to several conclusions. Notably, the WA gave lower mean errors than the FA in 100% of the cases analysed, regardless of the metric employed. However, the FA resulted in a significant reduction in computational load and a considerable enhancement of model interpretability. When very good correlation coefficients were obtained between the target and reference stations, no statistically significant difference was observed at the 5% level between the three models (FA, WA and the models constructed with all the variables) in mo |
---|---|
ISSN: | 0306-2619 1872-9118 |
DOI: | 10.1016/j.apenergy.2015.08.102 |
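
The three comparison metrics named in the summary (MAE, MAPE and IoA) can be illustrated with a minimal Python sketch. This is not taken from the article: the function names are hypothetical, the IoA is assumed to follow Willmott's original definition (the summary does not say which variant was used), and the MAPE is assumed to be expressed as a percentage of the observed wind speed.

```python
from statistics import fmean


def mae(observed, predicted):
    """Mean absolute error between observed and predicted wind speeds."""
    return fmean([abs(p - o) for o, p in zip(observed, predicted)])


def mape(observed, predicted):
    """Mean absolute percentage error, in percent.

    Assumption: every observed value is non-zero; calm hours (0 m/s)
    would have to be filtered out or handled separately beforehand.
    """
    return 100.0 * fmean([abs(p - o) / abs(o) for o, p in zip(observed, predicted)])


def index_of_agreement(observed, predicted):
    """Index of agreement, assuming Willmott's original formulation:

    IoA = 1 - sum((P - O)^2) / sum((|P - mean(O)| + |O - mean(O)|)^2)

    Assumption: the observations are not all identical, so the
    denominator is non-zero.
    """
    o_bar = fmean(observed)
    numerator = sum((p - o) ** 2 for o, p in zip(observed, predicted))
    denominator = sum(
        (abs(p - o_bar) + abs(o - o_bar)) ** 2 for o, p in zip(observed, predicted)
    )
    return 1.0 - numerator / denominator


if __name__ == "__main__":
    # Made-up hourly wind speeds in m/s, for illustration only.
    observed = [4.0, 5.0, 8.0]
    predicted = [5.0, 5.0, 6.0]
    print(mae(observed, predicted))
    print(mape(observed, predicted))
    print(index_of_agreement(observed, predicted))
```

Lower MAE and MAPE indicate a better fit, whereas an IoA closer to 1 indicates better agreement, so the three metrics need to be read in opposite directions when comparing models.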