
Self-attention mechanism to enhance the generalizability of data-driven time-series prediction: A case study of intra-hour power forecasting of urban distributed photovoltaic systems

The emergence of small-scale urban distributed solar generation (DSG) has urged the exploration of site-adaptive forecasting models designed to accurately predict future power outputs for unseen DSGs. In such scenarios, with numerous DSGs spread across utility-scale cities and a lack of historical d...

Bibliographic Details
Published in: Applied energy 2024-11, Vol.374, p.124007, Article 124007
Main Authors: Yu, Hanxin, Chen, Shanlin, Chu, Yinghao, Li, Mengying, Ding, Yueming, Cui, Rongxi, Zhao, Xin
Format: Article
Language: English
cited_by
cites cdi_FETCH-LOGICAL-c189t-d351e9629153c969ae6fffc3295f5e91b586e441d931e27452e7ec9dc52161153
container_end_page
container_issue
container_start_page 124007
container_title Applied energy
container_volume 374
creator Yu, Hanxin
Chen, Shanlin
Chu, Yinghao
Li, Mengying
Ding, Yueming
Cui, Rongxi
Zhao, Xin
description The emergence of small-scale urban distributed solar generation (DSG) has urged the exploration of site-adaptive forecasting models designed to accurately predict future power outputs for unseen DSGs. In such scenarios, with numerous DSGs spread across utility-scale cities and a lack of historical data, it is not economically viable to use conventional approaches that develop individual models for each DSG. Therefore, this work aims to tackle this real-world challenge by adapting the state-of-the-art, attention-based temporal fusion transformer (TFT) model to 188 real-world operational DSG data, thereby validating the generalizability of self-attention mechanism for multi-step time series forecasting. When adapted to unseen DSGs without training data, the experiment results demonstrate that the proposed solar TFT (STFT) improves by 11.07%, 17.58%, and 22.76% over the persistence model at the 10-, 20-, and 30-minute forecasts, respectively. Even when compared to representative deep-learning models, such as a long short-term memory model specialized in time series forecasting, STFT has demonstrated improved forecast accuracy, achieving 3.34%, 4.18%, and 5.85% enhancements at the 10-, 20-, and 30-minute forecast horizons, respectively. However, the model architecture of STFT is more complex, and the computational cost associated with it is relatively higher compared to other deep learning models. This trade-off between accuracy and computational efficiency should be considered in practical applications. The forecast performance is analyzed in three typical weather conditions, namely, clear, partly cloudy, and overcast. STFT demonstrates advantages in high variability periods, especially during weather transition periods, where reference models experience lagged predictions yielding relatively large errors. 
•The self-attention mechanism enhances the generalizability of distributed PV forecasts.
•The multi-step forecasts are validated using real-world operational data.
•The proposed model outperforms the reference models when adapted to unseen DSGs.
•The performance of the proposed model is superior in highly variable weather.
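The description above reports percentage improvements over a persistence model at several forecast horizons. The sketch below shows one common way such a figure can be computed. It assumes the improvement is the relative reduction in RMSE, which this record does not state. The function names and the toy series are hypothetical and are not taken from the article.

```python
import math


def persistence_forecast(series, horizon_steps):
    """Persistence baseline: the forecast for time t + h is the value observed at t.

    Returns (predictions, targets) aligned element by element.
    """
    if horizon_steps < 1 or horizon_steps >= len(series):
        raise ValueError("horizon_steps must be in [1, len(series) - 1]")
    predictions = series[:-horizon_steps]
    targets = series[horizon_steps:]
    return predictions, targets


def rmse(predictions, targets):
    """Root-mean-square error between two equally long sequences."""
    if len(predictions) != len(targets) or not predictions:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = [(p - t) ** 2 for p, t in zip(predictions, targets)]
    return math.sqrt(sum(squared) / len(squared))


def improvement_over_reference(model_preds, reference_preds, targets):
    """Percent reduction in RMSE relative to a reference forecast (assumed metric)."""
    return 100.0 * (1.0 - rmse(model_preds, targets) / rmse(reference_preds, targets))


# Toy data, purely illustrative: a steadily rising power series.
series = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
reference_preds, targets = persistence_forecast(series, horizon_steps=1)

# Hypothetical model whose forecasts are always 0.5 below the observed value.
model_preds = [t - 0.5 for t in targets]

improvement = improvement_over_reference(model_preds, reference_preds, targets)
print(f"Improvement over persistence: {improvement:.2f}%")
```

With 10-minute data, `horizon_steps` of 1, 2, and 3 would correspond to the 10-, 20-, and 30-minute horizons mentioned in the description. The sampling interval of the article's dataset is not given in this record, so that mapping is an assumption as well.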
doi_str_mv 10.1016/j.apenergy.2024.124007
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_apenergy_2024_124007</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0306261924013904</els_id><sourcerecordid>S0306261924013904</sourcerecordid><originalsourceid>FETCH-LOGICAL-c189t-d351e9629153c969ae6fffc3295f5e91b586e441d931e27452e7ec9dc52161153</originalsourceid><addsrcrecordid>eNqFkMGO1DAMhnsAiWXhFZBfoEOStpkNJ1YrYJFW4gCco0zizHjUJlXiGTQ8GM9Hy8CZky3Z3y_7a5o3UmykkPrtceNmTFj2l40Sqt9I1QuxfdbciE7oVmlpXjQvaz0KIZRU4qb59RXH2DpmTEw5wYT-4BLVCTgDpqX3CHxA2K-xbqSfbkcj8QVyhODYtaHQGRMwTdhWLIQV5oKB_Jr3Du7Bu4pQ-RT-MJS4uPaQTwXm_AMLxFxwWWFK-3V-KjuXIFDlQrsTY4D5kDmf88iOPNRLZZzqq-Z5dGPF13_rbfP944dvD4_t05dPnx_un1ov7wy3oRskGq2MHDpvtHGoY4y-U2aIAxq5G-409r0MppOotv2gcIveBD8oqeUC3Tb6mutLrrVgtHOhyZWLlcKuxu3R_jNuV-P2anwB319BXK47ExZbPeEiM9DyLtuQ6X8RvwEO6pUV</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Self-attention mechanism to enhance the generalizability of data-driven time-series prediction: A case study of intra-hour power forecasting of urban distributed photovoltaic systems</title><source>ScienceDirect Freedom Collection</source><creator>Yu, Hanxin ; Chen, Shanlin ; Chu, Yinghao ; Li, Mengying ; Ding, Yueming ; Cui, Rongxi ; Zhao, Xin</creator><creatorcontrib>Yu, Hanxin ; Chen, Shanlin ; Chu, Yinghao ; Li, Mengying ; Ding, Yueming ; Cui, Rongxi ; Zhao, Xin</creatorcontrib><description>The emergence of small-scale urban distributed solar generation (DSG) has urged the exploration of site-adaptive forecasting models designed to accurately predict future power outputs for unseen DSGs. In such scenarios, with numerous DSGs spread across utility-scale cities and a lack of historical data, it is not economically viable to use conventional approaches that develop individual models for each DSG. 
Therefore, this work aims to tackle this real-world challenge by adapting the state-of-the-art, attention-based temporal fusion transformer (TFT) model to 188 real-world operational DSG data, thereby validating the generalizability of self-attention mechanism for multi-step time series forecasting. When adapted to unseen DSGs without training data, the experiment results demonstrate that the proposed solar TFT (STFT) improves by 11.07%, 17.58%, and 22.76% over the persistence model at the 10-, 20-, and 30-minute forecasts, respectively. Even when compared to representative deep-learning models, such as a long short-term memory model specialized in time series forecasting, STFT has demonstrated improved forecast accuracy, achieving 3.34%, 4.18%, and 5.85% enhancements at the 10-, 20-, and 30-minute forecast horizons, respectively. However, the model architecture of STFT is more complex, and the computational cost associated with it is relatively higher compared to other deep learning models. This trade-off between accuracy and computational efficiency should be considered in practical applications. The forecast performance is analyzed in three typical weather conditions, namely, clear, partly cloudy, and overcast. STFT demonstrates advantages in high variability periods, especially during weather transition periods, where reference models experience lagged predictions yielding relatively large errors. 
•The self-attention mechanism enhances the generalizability of distributed PV forecasts.•The multi-step forecasts are validated using real-world operational data.•The proposed model outperforms the reference models when adapted to unseen DSGs.•The performance of the proposed model is superior in highly variable weather.</description><identifier>ISSN: 0306-2619</identifier><identifier>DOI: 10.1016/j.apenergy.2024.124007</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Attention mechanism ; Data-driven models ; Distributed photovoltaic ; Model generalizability ; Solar forecast</subject><ispartof>Applied energy, 2024-11, Vol.374, p.124007, Article 124007</ispartof><rights>2024 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c189t-d351e9629153c969ae6fffc3295f5e91b586e441d931e27452e7ec9dc52161153</cites><orcidid>0009-0007-5020-8968 ; 0000-0003-1651-4324 ; 0009-0007-2898-6587 ; 0000-0003-2457-8264</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Yu, Hanxin</creatorcontrib><creatorcontrib>Chen, Shanlin</creatorcontrib><creatorcontrib>Chu, Yinghao</creatorcontrib><creatorcontrib>Li, Mengying</creatorcontrib><creatorcontrib>Ding, Yueming</creatorcontrib><creatorcontrib>Cui, Rongxi</creatorcontrib><creatorcontrib>Zhao, Xin</creatorcontrib><title>Self-attention mechanism to enhance the generalizability of data-driven time-series prediction: A case study of intra-hour power forecasting of urban distributed photovoltaic systems</title><title>Applied energy</title><description>The emergence of small-scale urban distributed solar generation (DSG) has urged the exploration of site-adaptive forecasting models designed to accurately predict future power outputs for 
unseen DSGs. In such scenarios, with numerous DSGs spread across utility-scale cities and a lack of historical data, it is not economically viable to use conventional approaches that develop individual models for each DSG. Therefore, this work aims to tackle this real-world challenge by adapting the state-of-the-art, attention-based temporal fusion transformer (TFT) model to 188 real-world operational DSG data, thereby validating the generalizability of self-attention mechanism for multi-step time series forecasting. When adapted to unseen DSGs without training data, the experiment results demonstrate that the proposed solar TFT (STFT) improves by 11.07%, 17.58%, and 22.76% over the persistence model at the 10-, 20-, and 30-minute forecasts, respectively. Even when compared to representative deep-learning models, such as a long short-term memory model specialized in time series forecasting, STFT has demonstrated improved forecast accuracy, achieving 3.34%, 4.18%, and 5.85% enhancements at the 10-, 20-, and 30-minute forecast horizons, respectively. However, the model architecture of STFT is more complex, and the computational cost associated with it is relatively higher compared to other deep learning models. This trade-off between accuracy and computational efficiency should be considered in practical applications. The forecast performance is analyzed in three typical weather conditions, namely, clear, partly cloudy, and overcast. STFT demonstrates advantages in high variability periods, especially during weather transition periods, where reference models experience lagged predictions yielding relatively large errors. 
•The self-attention mechanism enhances the generalizability of distributed PV forecasts.•The multi-step forecasts are validated using real-world operational data.•The proposed model outperforms the reference models when adapted to unseen DSGs.•The performance of the proposed model is superior in highly variable weather.</description><subject>Attention mechanism</subject><subject>Data-driven models</subject><subject>Distributed photovoltaic</subject><subject>Model generalizability</subject><subject>Solar forecast</subject><issn>0306-2619</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNqFkMGO1DAMhnsAiWXhFZBfoEOStpkNJ1YrYJFW4gCco0zizHjUJlXiGTQ8GM9Hy8CZky3Z3y_7a5o3UmykkPrtceNmTFj2l40Sqt9I1QuxfdbciE7oVmlpXjQvaz0KIZRU4qb59RXH2DpmTEw5wYT-4BLVCTgDpqX3CHxA2K-xbqSfbkcj8QVyhODYtaHQGRMwTdhWLIQV5oKB_Jr3Du7Bu4pQ-RT-MJS4uPaQTwXm_AMLxFxwWWFK-3V-KjuXIFDlQrsTY4D5kDmf88iOPNRLZZzqq-Z5dGPF13_rbfP944dvD4_t05dPnx_un1ov7wy3oRskGq2MHDpvtHGoY4y-U2aIAxq5G-409r0MppOotv2gcIveBD8oqeUC3Tb6mutLrrVgtHOhyZWLlcKuxu3R_jNuV-P2anwB319BXK47ExZbPeEiM9DyLtuQ6X8RvwEO6pUV</recordid><startdate>20241115</startdate><enddate>20241115</enddate><creator>Yu, Hanxin</creator><creator>Chen, Shanlin</creator><creator>Chu, Yinghao</creator><creator>Li, Mengying</creator><creator>Ding, Yueming</creator><creator>Cui, Rongxi</creator><creator>Zhao, Xin</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0009-0007-5020-8968</orcidid><orcidid>https://orcid.org/0000-0003-1651-4324</orcidid><orcidid>https://orcid.org/0009-0007-2898-6587</orcidid><orcidid>https://orcid.org/0000-0003-2457-8264</orcidid></search><sort><creationdate>20241115</creationdate><title>Self-attention mechanism to enhance the generalizability of data-driven time-series prediction: A case study of intra-hour power forecasting of urban distributed photovoltaic systems</title><author>Yu, Hanxin ; Chen, Shanlin ; 
Chu, Yinghao ; Li, Mengying ; Ding, Yueming ; Cui, Rongxi ; Zhao, Xin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c189t-d351e9629153c969ae6fffc3295f5e91b586e441d931e27452e7ec9dc52161153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Attention mechanism</topic><topic>Data-driven models</topic><topic>Distributed photovoltaic</topic><topic>Model generalizability</topic><topic>Solar forecast</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Hanxin</creatorcontrib><creatorcontrib>Chen, Shanlin</creatorcontrib><creatorcontrib>Chu, Yinghao</creatorcontrib><creatorcontrib>Li, Mengying</creatorcontrib><creatorcontrib>Ding, Yueming</creatorcontrib><creatorcontrib>Cui, Rongxi</creatorcontrib><creatorcontrib>Zhao, Xin</creatorcontrib><collection>CrossRef</collection><jtitle>Applied energy</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Hanxin</au><au>Chen, Shanlin</au><au>Chu, Yinghao</au><au>Li, Mengying</au><au>Ding, Yueming</au><au>Cui, Rongxi</au><au>Zhao, Xin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Self-attention mechanism to enhance the generalizability of data-driven time-series prediction: A case study of intra-hour power forecasting of urban distributed photovoltaic systems</atitle><jtitle>Applied energy</jtitle><date>2024-11-15</date><risdate>2024</risdate><volume>374</volume><spage>124007</spage><pages>124007-</pages><artnum>124007</artnum><issn>0306-2619</issn><abstract>The emergence of small-scale urban distributed solar generation (DSG) has urged the exploration of site-adaptive forecasting models designed to accurately predict future power outputs for unseen DSGs. 
In such scenarios, with numerous DSGs spread across utility-scale cities and a lack of historical data, it is not economically viable to use conventional approaches that develop individual models for each DSG. Therefore, this work aims to tackle this real-world challenge by adapting the state-of-the-art, attention-based temporal fusion transformer (TFT) model to 188 real-world operational DSG data, thereby validating the generalizability of self-attention mechanism for multi-step time series forecasting. When adapted to unseen DSGs without training data, the experiment results demonstrate that the proposed solar TFT (STFT) improves by 11.07%, 17.58%, and 22.76% over the persistence model at the 10-, 20-, and 30-minute forecasts, respectively. Even when compared to representative deep-learning models, such as a long short-term memory model specialized in time series forecasting, STFT has demonstrated improved forecast accuracy, achieving 3.34%, 4.18%, and 5.85% enhancements at the 10-, 20-, and 30-minute forecast horizons, respectively. However, the model architecture of STFT is more complex, and the computational cost associated with it is relatively higher compared to other deep learning models. This trade-off between accuracy and computational efficiency should be considered in practical applications. The forecast performance is analyzed in three typical weather conditions, namely, clear, partly cloudy, and overcast. STFT demonstrates advantages in high variability periods, especially during weather transition periods, where reference models experience lagged predictions yielding relatively large errors. 
•The self-attention mechanism enhances the generalizability of distributed PV forecasts.•The multi-step forecasts are validated using real-world operational data.•The proposed model outperforms the reference models when adapted to unseen DSGs.•The performance of the proposed model is superior in highly variable weather.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.apenergy.2024.124007</doi><orcidid>https://orcid.org/0009-0007-5020-8968</orcidid><orcidid>https://orcid.org/0000-0003-1651-4324</orcidid><orcidid>https://orcid.org/0009-0007-2898-6587</orcidid><orcidid>https://orcid.org/0000-0003-2457-8264</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0306-2619
ispartof Applied energy, 2024-11, Vol.374, p.124007, Article 124007
issn 0306-2619
language eng
recordid cdi_crossref_primary_10_1016_j_apenergy_2024_124007
source ScienceDirect Freedom Collection
subjects Attention mechanism
Data-driven models
Distributed photovoltaic
Model generalizability
Solar forecast
title Self-attention mechanism to enhance the generalizability of data-driven time-series prediction: A case study of intra-hour power forecasting of urban distributed photovoltaic systems
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T07%3A23%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Self-attention%20mechanism%20to%20enhance%20the%20generalizability%20of%20data-driven%20time-series%20prediction:%20A%20case%20study%20of%20intra-hour%20power%20forecasting%20of%20urban%20distributed%20photovoltaic%20systems&rft.jtitle=Applied%20energy&rft.au=Yu,%20Hanxin&rft.date=2024-11-15&rft.volume=374&rft.spage=124007&rft.pages=124007-&rft.artnum=124007&rft.issn=0306-2619&rft_id=info:doi/10.1016/j.apenergy.2024.124007&rft_dat=%3Celsevier_cross%3ES0306261924013904%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c189t-d351e9629153c969ae6fffc3295f5e91b586e441d931e27452e7ec9dc52161153%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true