Loading…

Optimizing Battery Storage Systems in Energy Microgrids: A Reinforcement Learning Approach Comparing Multiple Reward Functions

Battery Storage Systems (BSS) are increasingly utilized to enhance renewable energy consumption and operational stability in energy microgrids. However, the uncertainty characterizing renewable energy generation poses significant challenges in the optimal control of BSS. Multiple Reinforcement Learn...

Full description

Saved in:
Bibliographic Details
Main Authors: Ghione, Giorgia, Randazzo, Vincenzo, Badami, Marco, Pasero, Eros
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 646
container_issue
container_start_page 642
container_title
container_volume
creator Ghione, Giorgia
Randazzo, Vincenzo
Badami, Marco
Pasero, Eros
description Battery Storage Systems (BSS) are increasingly utilized to enhance renewable energy consumption and operational stability in energy microgrids. However, the uncertainty characterizing renewable energy generation poses significant challenges in the optimal control of BSS. Multiple Reinforcement Learning (RL) approaches have been presented to solve this optimization problem. However, a comparison of different targets for the training of the RL systems is rarely performed. This work compares different reward functions that enable efficient BSS usage in the power plant of a transport hub while expressing the problem as a partially observable Markov Decision Process (POMDP). A Proximal Policy Optimization (PPO) algorithm is trained using reward functions derived from financial targets and BSS efficiency objectives. Results indicate that reward functions aligning BSS usage with market trends lead to superior performance compared to traditional earnings-based objectives. Furthermore, limitations regarding training episode numbers and reward normalization are identified, suggesting avenues for future research. This study contributes to advancing RL-based approaches for optimal BSS management in energy microgrid environments.
doi_str_mv 10.1109/RTSI61910.2024.10761708
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_10761708</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10761708</ieee_id><sourcerecordid>10761708</sourcerecordid><originalsourceid>FETCH-ieee_primary_107617083</originalsourceid><addsrcrecordid>eNqFj0FLAzEUhKMgWHT_geD7A63Jpu5mvdXSUsEidHsvYfu6PtlNwkuKrAd_uy3o2dMw8zEDI8S9khOlZPWw2dYvhapONpf5dKJkWahSmguRVWVl9KPURa60uRSjvDDluDCqvBZZjB9SSp1LPTV6JL7fQqKevsi18GxTQh6gTp5ti1APMWEfgRwsHHI7wJoa9i3TPj7BDDZI7uC5wR5dgle07M4zsxDY2-Yd5r4Pls_R-tglCh2eKp-W97A8uiaRd_FWXB1sFzH71Rtxt1xs56sxIeIuMPWWh93fNf0P_gHYy1RZ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Optimizing Battery Storage Systems in Energy Microgrids: A Reinforcement Learning Approach Comparing Multiple Reward Functions</title><source>IEEE Xplore All Conference Series</source><creator>Ghione, Giorgia ; Randazzo, Vincenzo ; Badami, Marco ; Pasero, Eros</creator><creatorcontrib>Ghione, Giorgia ; Randazzo, Vincenzo ; Badami, Marco ; Pasero, Eros</creatorcontrib><description>Battery Storage Systems (BSS) are increasingly utilized to enhance renewable energy consumption and operational stability in energy microgrids. However, the uncertainty characterizing renewable energy generation poses significant challenges in the optimal control of BSS. Multiple Reinforcement Learning (RL) approaches have been presented to solve this optimization problem. However, a comparison of different targets for the training of the RL systems is rarely performed. This work compares different reward functions that enable efficient BSS usage in the power plant of a transport hub while expressing the problem as a partially observable Markov Decision Process (POMDP). A Proximal Policy Optimization (PPO) algorithm is trained using reward functions derived from financial targets and BSS efficiency objectives. Results indicate that reward functions aligning BSS usage with market trends lead to superior performance compared to traditional earnings-based objectives. Furthermore, limitations regarding training episode numbers and reward normalization are identified, suggesting avenues for future research. This study contributes to advancing RL-based approaches for optimal BSS management in energy microgrid environments.</description><identifier>EISSN: 2687-6817</identifier><identifier>EISBN: 9798350362138</identifier><identifier>DOI: 10.1109/RTSI61910.2024.10761708</identifier><language>eng</language><publisher>IEEE</publisher><subject>Batteries ; Battery energy storage systems ; Deep Reinforce-ment Learning ; Energy Microgrids ; Microgrids ; Optimal control ; Optimization ; Power generation ; Proximal Policy Optimization ; Reinforcement learning ; Renewable energy sources ; Technological innovation ; Training ; Uncertainty</subject><ispartof>IEEE ... International Forum on Research and Technologies for Society and Industry (Online), 2024, p.642-646</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10761708$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10761708$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Ghione, Giorgia</creatorcontrib><creatorcontrib>Randazzo, Vincenzo</creatorcontrib><creatorcontrib>Badami, Marco</creatorcontrib><creatorcontrib>Pasero, Eros</creatorcontrib><title>Optimizing Battery Storage Systems in Energy Microgrids: A Reinforcement Learning Approach Comparing Multiple Reward Functions</title><title>IEEE ... International Forum on Research and Technologies for Society and Industry (Online)</title><addtitle>RTSI</addtitle><description>Battery Storage Systems (BSS) are increasingly utilized to enhance renewable energy consumption and operational stability in energy microgrids. However, the uncertainty characterizing renewable energy generation poses significant challenges in the optimal control of BSS. Multiple Reinforcement Learning (RL) approaches have been presented to solve this optimization problem. However, a comparison of different targets for the training of the RL systems is rarely performed. This work compares different reward functions that enable efficient BSS usage in the power plant of a transport hub while expressing the problem as a partially observable Markov Decision Process (POMDP). A Proximal Policy Optimization (PPO) algorithm is trained using reward functions derived from financial targets and BSS efficiency objectives. Results indicate that reward functions aligning BSS usage with market trends lead to superior performance compared to traditional earnings-based objectives. Furthermore, limitations regarding training episode numbers and reward normalization are identified, suggesting avenues for future research. This study contributes to advancing RL-based approaches for optimal BSS management in energy microgrid environments.</description><subject>Batteries</subject><subject>Battery energy storage systems</subject><subject>Deep Reinforce-ment Learning</subject><subject>Energy Microgrids</subject><subject>Microgrids</subject><subject>Optimal control</subject><subject>Optimization</subject><subject>Power generation</subject><subject>Proximal Policy Optimization</subject><subject>Reinforcement learning</subject><subject>Renewable energy sources</subject><subject>Technological innovation</subject><subject>Training</subject><subject>Uncertainty</subject><issn>2687-6817</issn><isbn>9798350362138</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNqFj0FLAzEUhKMgWHT_geD7A63Jpu5mvdXSUsEidHsvYfu6PtlNwkuKrAd_uy3o2dMw8zEDI8S9khOlZPWw2dYvhapONpf5dKJkWahSmguRVWVl9KPURa60uRSjvDDluDCqvBZZjB9SSp1LPTV6JL7fQqKevsi18GxTQh6gTp5ti1APMWEfgRwsHHI7wJoa9i3TPj7BDDZI7uC5wR5dgle07M4zsxDY2-Yd5r4Pls_R-tglCh2eKp-W97A8uiaRd_FWXB1sFzH71Rtxt1xs56sxIeIuMPWWh93fNf0P_gHYy1RZ</recordid><startdate>20240918</startdate><enddate>20240918</enddate><creator>Ghione, Giorgia</creator><creator>Randazzo, Vincenzo</creator><creator>Badami, Marco</creator><creator>Pasero, Eros</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>20240918</creationdate><title>Optimizing Battery Storage Systems in Energy Microgrids: A Reinforcement Learning Approach Comparing Multiple Reward Functions</title><author>Ghione, Giorgia ; Randazzo, Vincenzo ; Badami, Marco ; Pasero, Eros</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_107617083</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Batteries</topic><topic>Battery energy storage systems</topic><topic>Deep Reinforce-ment Learning</topic><topic>Energy Microgrids</topic><topic>Microgrids</topic><topic>Optimal control</topic><topic>Optimization</topic><topic>Power generation</topic><topic>Proximal Policy Optimization</topic><topic>Reinforcement learning</topic><topic>Renewable energy sources</topic><topic>Technological innovation</topic><topic>Training</topic><topic>Uncertainty</topic><toplevel>online_resources</toplevel><creatorcontrib>Ghione, Giorgia</creatorcontrib><creatorcontrib>Randazzo, Vincenzo</creatorcontrib><creatorcontrib>Badami, Marco</creatorcontrib><creatorcontrib>Pasero, Eros</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE/IET Electronic Library</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ghione, Giorgia</au><au>Randazzo, Vincenzo</au><au>Badami, Marco</au><au>Pasero, Eros</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Optimizing Battery Storage Systems in Energy Microgrids: A Reinforcement Learning Approach Comparing Multiple Reward Functions</atitle><btitle>IEEE ... International Forum on Research and Technologies for Society and Industry (Online)</btitle><stitle>RTSI</stitle><date>2024-09-18</date><risdate>2024</risdate><spage>642</spage><epage>646</epage><pages>642-646</pages><eissn>2687-6817</eissn><eisbn>9798350362138</eisbn><abstract>Battery Storage Systems (BSS) are increasingly utilized to enhance renewable energy consumption and operational stability in energy microgrids. However, the uncertainty characterizing renewable energy generation poses significant challenges in the optimal control of BSS. Multiple Reinforcement Learning (RL) approaches have been presented to solve this optimization problem. However, a comparison of different targets for the training of the RL systems is rarely performed. This work compares different reward functions that enable efficient BSS usage in the power plant of a transport hub while expressing the problem as a partially observable Markov Decision Process (POMDP). A Proximal Policy Optimization (PPO) algorithm is trained using reward functions derived from financial targets and BSS efficiency objectives. Results indicate that reward functions aligning BSS usage with market trends lead to superior performance compared to traditional earnings-based objectives. Furthermore, limitations regarding training episode numbers and reward normalization are identified, suggesting avenues for future research. This study contributes to advancing RL-based approaches for optimal BSS management in energy microgrid environments.</abstract><pub>IEEE</pub><doi>10.1109/RTSI61910.2024.10761708</doi></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2687-6817
ispartof IEEE ... International Forum on Research and Technologies for Society and Industry (Online), 2024, p.642-646
issn 2687-6817
language eng
recordid cdi_ieee_primary_10761708
source IEEE Xplore All Conference Series
subjects Batteries
Battery energy storage systems
Deep Reinforce-ment Learning
Energy Microgrids
Microgrids
Optimal control
Optimization
Power generation
Proximal Policy Optimization
Reinforcement learning
Renewable energy sources
Technological innovation
Training
Uncertainty
title Optimizing Battery Storage Systems in Energy Microgrids: A Reinforcement Learning Approach Comparing Multiple Reward Functions
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T16%3A43%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Optimizing%20Battery%20Storage%20Systems%20in%20Energy%20Microgrids:%20A%20Reinforcement%20Learning%20Approach%20Comparing%20Multiple%20Reward%20Functions&rft.btitle=IEEE%20...%20International%20Forum%20on%20Research%20and%20Technologies%20for%20Society%20and%20Industry%20(Online)&rft.au=Ghione,%20Giorgia&rft.date=2024-09-18&rft.spage=642&rft.epage=646&rft.pages=642-646&rft.eissn=2687-6817&rft_id=info:doi/10.1109/RTSI61910.2024.10761708&rft.eisbn=9798350362138&rft_dat=%3Cieee_CHZPO%3E10761708%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-ieee_primary_107617083%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10761708&rfr_iscdi=true