Loading…

Privacy-Cost Management in Smart Meters With Mutual-Information-Based Reinforcement Learning

The rapid development and expansion of the Internet of Things (IoT) paradigm has drastically increased the collection and exchange of data between sensors and systems, a phenomenon that raises serious privacy concerns in some domains. In particular, smart meters (SMs) share fine-grained electricity...

Full description

Saved in:
Bibliographic Details
Published in:IEEE internet of things journal 2022-11, Vol.9 (22), p.22389-22398
Main Authors: Shateri, Mohammadhadi, Messina, Francisco, Piantanida, Pablo, Labeau, Fabrice
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The rapid development and expansion of the Internet of Things (IoT) paradigm has drastically increased the collection and exchange of data between sensors and systems, a phenomenon that raises serious privacy concerns in some domains. In particular, smart meters (SMs) share fine-grained electricity consumption of households with utility providers that can potentially violate users' privacy as sensitive information is leaked through the data. In order to enhance privacy, electricity consumers can exploit the availability of physical resources such as a rechargeable battery (RB) to shape their power demand as dictated by a privacy-cost management unit (PCMU). In this article, we present a novel method to learn the PCMU policy using deep reinforcement learning (DRL). We adopt the mutual information (MI) between the user's demand load and the masked load seen by the power grid as a reliable and general privacy measure. Unlike previous studies, we model the whole temporal correlation in the data to learn the MI in its general form and use a neural network to estimate the MI-based reward signal to guide the PCMU learning process. This approach is combined with a model-free DRL algorithm known as the deep double {Q} -learning (DDQL) method. The performance of the complete DDQL-MI algorithm is assessed empirically using an actual SMs data set and compared with simpler privacy measures. Our results show significant improvements over state-of-the-art privacy-aware demand shaping methods.
ISSN:2327-4662
2372-2541
2327-4662
DOI:10.1109/JIOT.2021.3128488