Loading…
Privacy-Cost Management in Smart Meters With Mutual-Information-Based Reinforcement Learning
The rapid development and expansion of the Internet of Things (IoT) paradigm has drastically increased the collection and exchange of data between sensors and systems, a phenomenon that raises serious privacy concerns in some domains. In particular, smart meters (SMs) share fine-grained electricity...
Saved in:
Published in: | IEEE internet of things journal 2022-11, Vol.9 (22), p.22389-22398 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The rapid development and expansion of the Internet of Things (IoT) paradigm has drastically increased the collection and exchange of data between sensors and systems, a phenomenon that raises serious privacy concerns in some domains. In particular, smart meters (SMs) share fine-grained electricity consumption of households with utility providers that can potentially violate users' privacy as sensitive information is leaked through the data. In order to enhance privacy, electricity consumers can exploit the availability of physical resources such as a rechargeable battery (RB) to shape their power demand as dictated by a privacy-cost management unit (PCMU). In this article, we present a novel method to learn the PCMU policy using deep reinforcement learning (DRL). We adopt the mutual information (MI) between the user's demand load and the masked load seen by the power grid as a reliable and general privacy measure. Unlike previous studies, we model the whole temporal correlation in the data to learn the MI in its general form and use a neural network to estimate the MI-based reward signal to guide the PCMU learning process. This approach is combined with a model-free DRL algorithm known as the deep double {Q} -learning (DDQL) method. The performance of the complete DDQL-MI algorithm is assessed empirically using an actual SMs data set and compared with simpler privacy measures. Our results show significant improvements over state-of-the-art privacy-aware demand shaping methods. |
---|---|
ISSN: | 2327-4662 2372-2541 2327-4662 |
DOI: | 10.1109/JIOT.2021.3128488 |