Loading…

Low Resource-Reallocation Defense Strategies for Repeated Security Games with No Prior Knowledge and Limited Observability

This paper takes into account general repeated security games with no prior knowledge, i.e., the game payoffs and the attacker's behavior model are unknown, and limited observability. Besides the traditional "regret" criterion", reallocation times" is introduced as an additi...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on cognitive and developmental systems 2023-12, Vol.15 (4), p.1-1
Main Authors: Zhu, Jin, Zhang, Jinglong, Ling, Qiang, Dullerud, Geir E.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper takes into account general repeated security games with no prior knowledge, i.e., the game payoffs and the attacker's behavior model are unknown, and limited observability. Besides the traditional "regret" criterion", reallocation times" is introduced as an additional criterion that provides a more comprehensive evaluation of the defense strategies. For such games, a novel Random-Walk Perturbations with Uniform Exploration (RWP-UE) algorithm is proposed and we deduce the corresponding upper bound of the expected regret and expected reallocation times. Theoretical analysis shows that the RWP-UE algorithm achieves not only low regret with the same magnitude as existing achievements but also fewer reallocation times. Experiments are carried out against four types of attackers, and the results illustrate that the RWP-UE algorithm achieves superior performance.
ISSN:2379-8920
2379-8939
DOI:10.1109/TCDS.2023.3241364