Loading…
Asymptotically Optimal Strategies for Adaptive Zero-Sum Discounted Markov Games
We consider a class of discrete-time two person zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs. The game evolves according to the recursive equation ..., where the disturbance process ... is formed by independent and identically distributed R^sup k^-valued r...
Saved in:
Published in: | SIAM journal on control and optimization 2009-01, Vol.48 (3), p.1405-1421 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | We consider a class of discrete-time two person zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs. The game evolves according to the recursive equation ..., where the disturbance process ... is formed by independent and identically distributed R^sup k^-valued random vectors, which are observable but whose common density ρ is unknown to both players. Under certain continuity and compactness conditions, we combine a nonstationary iteration procedure and suitable density estimation methods to construct asymptotically discounted optimal strategies for both players. [PUBLICATION ABSTRACT] |
---|---|
ISSN: | 0363-0129 1095-7138 |
DOI: | 10.1137/060651458 |