Loading…
On strategy improvement algorithms for simple stochastic games
The study of simple stochastic games (SSGs) was initiated by Condon for analyzing the computational power of randomized space-bounded alternating Turing machines. The game is played by two players, MAX and MIN, on a directed multigraph, and when the play terminates at a sink vertex s, MAX wins from...
Saved in:
Published in: | Journal of discrete algorithms (Amsterdam, Netherlands) Netherlands), 2011-09, Vol.9 (3), p.263-278 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The study of
simple stochastic games (SSGs) was initiated by Condon for analyzing the computational power of randomized space-bounded alternating Turing machines. The game is played by two players, MAX and MIN, on a directed multigraph, and when the play terminates at a sink vertex
s, MAX wins from MIN a payoff
p
(
s
)
∈
[
0
,
1
]
. Condon proved that the problem SSG-VALUE—given a SSG, determine whether the expected payoff won by MAX is greater than 1/2 when both players use their optimal strategies—is in
NP
∩
coNP
. However, the exact complexity of this problem remains open, as it is not known whether the problem is in P or is hard for some natural complexity class. In this paper, we study the computational complexity of a strategy improvement algorithm by Hoffman and Karp for this problem. The Hoffman–Karp algorithm converges to optimal strategies of a given SSG, but no non-trivial bounds were previously known on its running time. We prove a bound of
O
(
2
n
/
n
)
on the convergence time of the Hoffman–Karp algorithm, and a bound of
O
(
2
0.78
n
)
on a randomized variant. These are the first non-trivial upper bounds on the convergence time of these strategy improvement algorithms. |
---|---|
ISSN: | 1570-8667 1570-8675 |
DOI: | 10.1016/j.jda.2011.03.007 |