Widest Path Games and Maximality Inheritance in Bounded Value Iteration for Stochastic Games
By: Kittiphon Phalakarn, Yun Chen Tsai, Ichiro Hasuo
Potential Business Impact:
Improves computer games with tricky goals.
For model checking stochastic games (SGs), bounded value iteration (BVI) algorithms have gained attention as efficient approximate methods with rigorous precision guarantees. However, BVI may not terminate or converge when the target SG contains end components. Most existing approaches address this issue by explicitly detecting and processing end components--a process that is often computationally expensive. An exception is the widest path-based BVI approach previously studied by Phalakarn et al., which we refer to as 1WP-BVI. The method performs particularly well in the presence of numerous end components. Nonetheless, its theoretical foundations remain somewhat ad hoc. In this paper, we identify and formalize the core principles underlying the widest path-based BVI approach by (i) presenting 2WP-BVI, a clean BVI algorithm based on (2-player) widest path games, and (ii) proving its correctness using what we call the maximality inheritance principle--a proof principle previously employed in a well-known result in probabilistic model checking. Our experimental results demonstrate the practical relevance and potential of our proposed 2WP-BVI algorithm.
Similar Papers
Sound Value Iteration for Simple Stochastic Games
CS and Game Theory
Makes computer decisions more accurate and faster.
ε-Optimally Solving Two-Player Zero-Sum POSGs
CS and Game Theory
Lets computers play games they can't fully see.
Geometric Re-Analysis of Classical MDP Solving Algorithms
Machine Learning (CS)
Makes computer learning faster and more reliable.