Score: 0

Strategy Improvement, the Simplex Algorithm and Lopsidedness

Published: September 19, 2025 | arXiv ID: 2509.16075v1

By: Matthew Maat

Potential Business Impact:

Links game solving to a faster math trick.

Business Areas:

Social Entrepreneurship Community and Lifestyle

The strategy improvement algorithm for mean payoff games and parity games is a local improvement algorithm, just like the simplex algorithm for linear programs. Their similarity has turned out very useful: many lower bounds on running time for the simplex method have been created from lower bounds for strategy improvement. However, earlier connections between these algorithms required constructing an intermediate Markov decision process, which is not always possible. We prove a formal, direct connection between the two algorithms, showing that many variants of strategy improvement for parity and mean payoff games are truly an instance of the simplex algorithm, under mild nondegeneracy assumptions. As a result of this, we derive some combinatorial properties of the structure of strategy sets of various related games on graphs. In particular, we show a connection to lopsided sets.

Locally Optimal Solutions for Integer Programming Games

CS and Game Theory

Finds better game answers faster than before.

26 Mar 2025 0

84%

Study and improvement of search algorithms in two-players perfect information games

Artificial Intelligence

New AI plays many games better and faster.

6 May 2025 1

84%

Choosing What Game to Play without Selecting Equilibria: Inferring Safe (Pareto) Improvements in Binary Constraint Structures

CS and Game Theory

Helps pick the best game for everyone to play.

26 Nov 2025 0

View PDF Login to Bookmark

Page Count

19 pages

Strategy Improvement, the Simplex Algorithm and Lopsidedness

Links game solving to a faster math trick.

Technical Abstract

Locally Optimal Solutions for Integer Programming Games

Study and improvement of search algorithms in two-players perfect information games

Choosing What Game to Play without Selecting Equilibria: Inferring Safe (Pareto) Improvements in Binary Constraint Structures