Pointwise Convergence in Games with Conflicting Interest
By: Nanxiang Zhou, Jing Dong, Baoxiang Wang
Potential Business Impact:
Helps game players find fair outcomes faster.
In this work, we introduce the concept of non-negative weighted regret, an extension of non-negative regret \cite{anagnostides2022last} in games. Investigating games with non-negative weighted regret helps us to understand games with conflicting interests, including harmonic games and important classes of zero-sum games.We show that optimistic variants of classical no-regret learning algorithms, namely optimistic mirror descent (OMD) and optimistic follow the regularized leader (OFTRL), converge to an $\epsilon$-approximate Nash equilibrium at a rate of $O(1/\epsilon^2)$.Consequently, they guarantee pointwise convergence to a Nash equilibrium if there are only finitely many Nash equilibria in the game. These algorithms are robust in the sense the convergence holds even if the players deviate Our theoretical findings are supported by empirical evaluations of OMD and OFTRL on the game of matching pennies and harmonic game instances.
Similar Papers
Unregularized Linear Convergence in Zero-Sum Game from Preference Feedback
Machine Learning (CS)
Teaches computers to learn from mistakes better.
Scale-Invariant Regret Matching and Online Learning with Optimal Convergence: Bridging Theory and Practice in Zero-Sum Games
CS and Game Theory
Makes game AI learn faster and better.
Convergence of Regret Matching in Potential Games and Constrained Optimization
CS and Game Theory
Makes AI learn faster for difficult problems.