Score: 0

Unregularized Linear Convergence in Zero-Sum Game from Preference Feedback

Published: December 31, 2025 | arXiv ID: 2512.24818v1

By: Shulun Chen , Runlong Zhou , Zihan Zhang and more

Aligning large language models (LLMs) with human preferences has proven effective for enhancing model capabilities, yet standard preference modeling using the Bradley-Terry model assumes transitivity, overlooking the inherent complexity of human population preferences. Nash learning from human feedback (NLHF) addresses this by framing non-transitive preferences as a two-player zero-sum game, where alignment reduces to finding the Nash equilibrium (NE). However, existing algorithms typically rely on regularization, incurring unavoidable bias when computing the duality gap in the original game. In this work, we provide the first convergence guarantee for Optimistic Multiplicative Weights Update ($\mathtt{OMWU}$) in NLHF, showing that it achieves last-iterate linear convergence after a burn-in phase whenever an NE with full support exists, with an instance-dependent linear convergence rate to the original NE, measured by duality gaps. Compared to prior results in Wei et al. (2020), we do not require the assumption of NE uniqueness. Our analysis identifies a novel marginal convergence behavior, where the probability of rarely played actions grows exponentially from exponentially small values, enabling exponentially better dependence on instance-dependent constants than prior results. Experiments corroborate the theoretical strengths of $\mathtt{OMWU}$ in both tabular and neural policy classes, demonstrating its potential for LLM applications.

Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees

Machine Learning (CS)

Teaches AI to have better, longer talks.

18 Feb 2025 2

89%

Breaking $1/ε$ Barrier in Quantum Zero-Sum Games: Generalizing Metric Subregularity for Spectraplexes

CS and Game Theory

Makes quantum games solve problems faster than before.

25 Sep 2025 0

89%

Pointwise Convergence in Games with Conflicting Interest

CS and Game Theory

Helps game players find fair outcomes faster.

21 May 2025 0

View PDF Login to Bookmark

Unregularized Linear Convergence in Zero-Sum Game from Preference Feedback

Technical Abstract

Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees

Breaking $1/ε$ Barrier in Quantum Zero-Sum Games: Generalizing Metric Subregularity for Spectraplexes

Pointwise Convergence in Games with Conflicting Interest