Adapting Skill Ratings to Luck-Based Hidden-Information Games
By: Avirup Chakraborty , Shirsa Maitra , Tathagata Banerjee and more
Rating systems play a crucial role in evaluating player skill across competitive environments. The Elo rating system, originally designed for deterministic and information-complete games such as chess, has been widely adopted and modified in various domains. However, the traditional Elo rating system only considers game outcomes for rating calculation and assumes uniform initial states across players. This raises important methodological challenges in skill modelling for popular partially randomized incomplete-information games such as Rummy. In this paper, we examine the limitations of conventional Elo ratings when applied to luck-driven environments and propose a modified Elo framework specifically tailored for Rummy. Our approach incorporates score-based performance metrics and explicitly models the influence of initial hand quality to disentangle skill from luck. Through extensive simulations involving 270,000 games across six strategies of varying sophistication, we demonstrate that our proposed system achieves stable convergence, superior discriminative power, and enhanced predictive accuracy compared to traditional Elo formulations. The framework maintains computational simplicity while effectively capturing the interplay of skill, strategy, and randomness, with broad applicability to other stochastic competitive environments.
Similar Papers
Quantifying Skill and Chance: A Unified Framework for the Geometry of Games
Artificial Intelligence
Measures how much skill, not luck, wins games.
Rating competitors in games with strength-dependent tie probabilities
Methodology
Makes game ratings fairer by counting ties.
Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
Computation and Language
Makes AI rating systems fairer by understanding "draws."