Empirical parameterization of the Elo Rating System
By: Shirsa Maitra, Tathagata Banerjee, Anushka De and more
This study presents a data-driven approach to empirically tuning and validating rating systems, with a focus on the Elo system. Well-known rating frameworks such as Elo, Glicko, and TrueSkill rely on parameters that are usually chosen from probabilistic assumptions or convention rather than from game-specific data. To address this, we propose a methodology that learns optimal parameter values by maximizing the predictive accuracy of match outcomes. The parameter-tuning framework is general: with a suitable modification of the parameter space, it can be extended to any rating system, including multiplayer setups. Applying the tuned rating system to real and simulated gameplay data demonstrates its suitability for modeling player performance.
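To make the idea of tuning by predictive accuracy concrete, here is a minimal sketch, not the authors' implementation. It assumes the only tuned parameter is the Elo K-factor, that matches have no draws, and that a plain grid search is an acceptable optimizer. The abstract does not specify the parameter space or the search procedure, so the function names (`expected_score`, `run_elo`, `tune_k`), the initial rating of 1500, and the scale of 400 are illustrative assumptions.

```python
def expected_score(r_a, r_b, scale=400.0):
    """Standard Elo win probability for player A against player B."""
    return 1.0 / (1.0 + 10.0 ** ((r_b - r_a) / scale))


def run_elo(matches, k, scale=400.0, initial=1500.0):
    """Replay matches in order and score each pre-match prediction.

    matches: iterable of (player_a, player_b, score_a), score_a in {1.0, 0.0}
             (assumption: no draws).
    Returns (accuracy, final_ratings). A 50/50 prediction earns half credit.
    """
    ratings = {}
    correct = 0.0
    n = 0
    for a, b, score_a in matches:
        r_a = ratings.get(a, initial)
        r_b = ratings.get(b, initial)
        p_a = expected_score(r_a, r_b, scale)

        # Score the prediction made BEFORE the ratings are updated.
        if p_a == 0.5:
            correct += 0.5
        elif (p_a > 0.5) == (score_a == 1.0):
            correct += 1.0
        n += 1

        # Symmetric Elo update with K-factor k.
        delta = k * (score_a - p_a)
        ratings[a] = r_a + delta
        ratings[b] = r_b - delta
    return (correct / n if n else 0.0), ratings


def tune_k(matches, k_grid, scale=400.0):
    """Grid search (an assumed optimizer): pick the K with the best accuracy."""
    matches = list(matches)
    best_k, best_acc = None, -1.0
    for k in k_grid:
        acc, _ = run_elo(matches, k, scale)
        if acc > best_acc:  # ties keep the earlier grid value
            best_k, best_acc = k, acc
    return best_k, best_acc


if __name__ == "__main__":
    # Hypothetical toy data: player "a" beats "b" twice.
    toy = [("a", "b", 1.0), ("a", "b", 1.0)]
    print(tune_k(toy, [0, 16]))
```

In practice one would evaluate accuracy on held-out matches rather than the data used for tuning, and a smoother criterion such as log loss may be easier to optimize. Both choices are left open here because the abstract does not state them.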
Similar Papers
Research Power Ranking: Adapting the Elo System to Quantify Scientist Evaluation
Physics and Society
Ranks scientists by how good their work is.
am-ELO: A Stable Framework for Arena-based LLM Evaluation
Artificial Intelligence
Makes AI judging fairer and more reliable.
PandaSkill - Player Performance and Skill Rating in Esports: Application to League of Legends
Machine Learning (CS)
Rates gamers better by watching their moves.