MCGrad: Multicalibration at Web Scale
By: Lorenzo Perini , Daniel Haimovich , Fridolin Linder and more
Potential Business Impact:
Makes computer predictions fairer for everyone.
We propose MCGrad, a novel and scalable multicalibration algorithm. Multicalibration - calibration in sub-groups of the data - is an important property for the performance of machine learning-based systems. Existing multicalibration methods have thus far received limited traction in industry. We argue that this is because existing methods (1) require such subgroups to be manually specified, which ML practitioners often struggle with, (2) are not scalable, or (3) may harm other notions of model performance such as log loss and Area Under the Precision-Recall Curve (PRAUC). MCGrad does not require explicit specification of protected groups, is scalable, and often improves other ML evaluation metrics instead of harming them. MCGrad has been in production at Meta, and is now part of hundreds of production models. We present results from these deployments as well as results on public datasets.
Similar Papers
Multiaccuracy and Multicalibration via Proxy Groups
Machine Learning (Stat)
Makes computer decisions fair even with missing data.
Auditability and the Landscape of Distance to Multicalibration
Machine Learning (CS)
Makes AI predictions more fair for everyone.
Multicalibration yields better matchings
Machine Learning (CS)
Makes computer matches better even with wrong guesses.