Calibration Bands for Mean Estimates within the Exponential Dispersion Family
By: Łukasz Delong, Selim Gatti, Mario V. Wüthrich
Potential Business Impact:
Tests whether a model's predicted averages can be trusted.
A statistical model is said to be calibrated if the resulting mean estimates perfectly match the true means of the underlying responses. Perfect calibration is often not achievable in practice because one has to deal with finite samples of noisy observations. A weaker notion of calibration is auto-calibration. A model is auto-calibrated if the expected value of all responses that receive the same mean estimate equals that estimate. Testing for auto-calibration has only recently been considered in the literature, and we propose a new approach based on calibration bands. Calibration bands are a set of lower and upper bounds such that the probability that the true means lie simultaneously inside those bounds exceeds some given confidence level. Such bands were constructed by Yang and Barber (2019) for sub-Gaussian distributions. Dimitriadis et al. (2023) then introduced narrower bands for the Bernoulli distribution, and we use the same idea to extend the construction to the entire exponential dispersion family, which contains, for example, the binomial, Poisson, negative binomial, gamma and normal distributions. Moreover, we show that the obtained calibration bands allow us to construct various tests for calibration and auto-calibration, respectively.
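The three notions described in the abstract can be written compactly. The following is a sketch in notation that is assumed here for illustration and is not taken from the paper: $Y$ is the response, $X$ the covariates, $\mu(X) = \mathbb{E}[Y \mid X]$ the true mean, $\hat{\mu}(X)$ the mean estimate, $(L_i, U_i)$ the lower and upper bounds for observation $i = 1, \dots, n$, and $1 - \delta$ the confidence level.

```latex
% Calibration: the mean estimate equals the true mean (almost surely).
\hat{\mu}(X) = \mu(X) = \mathbb{E}[Y \mid X]

% Auto-calibration: among all responses sharing the same mean estimate,
% the expected response equals that estimate (almost surely).
\mathbb{E}\big[\, Y \mid \hat{\mu}(X) \,\big] = \hat{\mu}(X)

% Calibration bands: the bounds cover all true means simultaneously
% with probability at least the given confidence level.
\mathbb{P}\big(\, L_i \le \mu(x_i) \le U_i \ \text{ for all } i = 1, \dots, n \,\big) \ge 1 - \delta
```

Under this reading, calibration implies auto-calibration, but not the other way round: a constant estimate equal to the overall mean $\mathbb{E}[Y]$ is auto-calibrated even though it ignores the covariates entirely.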
Similar Papers
Universal Inference for Testing Calibration of Mean Estimates within the Exponential Dispersion Family
Applications
Makes predictions more trustworthy for important decisions.
Simultaneous Nonparametric Confidence Bands for Load-Sharing Systems
Methodology
Makes machines last longer by predicting failures.
Measuring multi-calibration
Methodology
Measures how well predictions work for groups.