Score: 0

Calibration improves detection of mislabeled examples

Published: November 4, 2025 | arXiv ID: 2511.02738v1

By: Ilies Chibane , Thomas George , Pierre Nodet and more

Potential Business Impact:

Fixes computer learning mistakes in messy data.

Business Areas:

Image Recognition Data and Analytics, Software

Mislabeled data is a pervasive issue that undermines the performance of machine learning systems in real-world applications. An effective approach to mitigate this problem is to detect mislabeled instances and subject them to special treatment, such as filtering or relabeling. Automatic mislabeling detection methods typically rely on training a base machine learning model and then probing it for each instance to obtain a trust score that each provided label is genuine or incorrect. The properties of this base model are thus of paramount importance. In this paper, we investigate the impact of calibrating this model. Our empirical results show that using calibration methods improves the accuracy and robustness of mislabeled instance detection, providing a practical and effective solution for industrial applications.

Monitoring the calibration of probability forecasts with an application to concept drift detection involving image classification

Machine Learning (Stat)

Keeps computer vision accurate over time.

29 Oct 2025 0

88%

Calibration and Discrimination Optimization Using Clusters of Learned Representation

Machine Learning (CS)

Makes computer predictions more trustworthy for doctors.

22 Oct 2025 0

88%

Calibration through the Lens of Indistinguishability

Machine Learning (CS)

Makes computer guesses match real-world results.

2 Sep 2025 0

View PDF Login to Bookmark

Page Count

11 pages

Calibration improves detection of mislabeled examples

Fixes computer learning mistakes in messy data.

Technical Abstract

Monitoring the calibration of probability forecasts with an application to concept drift detection involving image classification

Calibration and Discrimination Optimization Using Clusters of Learned Representation

Calibration through the Lens of Indistinguishability