Autonomous Concept Drift Threshold Determination
By: Pengqian Lu , Jie Lu , Anjin Liu and more
Potential Business Impact:
Adapts computer learning to spot changes faster.
Existing drift detection methods focus on designing sensitive test statistics. They treat the detection threshold as a fixed hyperparameter, set once to balance false alarms and late detections, and applied uniformly across all datasets and over time. However, maintaining model performance is the key objective from the perspective of machine learning, and we observe that model performance is highly sensitive to this threshold. This observation inspires us to investigate whether a dynamic threshold could be provably better. In this paper, we prove that a threshold that adapts over time can outperform any single fixed threshold. The main idea of the proof is that a dynamic strategy, constructed by combining the best threshold from each individual data segment, is guaranteed to outperform any single threshold that apply to all segments. Based on the theorem, we propose a Dynamic Threshold Determination algorithm. It enhances existing drift detection frameworks with a novel comparison phase to inform how the threshold should be adjusted. Extensive experiments on a wide range of synthetic and real-world datasets, including both image and tabular data, validate that our approach substantially enhances the performance of state-of-the-art drift detectors.
Similar Papers
Flexible and Efficient Drift Detection without Labels
Machine Learning (Stat)
Finds when computer predictions stop being right.
Improving Real-Time Concept Drift Detection using a Hybrid Transformer-Autoencoder Framework
Machine Learning (CS)
Spots changes in computer learning early.
Tackling Time-Series Forecasting Generalization via Mitigating Concept Drift
Machine Learning (CS)
Predicts future events better by spotting changing patterns.