Score: 3

Handling Label Noise via Instance-Level Difficulty Modeling and Dynamic Optimization

Published: May 1, 2025 | arXiv ID: 2505.00812v3

By: Kuan Zhang, Chengliang Chai, Jingzhe Xu, and more

BigTech Affiliations: Massachusetts Institute of Technology

Potential Business Impact:

Trains accurate models from mislabeled data while cutting training compute by roughly 75%.

Business Areas:
Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Recent studies indicate that deep neural networks suffer degraded generalization under noisy supervision. Existing methods focus on isolating clean subsets or correcting noisy labels, but face limitations such as high computational cost, heavy hyperparameter tuning, and coarse-grained optimization. To address these challenges, we propose a novel two-stage noisy-learning framework that enables instance-level optimization through a dynamically weighted loss function, avoiding hyperparameter tuning. To obtain stable and accurate noise-modeling information, we introduce a simple yet effective metric, termed the wrong event, which dynamically models the cleanliness and difficulty of individual samples while keeping computational costs low. Our framework first collects wrong-event information and builds a strong base model; we then perform noise-robust training on that base model, using a probabilistic model to handle each sample's wrong-event information. Experiments on five synthetic and real-world learning-with-noisy-labels (LNL) benchmarks demonstrate that our method surpasses state-of-the-art methods in performance, achieves a nearly 75% reduction in computation time, and improves model scalability.
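
The abstract does not spell out how the wrong-event metric is computed. A plausible reading, by analogy with forgetting events, is a running count of epochs in which the model's prediction disagrees with a sample's given label; samples that accumulate many wrong events are treated as likely noisy or hard. Below is a minimal PyTorch sketch under that assumption. The function names, the `wrong_events` counter, and the exponential down-weighting are illustrative assumptions, not the paper's exact formulation (the paper uses a probabilistic model over wrong-event statistics rather than a fixed weighting).

```python
import torch
import torch.nn.functional as F

# Hypothetical stage 1: train a base model while counting per-sample "wrong events".
# Assumed definition: a wrong event is an epoch in which the model's prediction
# disagrees with the sample's given (possibly noisy) label.
def train_and_count_wrong_events(model, loader, optimizer, num_samples, epochs):
    wrong_events = torch.zeros(num_samples)  # one counter per training sample
    for _ in range(epochs):
        for x, y, idx in loader:  # loader must yield each sample's dataset index
            logits = model(x)
            loss = F.cross_entropy(logits, y)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
            with torch.no_grad():  # record disagreements for this epoch
                wrong_events[idx] += (logits.argmax(dim=1) != y).float().cpu()
    return wrong_events

# Hypothetical stage 2: noise-robust training with a dynamically weighted loss.
# Samples with many wrong events are down-weighted; the exponential map is an
# illustrative stand-in for the paper's probabilistic treatment.
def noise_robust_step(model, x, y, idx, wrong_events, epochs_seen, optimizer):
    weights = torch.exp(-wrong_events[idx] / max(epochs_seen, 1)).to(x.device)
    per_sample = F.cross_entropy(model(x), y, reduction="none")
    loss = (weights * per_sample).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Because the counters are updated with a single comparison per batch that training already computes, this kind of bookkeeping adds negligible overhead, which is consistent with the abstract's claim of maintaining low computational cost.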

Country of Origin
πŸ‡ΊπŸ‡Έ πŸ‡¨πŸ‡³ United States, China

Page Count
20 pages

Category
Computer Science:
Machine Learning (CS)