Learning Where to Learn: Training Distribution Selection for Provable OOD Performance
By: Nicolas Guerra, Nicholas H. Nelsen, Yunan Yang
Potential Business Impact:
Teaches computers to work with new, different data.
Out-of-distribution (OOD) generalization remains a fundamental challenge in machine learning: models trained on one data distribution often degrade substantially when evaluated on shifted or unseen domains. To address this challenge, the paper studies the design of training data distributions that maximize average-case OOD performance. First, a theoretical analysis establishes a family of generalization bounds that quantify how the choice of training distribution influences OOD error across a predefined family of target distributions. These insights motivate two complementary algorithmic strategies: (i) directly formulating OOD risk minimization as a bilevel optimization problem over the space of probability measures and (ii) minimizing a theoretical upper bound on the OOD error. Finally, the paper evaluates both approaches on a range of function approximation and operator learning examples, where the proposed methods significantly improve OOD accuracy over standard empirical risk minimization with a fixed training distribution. These results highlight distribution-aware training as a principled and practical framework for robust OOD generalization.
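To make strategy (i) concrete, here is a minimal, hypothetical sketch of the bilevel idea: an outer loop searches over training distributions nu (parameterized here as mixture weights over two candidate source distributions) to minimize the OOD risk averaged over a family of target distributions, while an inner step performs ordinary empirical risk minimization on samples drawn from nu. The toy 1-D regression task, the mixture parameterization, and all function names below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def target_fn(x):
    # Ground-truth function to learn (an illustrative assumption).
    return np.sin(2 * np.pi * x)

def fit_ridge(x, y, lam=1e-3, deg=8):
    # Inner (lower-level) problem: empirical risk minimization on data from nu,
    # here ridge regression on polynomial features.
    Phi = np.vander(x, deg + 1, increasing=True)
    return np.linalg.solve(Phi.T @ Phi + lam * np.eye(deg + 1), Phi.T @ y)

def risk(w, mu_sampler, n=2000, deg=8):
    # OOD risk of the trained model under one target distribution mu.
    x = mu_sampler(n)
    Phi = np.vander(x, deg + 1, increasing=True)
    return np.mean((Phi @ w - target_fn(x)) ** 2)

# Candidate "source" regions whose mixture defines the training distribution nu.
sources = [lambda n: rng.uniform(0.0, 0.5, n),
           lambda n: rng.uniform(0.5, 1.0, n)]
# Predefined family of target (test-time) distributions; OOD risk is averaged over it.
targets = [lambda n: rng.uniform(0.1, 0.9, n),
           lambda n: rng.beta(2.0, 5.0, n)]

best = None
for alpha in np.linspace(0.0, 1.0, 21):  # outer (upper-level) search over nu
    def nu(n, a=alpha):
        pick = rng.random(n) < a
        return np.where(pick, sources[0](n), sources[1](n))
    x_tr = nu(500)
    y_tr = target_fn(x_tr) + 0.05 * rng.standard_normal(500)
    w = fit_ridge(x_tr, y_tr)
    avg_risk = np.mean([risk(w, mu) for mu in targets])  # average-case OOD risk
    if best is None or avg_risk < best[0]:
        best = (avg_risk, alpha)

print(f"best mixture weight alpha = {best[1]:.2f}, average OOD risk = {best[0]:.4f}")
```

In a less toy setting, the grid search over the mixture weight would presumably be replaced by gradient-based updates of the distribution parameters, and strategy (ii) would swap the inner evaluation for the paper's theoretical upper bound on OOD error rather than sampled target risks.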
Similar Papers
A Closer Look at the Learnability of Out-of-Distribution (OOD) Detection
Machine Learning (CS)
Helps computers spot weird, new information.
Out-of-Distribution Generalization in Time Series: A Survey
Machine Learning (CS)
Helps computers learn from changing data better.
Towards More Trustworthy Deep Code Models by Enabling Out-of-Distribution Detection
Software Engineering
Helps computers spot code they weren't trained on.