ALINE: Joint Amortization for Bayesian Inference and Active Data Acquisition
By: Daolang Huang , Xinyi Wen , Ayush Bharti and more
Potential Business Impact:
Finds the best information to learn quickly.
Many critical applications, from autonomous scientific discovery to personalized medicine, demand systems that can both strategically acquire the most informative data and instantaneously perform inference based upon it. While amortized methods for Bayesian inference and experimental design offer part of the solution, neither approach is optimal in the most general and challenging task, where new data needs to be collected for instant inference. To tackle this issue, we introduce the Amortized Active Learning and Inference Engine (ALINE), a unified framework for amortized Bayesian inference and active data acquisition. ALINE leverages a transformer architecture trained via reinforcement learning with a reward based on self-estimated information gain provided by its own integrated inference component. This allows it to strategically query informative data points while simultaneously refining its predictions. Moreover, ALINE can selectively direct its querying strategy towards specific subsets of model parameters or designated predictive tasks, optimizing for posterior estimation, data prediction, or a mixture thereof. Empirical results on regression-based active learning, classical Bayesian experimental design benchmarks, and a psychometric model with selectively targeted parameters demonstrate that ALINE delivers both instant and accurate inference along with efficient selection of informative points.
Similar Papers
Amortized Safe Active Learning for Real-Time Data Acquisition: Pretrained Neural Policies from Simulated Nonparametric Functions
Machine Learning (CS)
Teaches robots to learn safely and fast.
JADAI: Jointly Amortizing Adaptive Design and Bayesian Inference
Machine Learning (Stat)
Finds best experiments to learn things faster.
Amortized In-Context Bayesian Posterior Estimation
Machine Learning (CS)
Teaches computers to guess answers faster.