In-Context Learning Is Provably Bayesian Inference: A Generalization Theory for Meta-Learning
By: Tomoya Wakayama, Taiji Suzuki
Potential Business Impact:
Teaches computers to learn new tasks faster.
This paper develops a finite-sample statistical theory for in-context learning (ICL), analyzed within a meta-learning framework that accommodates mixtures of diverse task types. We introduce a principled risk decomposition that separates the total ICL risk into two orthogonal components: Bayes Gap and Posterior Variance. The Bayes Gap quantifies how well the trained model approximates the Bayes-optimal in-context predictor. For a uniform-attention Transformer, we derive a non-asymptotic upper bound on this gap, which explicitly clarifies the dependence on the number of pretraining prompts and their context length. The Posterior Variance is a model-independent risk representing the intrinsic task uncertainty. Our key finding is that this term is determined solely by the difficulty of the true underlying task, while the uncertainty arising from the task mixture vanishes exponentially fast with only a few in-context examples. Together, these results provide a unified view of ICL: the Transformer selects the optimal meta-algorithm during pretraining and rapidly converges to the optimal algorithm for the true task at test time.
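A minimal schematic of the risk decomposition described above, assuming squared loss and illustrative notation not taken from the paper (here $\hat f$ is the trained in-context predictor, $D_k$ the $k$ in-context examples, and $f^{\mathrm{Bayes}}(x; D_k) = \mathbb{E}[y \mid x, D_k]$ the Bayes-optimal in-context predictor):

% Schematic sketch only; symbols are assumptions, not the paper's own notation.
\[
\mathcal{R}_{\mathrm{ICL}}(\hat f)
  \;=\;
  \underbrace{\mathbb{E}\!\left[\big(\hat f(x; D_k) - f^{\mathrm{Bayes}}(x; D_k)\big)^{2}\right]}_{\text{Bayes Gap (model-dependent)}}
  \;+\;
  \underbrace{\mathbb{E}\!\left[\operatorname{Var}\!\left(y \mid x, D_k\right)\right]}_{\text{Posterior Variance (model-independent)}}
\]

Under squared loss the cross term vanishes because $f^{\mathrm{Bayes}}$ is the posterior mean, so the total risk splits exactly into a model-dependent approximation term and an intrinsic-uncertainty term, mirroring the two components analyzed in the abstract.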
Similar Papers
In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality
Machine Learning (Stat)
Teaches computers to learn from examples faster.
Scaling Laws and In-Context Learning: A Unified Theoretical Framework
Machine Learning (CS)
Makes AI learn new things faster with more data.