Reasoning Under Uncertainty: Exploring Probabilistic Reasoning Capabilities of LLMs
By: Mobina Pournemat , Keivan Rezaei , Gaurang Sriramanan and more
Potential Business Impact:
Helps computers understand and use probability better.
Despite widespread success in language understanding and generation, large language models (LLMs) exhibit unclear and often inconsistent behavior when faced with tasks that require probabilistic reasoning. In this work, we present the first comprehensive study of the reasoning capabilities of LLMs over explicit discrete probability distributions. Given observations from a probability distribution, we evaluate models on three carefully designed tasks, mode identification, maximum likelihood estimation, and sample generation, by prompting them to provide responses to queries about either the joint distribution or its conditionals. These tasks thus probe a range of probabilistic skills, including frequency analysis, marginalization, and generative behavior. Through comprehensive empirical evaluations, we demonstrate that there exists a clear performance gap between smaller and larger models, with the latter demonstrating stronger inference and surprising capabilities in sample generation. Furthermore, our investigations reveal notable limitations, including sensitivity to variations in the notation utilized to represent probabilistic outcomes and performance degradation of over 60% as context length increases. Together, our results provide a detailed understanding of the probabilistic reasoning abilities of LLMs and identify key directions for future improvement.
Similar Papers
Exploring the Potential for Large Language Models to Demonstrate Rational Probabilistic Beliefs
Artificial Intelligence
Makes AI understand "maybe" better for trust.
On the Reasoning Capacity of AI Models and How to Quantify It
Artificial Intelligence
Shows how AI *really* thinks, not just gets answers.
Reasoning Capabilities and Invariability of Large Language Models
Computation and Language
Tests if computers can think logically.