Interpretable Probability Estimation with LLMs via Shapley Reconstruction
By: Yang Nan, Qihao Wen, Jiahao Wang, and more
Large Language Models (LLMs) demonstrate the potential to estimate the probability of uncertain events by leveraging their extensive knowledge and reasoning capabilities. This ability can support intelligent decision-making across diverse fields, such as financial forecasting and preventive healthcare. However, directly prompting LLMs for probability estimation faces significant challenges: their outputs are often noisy, and the underlying prediction process is opaque. In this paper, we propose PRISM: Probability Reconstruction via Shapley Measures, a framework that brings transparency and precision to LLM-based probability estimation. PRISM decomposes an LLM's prediction by quantifying the marginal contribution of each input factor using Shapley values. These factor-level contributions are then aggregated to reconstruct a calibrated final estimate. In our experiments, we demonstrate that PRISM improves predictive accuracy over direct prompting and other baselines across multiple domains, including finance, healthcare, and agriculture. Beyond performance, PRISM provides a transparent prediction pipeline: our case studies visualize how individual factors shape the final estimate, helping build trust in LLM-based decision support systems.
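To make the Shapley-based decomposition concrete, here is a minimal Python sketch. It is not the paper's implementation: the value function `mock_llm_probability` is a hypothetical stand-in for prompting an LLM with only a subset of factors, and the paper's actual prompting, calibration, and aggregation steps are not reproduced here. The Shapley formula itself is standard: each factor's contribution is its weighted marginal effect on the estimate, averaged over all subsets of the other factors.

```python
from itertools import combinations
from math import factorial

def shapley_contributions(factors, value_fn):
    """Exact Shapley values: the marginal contribution of each factor
    to the probability estimate produced by value_fn(subset)."""
    n = len(factors)
    phi = {f: 0.0 for f in factors}
    for f in factors:
        others = [g for g in factors if g != f]
        for k in range(len(others) + 1):
            for subset in combinations(others, k):
                # Standard Shapley weight: |S|! (n - |S| - 1)! / n!
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                s = frozenset(subset)
                phi[f] += weight * (value_fn(s | {f}) - value_fn(s))
    return phi

# Hypothetical stand-in for an LLM call: returns a probability estimate
# given only the factors in `subset`. A real pipeline would prompt the
# model and cache responses, since the loop above queries 2^n subsets.
def mock_llm_probability(subset):
    base = 0.30  # estimate when no factors are shown
    effects = {"rainfall": 0.15, "soil_quality": 0.10, "pest_report": -0.08}
    return base + sum(effects[f] for f in subset)

factors = ["rainfall", "soil_quality", "pest_report"]
phi = shapley_contributions(factors, mock_llm_probability)

# Efficiency property: the baseline plus the summed contributions
# reconstructs the full-information estimate.
reconstructed = mock_llm_probability(frozenset()) + sum(phi.values())
print(phi)           # per-factor contributions
print(reconstructed) # equals mock_llm_probability with all factors
```

Because exact Shapley computation requires 2^n value-function evaluations, a practical system with many factors would likely rely on sampling-based approximations; the sketch above only illustrates the decomposition and reconstruction that the abstract describes.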
Similar Papers
Measuring What LLMs Think They Do: SHAP Faithfulness and Deployability on Financial Tabular Classification
Machine Learning (CS)
Makes AI explain financial risks more honestly.
Evaluating the Use of Large Language Models as Synthetic Social Agents in Social Science Research
Artificial Intelligence
Makes AI better at guessing, not knowing for sure.
Document Valuation in LLM Summaries: A Cluster Shapley Approach
Computation and Language
Gives credit to sources used in AI summaries.