Using Reinforcement Learning to Train Large Language Models to Explain Human Decisions
By: Jian-Qiao Zhu, Hanbo Xie, Dilip Arumugam, and more
Potential Business Impact:
Trains AI models to explain why people make risky choices.
A central goal of cognitive modeling is to develop models that not only predict human behavior but also provide insight into the underlying cognitive mechanisms. While neural network models trained on large-scale behavioral data often achieve strong predictive performance, they typically fall short in offering interpretable explanations of the cognitive processes they capture. In this work, we explore the potential of pretrained large language models (LLMs) to serve as dual-purpose cognitive models--capable of both accurate prediction and interpretable explanation in natural language. Specifically, we employ reinforcement learning with outcome-based rewards to guide LLMs toward generating explicit reasoning traces for explaining human risky choices. Our findings demonstrate that this approach produces high-quality explanations alongside strong quantitative predictions of human decisions.
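To make the approach concrete, here is a minimal illustrative sketch (not the paper's actual code) of what an outcome-based reward might look like: the LLM produces a free-text reasoning trace that must end in a predicted choice, and the reward simply checks whether that prediction matches the human's observed choice. The "Answer: A/B" output format and the binary 0/1 reward are assumptions for illustration only.

    # Hypothetical sketch of an outcome-based reward for RL fine-tuning.
    # The model's reasoning trace is rewarded only for its final predicted
    # choice, not for the wording of the explanation itself.
    import re

    def outcome_reward(model_output: str, human_choice: str) -> float:
        """Return 1.0 if the trace's final answer matches the human's
        observed choice (e.g. 'A' or 'B'), else 0.0."""
        match = re.search(r"Answer:\s*([AB])", model_output)
        if match is None:
            return 0.0  # malformed trace: no parsable final answer
        return 1.0 if match.group(1) == human_choice else 0.0

    # Example: a gamble between options A and B where the human chose A.
    trace = "Option B is riskier, and losses loom larger... Answer: A"
    print(outcome_reward(trace, "A"))  # 1.0

Because only the outcome is scored, the model is free to discover whatever natural-language explanation best supports accurate predictions of human choices.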
Similar Papers
Reinforcement Learning Meets Large Language Models: A Survey of Advancements and Applications Across the LLM Lifecycle
Computation and Language
Teaches computers to think and follow instructions better.
From Stimuli to Minds: Enhancing Psychological Reasoning in LLMs via Bilateral Reinforcement Learning
Databases
Teaches computers to understand feelings and thoughts.