Score: 0

Can we use LLMs to bootstrap reinforcement learning? -- A case study in digital health behavior change

Published: November 19, 2025 | arXiv ID: 2511.17630v1

By: Nele Albers , Esra Cemre Su de Groot , Loes Keijsers and more

Potential Business Impact:

Helps apps learn how to help people change habits.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Personalizing digital applications for health behavior change is a promising route to making them more engaging and effective. This especially holds for approaches that adapt to users and their specific states (e.g., motivation, knowledge, wants) over time. However, developing such approaches requires making many design choices, whose effectiveness is difficult to predict from literature and costly to evaluate in practice. In this work, we explore whether large language models (LLMs) can be used out-of-the-box to generate samples of user interactions that provide useful information for training reinforcement learning models for digital behavior change settings. Using real user data from four large behavior change studies as comparison, we show that LLM-generated samples can be useful in the absence of real data. Comparisons to the samples provided by human raters further show that LLM-generated samples reach the performance of human raters. Additional analyses of different prompting strategies including shorter and longer prompt variants, chain-of-thought prompting, and few-shot prompting show that the relative effectiveness of different strategies depends on both the study and the LLM with also relatively large differences between prompt paraphrases alone. We provide recommendations for how LLM-generated samples can be useful in practice.

Are LLMs The Way Forward? A Case Study on LLM-Guided Reinforcement Learning for Decentralized Autonomous Driving

Machine Learning (CS)

Helps self-driving cars learn safer highway driving.

16 Nov 2025 0

90%

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Computation and Language

Teaches AI to learn and solve problems better.

18 Nov 2025 1

90%

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping

Computation and Language

Helps online stores act like you.

8 Oct 2025 0

View PDF Login to Bookmark

Page Count

49 pages

Can we use LLMs to bootstrap reinforcement learning? -- A case study in digital health behavior change

Helps apps learn how to help people change habits.

Technical Abstract

Are LLMs The Way Forward? A Case Study on LLM-Guided Reinforcement Learning for Decentralized Autonomous Driving

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping