Score: 0

LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams

Published: October 7, 2025 | arXiv ID: 2510.06151v1

By: Aju Ani Justus, Chris Baber

Potential Business Impact:

Computers learn to play games like people.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

A critical challenge in modelling Heterogeneous-Agent Teams is training agents to collaborate with teammates whose policies are inaccessible or non-stationary, such as humans. Traditional approaches rely on expensive human-in-the-loop data, which limits scalability. We propose using Large Language Models (LLMs) as policy-agnostic human proxies to generate synthetic data that mimics human decision-making. To evaluate this, we conduct three experiments in a grid-world capture game inspired by Stag Hunt, a game theory paradigm that balances risk and reward. In Experiment 1, we compare decisions from 30 human participants and 2 expert judges with outputs from LLaMA 3.1 and Mixtral 8x22B models. LLMs, prompted with game-state observations and reward structures, align more closely with experts than participants, demonstrating consistency in applying underlying decision criteria. Experiment 2 modifies prompts to induce risk-sensitive strategies (e.g. "be risk averse"). LLM outputs mirror human participants' variability, shifting between risk-averse and risk-seeking behaviours. Finally, Experiment 3 tests LLMs in a dynamic grid-world where the LLM agents generate movement actions. LLMs produce trajectories resembling human participants' paths. While LLMs cannot yet fully replicate human adaptability, their prompt-guided diversity offers a scalable foundation for simulating policy-agnostic teammates.

Large language models replicate and predict human cooperation across experiments in game theory

Artificial Intelligence

Makes computers act like people in games.

6 Nov 2025 0

92%

Large language models replicate and predict human cooperation across experiments in game theory

Artificial Intelligence

Makes computers act like people making choices.

6 Nov 2025 1

91%

Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations

Artificial Intelligence

Makes AI show many different human ideas.

8 Oct 2025 0

View PDF Login to Bookmark

Page Count

8 pages

LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams

Computers learn to play games like people.

Technical Abstract

Large language models replicate and predict human cooperation across experiments in game theory

Large language models replicate and predict human cooperation across experiments in game theory

Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations