The Need for a Socially-Grounded Persona Framework for User Simulation
By: Pranav Narayanan Venkit , Yu Li , Yada Pruksachatkun and more
Potential Business Impact:
Makes AI understand people better by knowing their thoughts.
Synthetic personas are widely used to condition large language models (LLMs) for social simulation, yet most personas are still constructed from coarse sociodemographic attributes or summaries. We revisit persona creation by introducing SCOPE, a socially grounded framework for persona construction and evaluation, built from a 141-item, two-hour sociopsychological protocol collected from 124 U.S.-based participants. Across seven models, we find that demographic-only personas are a structural bottleneck: demographics explain only ~1.5% of variance in human response similarity. Adding sociopsychological facets improves behavioral prediction and reduces over-accentuation, and non-demographic personas based on values and identity achieve strong alignment with substantially lower bias. These trends generalize to SimBench (441 aligned questions), where SCOPE personas outperform default prompting and NVIDIA Nemotron personas, and SCOPE augmentation improves Nemotron-based personas. Our results indicate that persona quality depends on sociopsychological structure rather than demographic templates or summaries.
Similar Papers
Population-Aligned Persona Generation for LLM-based Social Simulation
Computation and Language
Creates realistic people for computer simulations.
Whose Personae? Synthetic Persona Experiments in LLM Research and Pathways to Transparency
Computers and Society
Makes AI understand people better and more fairly.
Polypersona: Persona-Grounded LLM for Synthetic Survey Responses
Computation and Language
Makes computers answer surveys like real people.