CrowdLLM: Building LLM-Based Digital Populations Augmented with Generative Models
By: Ryan Feng Lin , Keyu Tian , Hanming Zheng and more
Potential Business Impact:
Creates realistic digital crowds for cheaper studies.
The emergence of large language models (LLMs) has sparked much interest in creating LLM-based digital populations that can be applied to many applications such as social simulation, crowdsourcing, marketing, and recommendation systems. A digital population can reduce the cost of recruiting human participants and alleviate many concerns related to human subject study. However, research has found that most of the existing works rely solely on LLMs and could not sufficiently capture the accuracy and diversity of a real human population. To address this limitation, we propose CrowdLLM that integrates pretrained LLMs and generative models to enhance the diversity and fidelity of the digital population. We conduct theoretical analysis of CrowdLLM regarding its great potential in creating cost-effective, sufficiently representative, scalable digital populations that can match the quality of a real crowd. Comprehensive experiments are also conducted across multiple domains (e.g., crowdsourcing, voting, user rating) and simulation studies which demonstrate that CrowdLLM achieves promising performance in both accuracy and distributional fidelity to human data.
Similar Papers
Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations
Artificial Intelligence
Makes AI show many different human ideas.
Integrating LLM in Agent-Based Social Simulation: Opportunities and Challenges
Artificial Intelligence
Lets computer characters act more like real people.
Population-Aligned Persona Generation for LLM-based Social Simulation
Computation and Language
Creates realistic people for computer simulations.