Valid Survey Simulations with Limited Human Data: The Roles of Prompting, Fine-Tuning, and Rectification

Published: October 13, 2025 | arXiv ID: 2510.11408v1

By: Stefan Krsteski, Giuseppe Russo, Serina Chang, and more

Potential Business Impact:

Cuts survey cost and turnaround by letting LLMs stand in for most human respondents, while a rectification step keeps population estimates accurate.

Business Areas:
Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Surveys provide valuable insights into public opinion and behavior, but their execution is costly and slow. Large language models (LLMs) have been proposed as a scalable, low-cost substitute for human respondents, but their outputs are often biased and yield invalid estimates. We study the interplay between synthesis methods that use LLMs to generate survey responses and rectification methods that debias population estimates, and explore how human responses are best allocated between them. Using two panel surveys with questions on nutrition, politics, and economics, we find that synthesis alone introduces substantial bias (24-86%), whereas combining it with rectification reduces bias below 5% and increases effective sample size by up to 14%. Overall, we challenge the common practice of using all human responses for fine-tuning, showing that under a fixed budget, allocating most to rectification results in far more effective estimation.
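The abstract does not spell out the rectification estimator, so here is a minimal sketch of one standard approach: a prediction-powered-inference-style mean correction, where a small paired human sample debiases a large pool of LLM-synthesized responses. All variable names, data, and numbers below are hypothetical illustrations, not taken from the paper.

```python
"""Toy rectification sketch (assumed PPI-style estimator, not the paper's
exact method): debias a synthetic-response mean with a small human sample."""

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: a large pool of LLM-synthesized answers to one survey
# question (e.g., on a 1-5 scale). The LLM is biased upward by ~0.4; the
# true population mean is 3.0.
synthetic = rng.normal(loc=3.4, scale=1.0, size=5000)

# Small paired human sample: for the same n respondents we have both the
# human answer and the LLM's prediction for that respondent.
n = 200
human = rng.normal(loc=3.0, scale=1.0, size=n)
llm_on_human = human + rng.normal(loc=0.4, scale=0.95, size=n)  # same bias

# Synthesis-only estimate (biased) vs. rectified estimate (debiased by the
# average human-vs-LLM discrepancy on the paired sample).
synthesis_only = synthetic.mean()
bias_correction = (human - llm_on_human).mean()
rectified = synthesis_only + bias_correction

print(f"synthesis-only mean : {synthesis_only:.3f}")  # ~3.4 (biased)
print(f"rectified mean      : {rectified:.3f}")       # ~3.0 (debiased)

# Effective sample size: the rectified estimator's variance combines the
# correction-term variance (over n humans) with the synthetic-pool variance,
# so when LLM errors are not too noisy it beats n human responses alone.
var_human_only = human.var(ddof=1) / n
var_rectified = ((human - llm_on_human).var(ddof=1) / n
                 + synthetic.var(ddof=1) / len(synthetic))
ess = var_human_only / var_rectified * n
print(f"effective sample size ~ {ess:.0f} (vs. n = {n} humans alone)")
```

With these toy parameters the rectified estimate recovers the true mean and the effective sample size exceeds n by a few percent, mirroring the paper's reported pattern (bias below 5%, effective sample size up by as much as 14%), though the actual gains depend on how well the LLM tracks human responses.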

Country of Origin
🇨🇭 Switzerland

Page Count
19 pages

Category
Computer Science:
Computation and Language