Using LLMs as prompt modifier to avoid biases in AI image generators
By: René Peinl
Potential Business Impact:
Makes AI art show more kinds of people.
This study examines how Large Language Models (LLMs) can reduce biases in text-to-image generation systems by modifying user prompts. We define bias as a model's unfair deviation from population statistics given neutral prompts. Our experiments with Stable Diffusion XL, Stable Diffusion 3.5, and Flux demonstrate that LLM-modified prompts significantly increase image diversity and reduce bias without the need to change the image generators themselves. While this approach occasionally produces results that diverge from the original user intent for elaborate prompts, it generally yields more varied interpretations of underspecified requests rather than superficial variations. The method works particularly well for less advanced image generators, though limitations persist for certain contexts such as disability representation. All prompts and generated images are available at https://iisys-hof.github.io/llm-prompt-img-gen/
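To make the idea concrete, the sketch below shows one plausible way to wire up such a pipeline: an LLM rewrites an underspecified prompt several times, and each rewrite is sent unchanged to a stock image generator. This is a minimal illustration under stated assumptions, not the authors' implementation: the rewrite instruction wording, the model names (gpt-4o-mini, stabilityai/stable-diffusion-xl-base-1.0), and the helper name diversify_prompt are all hypothetical choices made here for the example.

```python
# Minimal sketch: LLM as prompt modifier in front of an unchanged image
# generator. All model choices and instruction wording are illustrative
# assumptions, not the paper's exact setup.
import torch
from openai import OpenAI
from diffusers import StableDiffusionXLPipeline

# Hypothetical rewrite instruction: ask the LLM to fill in unspecified
# demographic attributes so that repeated rewrites vary.
REWRITE_INSTRUCTION = (
    "Rewrite the following image prompt into a more specific variant. "
    "If attributes such as gender, age, or ethnicity are unspecified, "
    "choose them so that repeated rewrites reflect real population "
    "diversity. Keep the user's stated intent intact. "
    "Return only the rewritten prompt."
)

def diversify_prompt(client: OpenAI, user_prompt: str, n: int = 4) -> list[str]:
    """Ask the LLM for n independently diversified rewrites of one prompt."""
    rewrites = []
    for _ in range(n):
        response = client.chat.completions.create(
            model="gpt-4o-mini",  # assumption: any capable chat model works
            temperature=1.0,      # high temperature encourages varied rewrites
            messages=[
                {"role": "system", "content": REWRITE_INSTRUCTION},
                {"role": "user", "content": user_prompt},
            ],
        )
        rewrites.append(response.choices[0].message.content.strip())
    return rewrites

if __name__ == "__main__":
    client = OpenAI()  # expects OPENAI_API_KEY in the environment
    # The image generator itself is left completely unmodified.
    pipe = StableDiffusionXLPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0",
        torch_dtype=torch.float16,
    ).to("cuda")

    # An underspecified prompt of the kind the abstract targets:
    for i, prompt in enumerate(diversify_prompt(client, "a photo of a doctor")):
        pipe(prompt).images[0].save(f"doctor_{i}.png")
```

The key design point the abstract emphasizes is that all diversity comes from the text side: the diffusion pipeline is loaded and called exactly as shipped, so the approach transfers across generators such as SDXL, Stable Diffusion 3.5, or Flux without retraining.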
Similar Papers
Does the Prompt-based Large Language Model Recognize Students' Demographics and Introduce Bias in Essay Scoring?
Computation and Language
AI writing grader unfairly scores non-native speakers.
Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
CV and Pattern Recognition
Shows how hidden system prompts make AI art stereotyped.
Investigating the Effects of Cognitive Biases in Prompts on Large Language Model Outputs
Computation and Language
Shows how biased wording in prompts skews AI answers.