Score: 1

Clustering Discourses: Racial Biases in Short Stories about Women Generated by Large Language Models

Published: September 2, 2025 | arXiv ID: 2509.02834v1

By: Gustavo Bonil , João Gondim , Marina dos Santos and more

Potential Business Impact:

Shows how AI can spread unfair ideas about women.

Business Areas:
Women's Community and Lifestyle

This study investigates how large language models, in particular LLaMA 3.2-3B, construct narratives about Black and white women in short stories generated in Portuguese. From 2100 texts, we applied computational methods to group semantically similar stories, allowing a selection for qualitative analysis. Three main discursive representations emerge: social overcoming, ancestral mythification and subjective self-realization. The analysis uncovers how grammatically coherent, seemingly neutral texts materialize a crystallized, colonially structured framing of the female body, reinforcing historical inequalities. The study proposes an integrated approach, that combines machine learning techniques with qualitative, manual discourse analysis.

Country of Origin
🇧🇷 Brazil

Repos / Data Links

Page Count
12 pages

Category
Computer Science:
Computation and Language