Score: 0

Large language models for folktale type automation based on motifs: Cinderella case study

Published: October 21, 2025 | arXiv ID: 2510.18561v1

By: Tjaša Arčon, Marko Robnik-Šikonja, Polona Tratnik

Potential Business Impact:

Finds story patterns in thousands of fairy tales.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Artificial intelligence approaches are being adapted to many research areas, including digital humanities. We built a methodology for large-scale analyses in folkloristics. Using machine learning and natural language processing, we automatically detected motifs in a large collection of Cinderella variants and analysed their similarities and differences with clustering and dimensionality reduction. The results show that large language models detect complex interactions in tales, enabling computational analysis of extensive text collections and facilitating cross-lingual comparisons.

Country of Origin
🇸🇮 Slovenia

Page Count
22 pages

Category
Computer Science:
Computation and Language