Once Upon a Time: Interactive Learning for Storytelling with Small Language Models
By: Jonas Mayer Martins, Ali Hamza Bashir, Muhammad Rehan Khalid, and others
Potential Business Impact:
Teaches computers to write stories with less data.
Children efficiently acquire language not just by listening, but by interacting with others in their social environment. In contrast, large language models are typically trained with next-word prediction on massive amounts of text. Motivated by this contrast, we investigate whether language models can be trained with less data by learning not only from next-word prediction but also from high-level, cognitively inspired feedback. We train a student model to generate stories, which a teacher model rates on readability, narrative coherence, and creativity. By varying the amount of pretraining before the feedback loop, we assess the impact of this interactive learning on formal and functional linguistic competence. We find that the high-level feedback is highly data-efficient: with just 1M words of input in interactive learning, storytelling skills can improve as much as with 410M words of next-word prediction.
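The student-teacher loop described in the abstract maps naturally onto a reinforcement-style update. Below is a minimal sketch, assuming a HuggingFace-style causal LM as the student and a stubbed scalar-score teacher; the REINFORCE update and the toy reward function are illustrative assumptions, not the paper's exact training procedure.

```python
# Minimal sketch of the interactive learning loop described above.
# Assumptions (not from the paper): GPT-2 as a stand-in student, a teacher
# that returns one scalar in [0, 1] combining readability, coherence, and
# creativity, and a plain REINFORCE-style policy update.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
tok = AutoTokenizer.from_pretrained("gpt2")
student = AutoModelForCausalLM.from_pretrained("gpt2").to(device)
optim = torch.optim.AdamW(student.parameters(), lr=1e-5)

def teacher_score(story: str) -> float:
    """Hypothetical teacher: in the paper a teacher model rates the story on
    readability, narrative coherence, and creativity. Stubbed here with a
    toy lexical-diversity proxy so the sketch is self-contained."""
    return min(len(set(story.split())) / 100.0, 1.0)

prompt = "Once upon a time"
inputs = tok(prompt, return_tensors="pt").to(device)

for step in range(3):  # a few illustrative feedback rounds
    # Student samples a story, so there is something to reinforce.
    out = student.generate(**inputs, do_sample=True, max_new_tokens=64,
                           pad_token_id=tok.eos_token_id)
    story = tok.decode(out[0], skip_special_tokens=True)
    reward = teacher_score(story)

    # REINFORCE: log-likelihood of the sampled tokens, weighted by reward.
    logits = student(out).logits[:, :-1]              # predict token t+1
    logp = torch.log_softmax(logits, dim=-1)
    token_logp = logp.gather(2, out[:, 1:].unsqueeze(-1)).squeeze(-1)
    loss = -(reward * token_logp.mean())

    optim.zero_grad()
    loss.backward()
    optim.step()
    print(f"step {step}: reward={reward:.2f} loss={loss.item():.3f}")
```

In practice one would add a baseline to reduce gradient variance and score only the generated continuation rather than the prompt, but the sketch shows the core idea: the teacher's high-level rating, not next-word prediction alone, drives the student's update.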
Similar Papers
Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning
Computation and Language
Teaches computers to learn language like kids.
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Computation and Language
Teaches computers to learn language like babies.
Listening with Language Models: Using LLMs to Collect and Interpret Classroom Feedback
Computers and Society
AI chatbot helps teachers get better student feedback.