Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?
By: Kunal Kingkar Das, Manoj Balaji Jagadeeshan, Nallani Chakravartula Sahith, and more
Potential Business Impact:
Rewrites old Sanskrit poems as plain, understandable sentences.
Large Language Models (LLMs) are increasingly treated as universal, general-purpose solutions across NLP tasks, particularly in English. But does this assumption hold for low-resource, morphologically rich languages such as Sanskrit? We address this question by comparing instruction-tuned and in-context-prompted LLMs with smaller task-specific encoder-decoder models on the Sanskrit poetry-to-prose conversion task. This task is intrinsically challenging: Sanskrit verse exhibits free word order combined with rigid metrical constraints, and its conversion to canonical prose (anvaya) requires multi-step reasoning involving compound segmentation, dependency resolution, and syntactic linearisation. This makes it an ideal testbed to evaluate whether LLMs can surpass specialised models. For LLMs, we apply instruction fine-tuning on general-purpose models and design in-context learning templates grounded in Paninian grammar and classical commentary heuristics. For task-specific modelling, we fully fine-tune a ByT5-Sanskrit Seq2Seq model. Our experiments show that domain-specific fine-tuning of ByT5-Sanskrit significantly outperforms all instruction-driven LLM approaches. Human evaluation strongly corroborates this result, with human judgements correlating highly with Kendall's Tau scores. Additionally, our prompting strategies provide an alternative to fine-tuning when domain-specific verse corpora are unavailable, and the task-specific Seq2Seq model demonstrates robust generalisation on out-of-domain evaluations.
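To make the in-context learning setup concrete, here is a minimal Python sketch of how a few-shot prompt grounded in Paninian-style heuristics might be assembled. The heuristic wording, placeholder example pairs, and the `build_prompt` helper are illustrative assumptions, not the authors' actual templates.

```python
# Illustrative few-shot prompt builder for verse-to-anvaya conversion.
# The heuristic wording and example pairs are placeholders, not the
# authors' actual in-context templates.

HEURISTICS = (
    "1. Segment sandhi and compounds before reordering.\n"
    "2. Resolve karaka (dependency) relations between the words.\n"
    "3. Linearise into canonical prose order: subject, objects/adjuncts, verb.\n"
)

FEW_SHOT_EXAMPLES = [
    # (verse, anvaya) pairs, e.g. drawn from commentary-annotated data.
    ("<verse 1>", "<anvaya 1>"),
    ("<verse 2>", "<anvaya 2>"),
]

def build_prompt(verse: str) -> str:
    """Assemble an in-context prompt for a general-purpose LLM."""
    shots = "\n\n".join(
        f"Verse: {v}\nProse (anvaya): {a}" for v, a in FEW_SHOT_EXAMPLES
    )
    return (
        "Convert the Sanskrit verse into canonical prose (anvaya), "
        "following these steps:\n"
        f"{HEURISTICS}\n{shots}\n\nVerse: {verse}\nProse (anvaya):"
    )
```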
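For the task-specific baseline, full fine-tuning of a ByT5-Sanskrit Seq2Seq model can be sketched with the Hugging Face transformers API. The checkpoint id, hyperparameters, and toy dataset below are assumptions for illustration; the paper's exact training configuration is not reproduced here.

```python
# A minimal Seq2Seq fine-tuning sketch with Hugging Face transformers.
from datasets import Dataset
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

MODEL_ID = "buddhist-nlp/byt5-sanskrit"  # assumed checkpoint id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

def preprocess(batch):
    # Inputs are verses; labels are the gold prose (anvaya) orderings.
    model_inputs = tokenizer(batch["verse"], truncation=True, max_length=512)
    labels = tokenizer(text_target=batch["anvaya"], truncation=True, max_length=512)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

# Toy in-memory dataset standing in for a verse-anvaya corpus.
raw = Dataset.from_dict({
    "verse": ["<verse 1>", "<verse 2>"],
    "anvaya": ["<anvaya 1>", "<anvaya 2>"],
})
train_ds = raw.map(preprocess, batched=True, remove_columns=raw.column_names)

args = Seq2SeqTrainingArguments(
    output_dir="byt5-sanskrit-anvaya",
    learning_rate=1e-4,          # illustrative hyperparameters
    per_device_train_batch_size=8,
    num_train_epochs=10,
    predict_with_generate=True,
)
trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=train_ds,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()
```

Because ByT5 operates on raw bytes, no Sanskrit-specific tokenizer is needed, which is one reason byte-level models suit morphologically rich, sandhi-heavy text.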
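The word-order metric can be illustrated with a small Kendall's Tau scorer over gold and predicted prose orderings. The greedy left-to-right matching of duplicate words below is a simplifying assumption, not necessarily the paper's alignment scheme.

```python
# Word-order evaluation with Kendall's Tau between gold and predicted
# prose (anvaya) orderings.
from scipy.stats import kendalltau

def order_tau(gold: str, pred: str) -> float:
    """Kendall's Tau between gold and predicted word orders.

    Only words present in both sequences are compared; duplicates are
    matched greedily left to right (a simplifying choice).
    """
    gold_words = gold.split()
    pred_words = pred.split()
    gold_pos, pred_pos = [], []
    used = set()
    for i, w in enumerate(gold_words):
        for j, v in enumerate(pred_words):
            if v == w and j not in used:
                gold_pos.append(i)
                pred_pos.append(j)
                used.add(j)
                break
    if len(gold_pos) < 2:
        return 0.0  # too few shared words to rank
    tau, _ = kendalltau(gold_pos, pred_pos)
    return tau

# Identical orderings score 1.0; reversed orderings approach -1.0.
print(order_tau("ramah vanam gacchati", "ramah vanam gacchati"))
```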
Similar Papers
From Plain Text to Poetic Form: Generating Metrically-Constrained Sanskrit Verses
Computation and Language
Teaches computers to write old Sanskrit poems.
State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?
Computation and Language
New AI understands many languages for tasks.
Regional Tiny Stories: Using Small Models to Compare Language Learning and Tokenizer Performance
Computation and Language
Helps small computers understand Indian languages.