Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?

Kunal Kingkar Das; Manoj Balaji Jagadeeshan; Nallani Chakravartula Sahith; Jivnesh Sandhan; Pawan Goyal

arXiv:2511.08145·cs.CL·November 12, 2025

Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?

Kunal Kingkar Das, Manoj Balaji Jagadeeshan, Nallani Chakravartula Sahith, Jivnesh Sandhan, Pawan Goyal

PDF

Open Access

TL;DR

This paper evaluates whether large language models can outperform smaller, specialized Sanskrit poetry-to-prose models, finding that domain-specific fine-tuning yields superior results, especially in complex, low-resource language tasks.

Contribution

It introduces a comprehensive comparison between instruction-tuned LLMs and a task-specific fine-tuned Seq2Seq model for Sanskrit poetry-to-prose conversion, highlighting the effectiveness of domain-specific fine-tuning.

Findings

01

Domain-specific fine-tuning of ByT5-Sanskrit outperforms instruction-driven LLMs.

02

Prompting strategies serve as effective alternatives when domain data is scarce.

03

The task-specific model generalizes well to out-of-domain data.

Abstract

Large Language Models (LLMs) are increasingly treated as universal, general-purpose solutions across NLP tasks, particularly in English. But does this assumption hold for low-resource, morphologically rich languages such as Sanskrit? We address this question by comparing instruction-tuned and in-context-prompted LLMs with smaller task-specific encoder-decoder models on the Sanskrit poetry-to-prose conversion task. This task is intrinsically challenging: Sanskrit verse exhibits free word order combined with rigid metrical constraints, and its conversion to canonical prose (anvaya) requires multi-step reasoning involving compound segmentation, dependency resolution, and syntactic linearisation. This makes it an ideal testbed to evaluate whether LLMs can surpass specialised models. For LLMs, we apply instruction fine-tuning on general-purpose models and design in-context learning templates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification