AyurParam: A State-of-the-Art Bilingual Language Model for Ayurveda
Mohd Nauman, Sravan Gvm, Vijay Devane, Shyam Pawar, Viraj Thakur, Kundeshwar Pundalik, Piyush Sawarkar, Rohit Saluja, Maunendra Desarkar, Ganesh Ramakrishnan

TL;DR
AyurParam-2.9B is a specialized bilingual language model tailored for Ayurveda, demonstrating superior performance over comparable models through extensive domain-specific training and high-quality data.
Contribution
The paper introduces AyurParam-2.9B, a novel domain-specific bilingual LLM for Ayurveda, trained on a curated dataset to improve accuracy and cultural relevance in medical AI applications.
Findings
Outperforms open-source models of similar size on Ayurvedic tasks
Achieves competitive results against larger models in Ayurveda-related benchmarks
Highlights importance of domain-specific data and supervision for specialized AI
Abstract
Current large language models excel at broad, general-purpose tasks, but consistently underperform when exposed to highly specialized domains that require deep cultural, linguistic, and subject-matter expertise. In particular, traditional medical systems such as Ayurveda embody centuries of nuanced textual and clinical knowledge that mainstream LLMs fail to accurately interpret or apply. We introduce AyurParam-2.9B, a domain-specialized, bilingual language model fine-tuned from Param-1-2.9B using an extensive, expertly curated Ayurveda dataset spanning classical texts and clinical guidance. AyurParam's dataset incorporates context-aware, reasoning, and objective-style Q&A in both English and Hindi, with rigorous annotation protocols for factual precision and instructional clarity. Benchmarked on BhashaBench-Ayur, AyurParam not only surpasses all open-source instruction-tuned models in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraditional Chinese Medicine Studies · Natural Language Processing Techniques · Machine Learning in Healthcare
