VIRAASAT: Traversing Novel Paths for Indian Cultural Reasoning
Harshul Raj Surana, Arijit Maji, Aryan Vats, Akash Ghosh, Sriparna Saha, Amit Sheth

TL;DR
VIRAASAT introduces a culturally rich, multi-hop question-answering dataset for Indian culture, highlighting current LLM limitations and proposing a novel symbolic reasoning framework to improve cultural reasoning capabilities.
Contribution
The paper presents VIRAASAT, a large-scale, semi-automated dataset for Indian cultural reasoning and introduces SCoM, a new framework that enhances LLMs' ability to perform multi-hop cultural reasoning.
Findings
SOTA LLMs struggle with low-probability cultural facts.
SCoM improves reasoning accuracy by up to 20%.
VIRAASAT covers 13 cultural attributes across all 28 Indian states and 8 Union Territories.
Abstract
Large Language Models (LLMs) have made significant progress in reasoning tasks across various domains such as mathematics and coding. However, their performance deteriorates in tasks requiring rich socio-cultural knowledge and diverse local contexts, particularly those involving Indian culture. Existing cultural benchmarks are (i) manually crafted, (ii) limited to single-hop questions testing factual recall, and (iii) prohibitively costly to scale, leaving this deficiency largely unmeasured. To address this, we introduce VIRAASAT, a novel, semi-automated approach for generating a culturally specific multi-hop Question-Answering dataset for Indian culture. VIRAASAT leverages a Knowledge Graph comprising more than 700 expert-curated cultural artifacts, covering 13 key attributes of Indian culture (history, festivals, etc.). VIRAASAT spans all 28 states and 8 Union Territories, yielding…
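To make the idea of generating multi-hop questions from a knowledge graph concrete, here is a minimal sketch of a two-hop traversal. It is not the VIRAASAT pipeline: the triple schema, the toy entries, the question template, and the function names `build_state_index` and `two_hop_questions` are all assumptions made for illustration.

```python
from collections import defaultdict

# Toy (artifact, attribute, state) triples. Illustrative only; these are
# NOT taken from the VIRAASAT knowledge graph, and the schema is assumed.
TRIPLES = [
    ("Pongal", "festival", "Tamil Nadu"),
    ("Bharatanatyam", "dance", "Tamil Nadu"),
    ("Bihu", "festival", "Assam"),
    ("Sattriya", "dance", "Assam"),
]


def build_state_index(triples):
    """Map each state to the (artifact, attribute) pairs linked to it."""
    index = defaultdict(list)
    for artifact, attribute, state in triples:
        index[state].append((artifact, attribute))
    return index


def two_hop_questions(triples):
    """Yield (question, answer) pairs that bridge two artifacts via a state.

    Hop 1: source artifact -> state.
    Hop 2: state -> target artifact with a different attribute.
    The state itself is never named in the question, so answering
    requires resolving the intermediate entity.
    """
    index = build_state_index(triples)
    for source, source_attr, state in triples:
        for target, target_attr in index[state]:
            if target_attr == source_attr:
                continue  # skip the source itself and same-attribute artifacts
            question = (
                f"Which {target_attr} is associated with the state "
                f"linked to the {source_attr} {source}?"
            )
            yield question, target


qa_pairs = list(two_hop_questions(TRIPLES))
for question, answer in qa_pairs:
    print(question, "->", answer)
```

In this toy graph each state has exactly one artifact per attribute, so every question has a single answer. A real graph would need an extra uniqueness check or would have to accept a set of valid answers.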
Taxonomy
Topics: Big Data and Digital Economy · Advanced Graph Neural Networks · Topic Modeling
