Indirect Question Answering in English, German and Bavarian: A Challenging Task for High- and Low-Resource Languages Alike
Miriam Winkler, Verena Blaschke, Barbara Plank

TL;DR
This paper explores the challenging task of Indirect Question Answering across multiple languages, presenting new corpora and analyzing the difficulties faced by multilingual transformer models in low- and high-resource settings.
Contribution
It introduces two multilingual IQA datasets, analyzes the challenges of the task, and evaluates transformer models, highlighting issues like low performance and overfitting.
Findings
IQA performance is low across all tested languages.
Large training datasets improve IQA results.
GPT-4o-mini generates insufficiently pragmatic IQA data.
Abstract
Indirectness is a common feature of daily communication, yet is underexplored in NLP research for both low-resource as well as high-resource languages. Indirect Question Answering (IQA) aims at classifying the polarity of indirect answers. In this paper, we present two multilingual corpora for IQA of varying quality that both cover English, Standard German and Bavarian, a German dialect without standard orthography: InQA+, a small high-quality evaluation dataset with hand-annotated labels, and GenIQA, a larger training dataset, that contains artificial data generated by GPT-4o-mini. We find that IQA is a pragmatically hard task that comes with various challenges, based on several experiment variations with multilingual transformer models (mBERT, XLM-R and mDeBERTa). We suggest and employ recommendations to tackle these challenges. Our results reveal low performance, even for English,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
