Investigating Answerability of LLMs for Long-Form Question Answering
Meghana Moorthy Bhat, Rui Meng, Ye Liu, Yingbo Zhou, Semih Yavuz

TL;DR
This paper explores the answerability of large and open-source language models in long-form question answering by proposing a question-generation method from summaries, revealing significant performance gaps and context reliance issues.
Contribution
It introduces a novel question-generation approach from summaries to evaluate LLMs' reasoning in LFQA and highlights the performance disparities between ChatGPT and open-source models.
Findings
Generated questions from summaries challenge LLMs' reasoning.
Open-source LLMs rely less on context for questions from original documents.
Performance drops are significant for longer contexts (>1024 tokens).
Abstract
As we embark on a new era of LLMs, it becomes increasingly crucial to understand their capabilities, limitations, and differences. Toward making further progress in this direction, we strive to build a deeper understanding of the gaps between massive LLMs (e.g., ChatGPT) and smaller yet effective open-source LLMs and their distilled counterparts. To this end, we specifically focus on long-form question answering (LFQA) because it has several practical and impactful applications (e.g., troubleshooting, customer service, etc.) yet is still understudied and challenging for LLMs. We propose a question-generation method from abstractive summaries and show that generating follow-up questions from summaries of long documents can create a challenging setting for LLMs to reason and infer from long contexts. Our experimental results confirm that: (1) our proposed method of generating questions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
Methodstravel james · Focus
