Investigating Answerability of LLMs for Long-Form Question Answering

Meghana Moorthy Bhat; Rui Meng; Ye Liu; Yingbo Zhou; Semih Yavuz

arXiv:2309.08210·cs.CL·September 18, 2023·2 cites

Investigating Answerability of LLMs for Long-Form Question Answering

Meghana Moorthy Bhat, Rui Meng, Ye Liu, Yingbo Zhou, Semih Yavuz

PDF

Open Access

TL;DR

This paper explores the answerability of large and open-source language models in long-form question answering by proposing a question-generation method from summaries, revealing significant performance gaps and context reliance issues.

Contribution

It introduces a novel question-generation approach from summaries to evaluate LLMs' reasoning in LFQA and highlights the performance disparities between ChatGPT and open-source models.

Findings

01

Generated questions from summaries challenge LLMs' reasoning.

02

Open-source LLMs rely less on context for questions from original documents.

03

Performance drops are significant for longer contexts (>1024 tokens).

Abstract

As we embark on a new era of LLMs, it becomes increasingly crucial to understand their capabilities, limitations, and differences. Toward making further progress in this direction, we strive to build a deeper understanding of the gaps between massive LLMs (e.g., ChatGPT) and smaller yet effective open-source LLMs and their distilled counterparts. To this end, we specifically focus on long-form question answering (LFQA) because it has several practical and impactful applications (e.g., troubleshooting, customer service, etc.) yet is still understudied and challenging for LLMs. We propose a question-generation method from abstractive summaries and show that generating follow-up questions from summaries of long documents can create a challenging setting for LLMs to reason and infer from long contexts. Our experimental results confirm that: (1) our proposed method of generating questions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

Methodstravel james · Focus