From Models to Microtheories: Distilling a Model's Topical Knowledge for   Grounded Question Answering

Nathaniel Weir; Bhavana Dalvi Mishra; Orion Weller; Oyvind Tafjord,; Sam Hornstein; Alexander Sabol; Peter Jansen; Benjamin Van Durme; Peter Clark

arXiv:2412.17701·cs.CL·December 25, 2024

From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering

Nathaniel Weir, Bhavana Dalvi Mishra, Orion Weller, Oyvind Tafjord,, Sam Hornstein, Alexander Sabol, Peter Jansen, Benjamin Van Durme, Peter Clark

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to extract and distill a language model's core topical knowledge, called microtheories, which improve answer grounding and accuracy while enhancing interpretability and trust.

Contribution

The paper proposes a novel approach to create concise, general microtheories from language models that enhance answer grounding, accuracy, and interpretability in grounded question answering.

Findings

01

Microtheories improve answer grounding by up to +8%.

02

Distilled microtheories contain more topically critical facts.

03

Microtheories enhance model performance and trustworthiness.

Abstract

Recent reasoning methods (e.g., chain-of-thought, entailment reasoning) help users understand how language models (LMs) answer a single question, but they do little to reveal the LM's overall understanding, or "theory," about the question's topic, making it still hard to trust the model. Our goal is to materialize such theories - here called microtheories (a linguistic analog of logical microtheories) - as a set of sentences encapsulating an LM's core knowledge about a topic. These statements systematically work together to entail answers to a set of questions to both engender trust and improve performance. Our approach is to first populate a knowledge store with (model-generated) sentences that entail answers to training questions and then distill those down to a core microtheory that is concise, general, and non-redundant. We show that, when added to a general corpus (e.g.,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nweir127/microtheories
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExpert finding and Q&A systems · Speech and dialogue systems

MethodsSparse Evolutionary Training