From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering
Nathaniel Weir, Bhavana Dalvi Mishra, Orion Weller, Oyvind Tafjord,, Sam Hornstein, Alexander Sabol, Peter Jansen, Benjamin Van Durme, Peter Clark

TL;DR
This paper introduces a method to extract and distill a language model's core topical knowledge, called microtheories, which improve answer grounding and accuracy while enhancing interpretability and trust.
Contribution
The paper proposes a novel approach to create concise, general microtheories from language models that enhance answer grounding, accuracy, and interpretability in grounded question answering.
Findings
Microtheories improve answer grounding by up to +8%.
Distilled microtheories contain more topically critical facts.
Microtheories enhance model performance and trustworthiness.
Abstract
Recent reasoning methods (e.g., chain-of-thought, entailment reasoning) help users understand how language models (LMs) answer a single question, but they do little to reveal the LM's overall understanding, or "theory," about the question's topic, making it still hard to trust the model. Our goal is to materialize such theories - here called microtheories (a linguistic analog of logical microtheories) - as a set of sentences encapsulating an LM's core knowledge about a topic. These statements systematically work together to entail answers to a set of questions to both engender trust and improve performance. Our approach is to first populate a knowledge store with (model-generated) sentences that entail answers to training questions and then distill those down to a core microtheory that is concise, general, and non-redundant. We show that, when added to a general corpus (e.g.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExpert finding and Q&A systems · Speech and dialogue systems
MethodsSparse Evolutionary Training
