Coarse-to-Fine Memory Matching for Joint Retrieval and Classification
Allen Schmaltz, Andrew Beam

TL;DR
This paper introduces a novel end-to-end language model that unifies retrieval and classification using a coarse-to-fine memory matching approach, significantly improving fact verification accuracy with a single BERT model.
Contribution
The work presents a new joint retrieval and classification method with a coarse-to-fine memory matching, enhancing accuracy and enabling model behavior updates through memory and exemplar databases.
Findings
Achieves higher classification accuracy than knowledge-base-only models.
Uses a single BERT model with memory layers for joint retrieval and classification.
Enables updating model behavior via retrieved information and exemplar database.
Abstract
We present a novel end-to-end language model for joint retrieval and classification, unifying the strengths of bi- and cross- encoders into a single language model via a coarse-to-fine memory matching search procedure for learning and inference. Evaluated on the standard blind test set of the FEVER fact verification dataset, classification accuracy is significantly higher than approaches that only rely on the language model parameters as a knowledge base, and approaches some recent multi-model pipeline systems, using only a single BERT base model augmented with memory layers. We further demonstrate how coupled retrieval and classification can be leveraged to identify low confidence instances, and we extend exemplar auditing to this setting for analyzing and constraining the model. As a result, our approach yields a means of updating language model behavior through two distinct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
MethodsLinear Layer · Residual Connection · Adam · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · Softmax · Multi-Head Attention · Dropout · Attention Is All You Need
