Improving Retrieval Augmented Open-Domain Question-Answering with   Vectorized Contexts

Zhuo Chen; Xinyu Wang; Yong Jiang; Pengjun Xie; Fei Huang; Kewei Tu

arXiv:2404.02022·cs.CL·December 24, 2024·1 cites

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts

Zhuo Chen, Xinyu Wang, Yong Jiang, Pengjun Xie, Fei Huang, Kewei Tu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a method to extend context coverage in open-domain question-answering by using a small encoder to create vectorized contexts, improving performance without significantly increasing computational costs.

Contribution

It presents a novel approach that enables large language models to handle longer contexts efficiently through vectorized encoding and cross-attention, enhancing open-domain QA capabilities.

Findings

01

Improved performance on multiple datasets after fine-tuning.

02

Extended context coverage with minimal additional computational requirements.

03

Effective in in-context learning scenarios.

Abstract

In the era of large language models, applying techniques such as Retrieval Augmented Generation can better address Open-Domain Question-Answering problems. Due to constraints including model sizes and computing resources, the length of context is often limited, and it becomes challenging to empower the model to cover overlong contexts while answering questions from open domains. This paper proposes a general and convenient method to covering longer contexts in Open-Domain Question-Answering tasks. It leverages a small encoder language model that effectively encodes contexts, and the encoding applies cross-attention with origin inputs. With our method, the origin language models can cover several times longer contexts while keeping the computing requirements close to the baseline. Our experiments demonstrate that after fine-tuning, there is improved performance across two held-in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alibaba-nlp/vec-ra-odqa
jaxOfficial

Videos

Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts· underline

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Intelligent Tutoring Systems and Adaptive Learning