Speeding Up Question Answering Task of Language Models via Inverted   Index

Xiang Ji; Yesim Sungu-Eryilmaz; Elaheh Momeni; Reza; Rawassizadeh

arXiv:2210.13578·cs.CL·October 26, 2022

Speeding Up Question Answering Task of Language Models via Inverted Index

Xiang Ji, Yesim Sungu-Eryilmaz, Elaheh Momeni, Reza, Rawassizadeh

PDF

Open Access

TL;DR

This paper introduces an inverted index mechanism to significantly enhance the efficiency of large language model-based question answering systems, reducing response time and improving answer quality for closed-domain questions.

Contribution

The study proposes a novel inverted indexing approach to accelerate LLM question answering, demonstrating substantial efficiency gains and improved answer quality.

Findings

01

Average response time reduced by 97.44%

02

BLEU score increased by 0.23

03

Efficiency improvements enable practical deployment of LLMs in real-world applications

Abstract

Natural language processing applications, such as conversational agents and their question-answering capabilities, are widely used in the real world. Despite the wide popularity of large language models (LLMs), few real-world conversational agents take advantage of LLMs. Extensive resources consumed by LLMs disable developers from integrating them into end-user applications. In this study, we leverage an inverted indexing mechanism combined with LLMs to improve the efficiency of question-answering models for closed-domain questions. Our experiments show that using the index improves the average response time by 97.44%. In addition, due to the reduced search scope, the average BLEU score improved by 0.23 while using the inverted index.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications