Techniques to Improve Q&A Accuracy with Transformer-based models on Large Complex Documents

Chejui Liao; Tabish Maniar; Sravanajyothi N; Anantha Sharma

arXiv:2009.12695·cs.CL·December 18, 2024

TL;DR

This paper evaluates different text processing techniques and their combinations to enhance the accuracy of transformer-based Q&A systems on large, complex documents, identifying the most effective methods for improved performance.

Contribution

The paper systematically analyzes text simplification and encoding techniques and identifies the combination that yields a statistically significant improvement in transformer-based Q&A accuracy.

Findings

01

The best-fit combination of techniques yields a statistically significant improvement in accuracy

02

Simplified text leads to more relevant responses

03

Certain encodings enhance transformer performance

Abstract

This paper discusses the effectiveness of various text processing techniques, their combinations, and encodings in reducing the complexity and size of a given text corpus. The simplified corpus is sent to BERT (or a similar transformer-based model) for question answering and can produce more relevant responses to user queries. The paper takes a scientific approach to determining the benefits and effectiveness of the various techniques and identifies a best-fit combination that produces a statistically significant improvement in accuracy.
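The abstract does not spell out the individual techniques, so the following is only a minimal sketch of the general idea of shrinking a corpus before it reaches a question-answering model. It assumes a simple keyword-overlap filter that keeps the sentences most related to the question; the stopword list and the names `tokenize`, `split_sentences`, and `reduce_corpus` are hypothetical and are not taken from the paper.

```python
import re

# Small illustrative stopword list (an assumption; not taken from the paper).
STOPWORDS = {"a", "an", "the", "is", "are", "was", "were", "of", "in", "on",
             "to", "and", "or", "for", "with", "what", "which", "who", "how",
             "does", "do", "by", "it", "this", "that"}


def tokenize(text):
    """Lowercase the text and return its alphanumeric tokens, minus stopwords."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower())
            if t not in STOPWORDS]


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def reduce_corpus(document, question, keep=2):
    """Keep the `keep` sentences that share the most tokens with the question.

    Ties are broken by position, and the surviving sentences are returned
    in their original order so the reduced text still reads naturally.
    """
    q_tokens = set(tokenize(question))
    sentences = split_sentences(document)
    scored = [(len(q_tokens & set(tokenize(s))), i)
              for i, s in enumerate(sentences)]
    top = sorted(scored, key=lambda pair: (-pair[0], pair[1]))[:keep]
    kept = sorted(i for _, i in top)
    return " ".join(sentences[i] for i in kept)


document = ("The warranty covers parts for two years. "
            "Shipping is free on large orders. "
            "Labor is covered by the warranty for one year. "
            "Our office is closed on Sundays.")
question = "How long does the warranty cover labor?"

reduced = reduce_corpus(document, question, keep=2)
print(reduced)
```

The reduced text could then be passed as the context to a transformer question-answering model in place of the full document. A real system would likely need a stronger relevance measure than raw token overlap, which here treats "cover" and "covers" as unrelated words.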

Taxonomy

Topics: Expert finding and Q&A systems · Topic Modeling · Advanced Text Analysis Techniques

Methods: Linear Layer · Softmax · Dense Connections · Dropout · Linear Warmup With Linear Decay · Layer Normalization · Attention Dropout · WordPiece · Weight Decay