LLM-Assisted Question-Answering on Technical Documents Using Structured Data-Aware Retrieval Augmented Generation
Shadman Sobhan, Mohammad Ariful Haque

TL;DR
This paper introduces a retrieval-augmented generation pipeline tailored for technical documents with structured data like tables and images, enhancing accuracy and relevance in question-answering tasks.
Contribution
It presents a novel RAG pipeline that effectively handles complex structured data in technical documents, incorporating a specialized reranker trained with RAFT for improved context understanding.
Findings
Achieves 94-96% faithfulness scores in evaluations.
Outperforms general RAG pipelines on table-based questions.
Effectively handles both scanned and searchable document formats.
Abstract
Large Language Models (LLMs) are capable of natural language understanding and generation. But they face challenges such as hallucination and outdated knowledge. Fine-tuning is one possible solution, but it is resource-intensive and must be repeated with every data update. Retrieval-Augmented Generation (RAG) offers an efficient solution by allowing LLMs to access external knowledge sources. However, traditional RAG pipelines struggle with retrieving information from complex technical documents with structured data such as tables and images. In this work, we propose a RAG pipeline, capable of handling tables and images in documents, for technical documents that support both scanned and searchable formats. Its retrieval process combines vector similarity search with a fine-tuned reranker based on Gemma-2-9b-it. The reranker is trained using RAFT (Retrieval-Augmented Fine-Tuning) on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Handwritten Text Recognition Techniques · Multimodal Machine Learning Applications
