Harnessing the Power of Semi-Structured Knowledge and LLMs with Triplet-Based Prefiltering for Question Answering
Derian Boer, Fabian Koch, Stefan Kramer

TL;DR
This paper introduces 4StepFocus, a pipeline that enhances LLM question answering by integrating semi-structured knowledge through triplet-based filtering, improving accuracy and traceability.
Contribution
The paper presents a novel multi-step method combining triplet extraction, knowledge graph filtering, and ranking to augment LLMs with external knowledge for better QA performance.
Findings
Significant performance improvement over state-of-the-art methods.
Effective integration of semi-structured and unstructured data.
Demonstrated on medical, product, and academic datasets.
Abstract
Large Language Models (LLMs) frequently lack domain-specific knowledge and even fine-tuned models tend to hallucinate. Hence, more reliable models that can include external knowledge are needed. We present a pipeline, 4StepFocus, and specifically a preprocessing step, that can substantially improve the answers of LLMs. This is achieved by providing guided access to external knowledge making use of the model's ability to capture relational context and conduct rudimentary reasoning by themselves. The method narrows down potentially correct answers by triplets-based searches in a semi-structured knowledge base in a direct, traceable fashion, before switching to latent representations for ranking those candidates based on unstructured data. This distinguishes it from related methods that are purely based on latent representations. 4StepFocus consists of the steps: 1) Triplet generation for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
MethodsSparse Evolutionary Training · Balanced Selection
