Improving LLMs with a knowledge from databases
Petr M\'a\v{s}a

TL;DR
This paper introduces a novel method that enhances large language models' answers by integrating a ruleset derived from databases using interpretable association rules, improving accuracy and safety.
Contribution
The paper proposes a new approach combining association rule mining with RAG techniques to improve LLM responses based on structured data.
Findings
Significant improvement over ChatGPT in dataset-based questions
Method effectively integrates rules into LLMs via RAG
Potential for further enhancements with additional patterns
Abstract
Large language models (LLMs) are achieving significant progress almost every moment now. Many advanced techniques have been introduced and widely accepted, like retrieval-augmentation generation (RAG), agents, and tools. Tools can query the database to answer questions from structured data files or perform groupings or other statistics. This unlocks huge opportunities, such as it can answer any question, but also poses threats, such as safety, because there is no control over the commands that are created. We would like to discuss whether we can create a new method that improves answers based on dataset/database via some interpretable ML methods, namely enhanced association rules. The advantage would be if the method can be also used in some safe technique like RAG. Association rules have a sound history. Since the introduction of CN2 and aproiri, many enhancements have been made. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Law
