KoRe: Compact Knowledge Representations for Large Language Models
Davide Cavicchini, Fausto Giunchiglia, Jacopo Staiano

TL;DR
KoRe introduces a method to encode 1-hop sub-graphs into compact tokens, enabling efficient integration of knowledge graphs into large language models with reduced token usage and maintained performance.
Contribution
The paper presents KoRe, a novel approach for embedding knowledge graph sub-graphs into LLMs using discrete tokens, reducing token count and improving knowledge grounding.
Findings
Achieved up to 10x reduction in token usage.
Demonstrated competitive performance on three benchmarks.
Showed effective grounding of LLMs with compact KG representations.
Abstract
Modern Large Language Models (LLMs) have shown impressive performances in user-facing tasks such as question answering, as well as consistent improvements in reasoning capabilities. Still, the way these models encode knowledge seems inherently flawed: by design, LLMs encode world-knowledge within their parameters. This way of representing knowledge is inherently opaque, difficult to debug and update, and prone to hallucinations. On the other hand, Knowledge Graphs can provide human-readable and easily editable world knowledge representations, and their application in knowledge-intensive tasks has consistently proven beneficial to downstream performance. Nonetheless, current integration techniques require extensive retraining or finetuning. To overcome this issue, we introduce KoRe, a methodology to encode 1-hop sub-graphs into compact discrete knowledge tokens and inject them into a LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
