Key-Augmented Neural Triggers for Knowledge Sharing
Alex Wolf, Marco Edoardo Palma, Pooja Rani, Harald C. Gall

TL;DR
KANT introduces knowledge anchors into LLMs to improve code understanding and sharing by reducing fragmentation, increasing efficiency, and enabling on-premise deployment, validated through human evaluation and speed improvements.
Contribution
The paper presents KANT, a novel method embedding knowledge anchors into LLMs for enhanced code comprehension and efficient, private deployment, addressing key limitations of existing approaches.
Findings
Synthetic data improves intent coverage and quality.
KANT reduces inference latency by up to 85%.
Achieves over 60% preference in human evaluations.
Abstract
Repository-level code comprehension and knowledge sharing remain core challenges in software engineering. Large language models (LLMs) have shown promise by generating explanations of program structure and logic. However, these approaches still face limitations: First, relevant knowledge is distributed across multiple files within a repository, aka semantic fragmentation. Second, retrieval inefficiency and attention saturation degrade performance in RAG pipelines, where long, unaligned contexts overwhelm attention. Third, repository specific training data is scarce and often outdated. Finally, proprietary LLMs hinder industrial adoption due to privacy and deployment constraints. To address these issues, we propose Key-Augmented Neural Triggers (KANT), a novel approach that embeds knowledge anchors into both training and inference. Unlike prior methods, KANT enables internal access to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
