TL;DR
SAKE introduces a reinforcement learning framework enabling small language models to autonomously retrieve and extrapolate structured knowledge, significantly improving reasoning in biomedical and commonsense domains.
Contribution
It presents a novel RL-based agentic framework with tool-augmented retrieval for structured knowledge extrapolation in LLMs, outperforming larger models.
Findings
SAKE fine-tuned Qwen2.5-7B surpasses GPT-3.5-Turbo on benchmarks.
Reduces token usage by over 90%.
Enables end-to-end learning of associative reasoning with small models.
Abstract
Knowledge extrapolation is the process of inferring novel information by combining and extending existing knowledge that is explicitly available. It is essential for solving complex questions in specialized domains where retrieving comprehensive external knowledge is impractical. We propose SAKE (Structured Agentic Knowledge Extrapolation), a RL powered agentic framework that trains LLMs to autonomously retrieve and extrapolate structured knowledge through tool-augmented reinforcement learning. SAKE defines two external KG tools: entity group construction and cross-group triplet retrieval. The model learns to interleave these 2 retrieval tools during a three-turn rollout: extracting key entities, filtering relevant concept groups, and associative reasoning by constructing new triplets through analogy. The entire pipeline is optimized end-to-end with GRPO using a curriculum reward,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
