SAKE: Structured Agentic Knowledge Extrapolation for Complex LLM Reasoning via Reinforcement Learning

Jiashu He; Jinxuan Fan; Bowen Jiang; Ignacio Houine; Dan Roth; Alejandro Ribeiro

arXiv:2505.15062·cs.CL·April 3, 2026

SAKE: Structured Agentic Knowledge Extrapolation for Complex LLM Reasoning via Reinforcement Learning

Jiashu He, Jinxuan Fan, Bowen Jiang, Ignacio Houine, Dan Roth, Alejandro Ribeiro

PDF

1 Repo

TL;DR

SAKE introduces a reinforcement learning framework enabling small language models to autonomously retrieve and extrapolate structured knowledge, significantly improving reasoning in biomedical and commonsense domains.

Contribution

It presents a novel RL-based agentic framework with tool-augmented retrieval for structured knowledge extrapolation in LLMs, outperforming larger models.

Findings

01

SAKE fine-tuned Qwen2.5-7B surpasses GPT-3.5-Turbo on benchmarks.

02

Reduces token usage by over 90%.

03

Enables end-to-end learning of associative reasoning with small models.

Abstract

Knowledge extrapolation is the process of inferring novel information by combining and extending existing knowledge that is explicitly available. It is essential for solving complex questions in specialized domains where retrieving comprehensive external knowledge is impractical. We propose SAKE (Structured Agentic Knowledge Extrapolation), a RL powered agentic framework that trains LLMs to autonomously retrieve and extrapolate structured knowledge through tool-augmented reinforcement learning. SAKE defines two external KG tools: entity group construction and cross-group triplet retrieval. The model learns to interleave these 2 retrieval tools during a three-turn rollout: extracting key entities, filtering relevant concept groups, and associative reasoning by constructing new triplets through analogy. The entire pipeline is optimized end-to-end with GRPO using a curriculum reward,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://anonymous.4open.science/r/SAKE-7585
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.