Distilling LLM Reasoning into Graph of Concept Predictors

Ziyang Yu; Liang Zhao

arXiv:2602.03006·cs.AI·April 1, 2026

Distilling LLM Reasoning into Graph of Concept Predictors

Ziyang Yu, Liang Zhao

PDF

1 Repo

TL;DR

This paper introduces Graph of Concept Predictors (GCP), a reasoning-aware active distillation framework that improves LLM training efficiency, interpretability, and performance by externalizing decision processes as a graph and focusing on critical reasoning nodes.

Contribution

GCP is a novel framework that externalizes LLM reasoning as a graph, enhancing sample efficiency, interpretability, and training stability through targeted sub-module retraining.

Findings

01

GCP improves performance on NLP classification benchmarks with limited annotations.

02

GCP yields more interpretable and controllable training dynamics.

03

GCP outperforms traditional distillation methods in efficiency and accuracy.

Abstract

Deploying Large Language Models (LLMs) for discriminative workloads is often limited by inference latency, compute, and API costs at scale. Active distillation reduces these costs by querying an LLM oracle to train compact discriminative students, but most pipelines distill only final labels, discarding intermediate reasoning signals and offering limited diagnostics of what reasoning is missing and where errors arise. We propose Graph of Concept Predictors (GCP), a reasoning-aware active distillation framework that externalizes the teacher's decision process as a directed acyclic graph and mirrors it with modular concept predictors in the student. GCP enhances sample efficiency through a graph-aware acquisition strategy that targets uncertainty and disagreement at critical reasoning nodes. Additionally, it improves training stability and efficiency by performing targeted sub-module…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Ziyang-Yu/GCP
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.