Critic-Driven Voronoi-Quantization for Distilling Deep RL Policies to Explainable Models

Senne Deproost; Denis Steckelmacher; Ann Now\'e

arXiv:2605.14897·cs.LG·May 15, 2026

Critic-Driven Voronoi-Quantization for Distilling Deep RL Policies to Explainable Models

Senne Deproost, Denis Steckelmacher, Ann Now\'e

PDF

TL;DR

This paper introduces a critic-driven Voronoi quantization method for distilling deep RL policies into interpretable models, balancing performance and interpretability by leveraging the critic network.

Contribution

It proposes a novel, model-agnostic Voronoi-based partitioning technique that uses the critic to guide the creation of simple, interpretable policies from complex RL models.

Findings

01

Successfully distills policies with a small set of linear functions

02

Outperforms traditional distillation in balancing interpretability and performance

03

Validated on several well-known benchmarks

Abstract

Despite many successful attempts at explaining Deep Reinforcement Learning policies using distillation, it remains difficult to balance the performance-interpretability trade-off and select a fitting surrogate model. In addition to this, traditional distillation only minimizes the distance between the behavior of the original and the surrogate policy while other RL-specific components such as action value are disregarded. To solve this, we introduce a new model-agnostic method called Critic-Driven Voronoi State Partitioning, which partitions a black box control policy into regions where a simple class of model can be optimized using gradient descent. By exploiting the critic value network of the original policy, we iteratively introduce new subpolicies in regions with insufficient value, standing in for a measure of policy complexity. The partitioning, a Voronoi quantizer, uses nearest…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.