Projecting Latent RL Actions: Towards Generalizable and Scalable Graph Combinatorial Optimization

Franco Terranova (UL; LORIA; Inria); Guillermo Bernardez (UC Santa Barbara); Albert Cabellos-Aparicio (UPC); Nina Miolane (UC Santa Barbara); Abdelkader Lahmadi (LORIA; UL; Inria)

arXiv:2605.19721·cs.AI·May 20, 2026


TL;DR

This paper introduces projection agents, an RL-GCO method that predicts an action in a continuous latent space and then decodes it into a valid discrete action, reporting up to 16.2x faster inference and up to 40% better generalization on graph combinatorial optimization.

Contribution

The authors propose an RL-GCO approach that acts in a latent action space to improve scalability and generalization, and they provide a Python library for reproducibility.

Findings

01 Achieves up to 16.2x faster inference

02 Improves generalization by up to 40%

03 Supports super-linear decision spaces with interdependent variables

Abstract

Graph combinatorial optimization (GCO) has attracted growing interest, as many NP-hard problems naturally admit graph formulations, yet their combinatorial explosion renders exact methods computationally intractable. Recent advances in Reinforcement Learning (RL) combined with Graph Neural Networks (GNNs) have significantly improved learning-based GCO solvers. However, existing approaches face limitations in both generalization across diverse graph instances and computational scalability as action spaces grow. To address both challenges, we introduce projection agents, a novel RL-GCO approach that operates directly in a continuous GNN-based action embedding space, predicting a desired latent action in a single forward pass and subsequently decoding it into a valid discrete action. Additionally, we enable fair comparison across RL methods through a shared embedding space for both…
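
The predict-then-decode idea described in the abstract can be illustrated with a minimal sketch. It assumes decoding is done by picking the valid candidate action whose embedding lies closest (in Euclidean distance) to the predicted latent action; the abstract does not specify the decoding rule, so this nearest-neighbor choice, the function name decode_latent_action, and the toy embeddings are all hypothetical and not taken from the authors' library.

```python
import math
from typing import Sequence


def decode_latent_action(
    latent: Sequence[float],
    action_embeddings: Sequence[Sequence[float]],
    valid_mask: Sequence[bool],
) -> int:
    """Return the index of the valid action closest to the latent action.

    Assumption: nearest-neighbor decoding in Euclidean distance. The paper's
    actual decoder may use a different rule.
    """
    best_index = None
    best_distance = math.inf
    for index, (embedding, is_valid) in enumerate(zip(action_embeddings, valid_mask)):
        if not is_valid:
            continue  # skip actions that are not allowed in the current state
        distance = math.dist(latent, embedding)
        if distance < best_distance:
            best_index, best_distance = index, distance
    if best_index is None:
        raise ValueError("no valid action available")
    return best_index


# Toy example: four candidate actions embedded in a 2-D space (made-up values).
action_embeddings = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
latent = (0.9, 0.2)  # stands in for the policy's single forward-pass output

chosen = decode_latent_action(latent, action_embeddings, [True, True, True, True])
chosen_masked = decode_latent_action(latent, action_embeddings, [True, False, True, True])
print(chosen, chosen_masked)
```

In this sketch the policy produces one vector regardless of how many candidate actions exist, and the mask keeps the decoded action valid; when the closest action is masked out, the decoder falls back to the next closest valid one.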
