Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks
George Watkins, Giovanni Montana, and Juergen Branke

TL;DR
This paper explores using deep reinforcement learning with graph neural networks to develop a new heuristic for the graph colouring problem, showing promising results compared to existing algorithms.
Contribution
It introduces ReLCol, a novel deep Q-learning based heuristic that leverages graph neural networks and a new graph parameterization for improved performance.
Findings
ReLCol outperforms some existing heuristics on benchmark graphs.
Deep reinforcement learning shows potential for graph colouring heuristics.
The approach has limitations on certain graph topologies.
Abstract
The graph colouring problem consists of assigning labels, or colours, to the vertices of a graph such that no two adjacent vertices share the same colour. In this work we investigate whether deep reinforcement learning can be used to discover a competitive construction heuristic for graph colouring. Our proposed approach, ReLCol, uses deep Q-learning together with a graph neural network for feature extraction, and employs a novel way of parameterising the graph that results in improved performance. Using standard benchmark graphs with varied topologies, we empirically evaluate the benefits and limitations of the heuristic learned by ReLCol relative to existing construction algorithms, and demonstrate that reinforcement learning is a promising direction for further research on the graph colouring problem.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Timetabling Solutions
MethodsGraph Neural Network · Q-Learning
