Network Aware Compute and Memory Allocation in Optically Composable Data Centres with Deep Reinforcement Learning and Graph Neural Networks
Zacharaya Shabka, Georgios Zervas

TL;DR
This paper introduces a deep reinforcement learning approach using graph neural networks for resource allocation in optically composable data centres, achieving higher efficiency and scalability than existing heuristics.
Contribution
It presents a novel end-to-end method combining deep reinforcement learning and graph neural networks for scalable, network-aware resource allocation in disaggregated data centres.
Findings
Up to 20% higher acceptance ratio compared to heuristics.
Achieves same acceptance with 3x less networking resources.
Maintains performance on topologies 100x larger than training data.
Abstract
Resource-disaggregated data centre architectures promise a means of pooling resources remotely within data centres, allowing for both more flexibility and resource efficiency underlying the increasingly important infrastructure-as-a-service business. This can be accomplished by means of using an optically circuit switched backbone in the data centre network (DCN); providing the required bandwidth and latency guarantees to ensure reliable performance when applications are run across non-local resource pools. However, resource allocation in this scenario requires both server-level \emph{and} network-level resource to be co-allocated to requests. The online nature and underlying combinatorial complexity of this problem, alongside the typical scale of DCN topologies, makes exact solutions impossible and heuristic based solutions sub-optimal or non-intuitive to design. We demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optical Network Technologies · Cloud Computing and Resource Management · Optical Network Technologies
