Toward Self-Healing Networks-on-Chip: RL-Driven Routing in 2D Torus Architectures

Mohammad Walid Charrwi; Zaid Hussain

arXiv:2512.13096·cs.DC·December 16, 2025

Toward Self-Healing Networks-on-Chip: RL-Driven Routing in 2D Torus Architectures

Mohammad Walid Charrwi, Zaid Hussain

PDF

Open Access

TL;DR

This paper demonstrates that reinforcement learning-based routing in 2D torus Networks-on-Chip significantly improves throughput and fault resilience compared to traditional adaptive routing, especially under high fault conditions.

Contribution

It introduces an RL-driven routing strategy for NoCs that outperforms fixed adaptive schemes in fault tolerance and throughput, with detailed simulation validation.

Findings

01

RL routing achieves 20-30% higher throughput at high load.

02

RL maintains PDR above 90% up to 30-40% faults.

03

RL preserves network connectivity longer under faults.

Abstract

We investigate adaptive minimal routing in 2D torus networks on chip NoCs under node fault conditions comparing a reinforcement learning RL based strategy to an adaptive routing baseline A torus topology is used for its low diameter high connectivity properties The RL approach models each router as an agent that learns to forward packets based on network state while the adaptive scheme uses fixed minimal paths with simple rerouting around faults We implement both methods in simulation injecting up to 50 node faults uniformly at random Key metrics are measured 1 throughput vs offered load at fault density 02 2 packet delivery ratio PDR vs fault density and 3 a fault adaptive score FT vs fault density Experimental results show the RL method achieves significantly higher throughput at high load approximately 2030 gain and maintains higher reliability under increasing faults The RL router…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInterconnection Networks and Systems · Supercapacitor Materials and Fabrication · Quantum-Dot Cellular Automata