FASHION: Fault-Aware Self-Healing Intelligent On-chip Network
Pengju Ren, Michel A. Kinsy, Mengjiao Zhu, Shreeya Khadka, Mihailo Isakov, Aniruddh Ramrakhyani, Tushar Krishna, Nanning Zheng

TL;DR
The paper presents Fashion, a fault-aware self-healing on-chip network router that dynamically detects and reconfigures around faults, significantly improving fault tolerance and network resilience in multicore systems.
Contribution
Introduces the Fashion router with a distributed self-awareness module and reconfiguration capabilities, enhancing fault tolerance without topology restrictions.
Findings
Reduces node drops by 54.3-55.4% under faults.
Maintains connectivity and deadlock-free routing.
Has low area overheads of around 2.3%.
Abstract
To avoid packet loss and deadlock scenarios that arise due to faults or power gating in multicore and many-core systems, the network-on-chip needs to possess resilient communication and load-balancing properties. In this work, we introduce the Fashion router, a self-monitoring and self-reconfiguring design that allows for the on-chip network to dynamically adapt to component failures. First, we introduce a distributed intelligence unit, called Self-Awareness Module (SAM), which allows the router to detect permanent component failures and build a network connectivity map. Using local information, SAM adapts to faults, guarantees connectivity and deadlock-free routing inside the maximal connected subgraph and keeps routing tables up-to-date. Next, to reconfigure network links or virtual channels around faulty/power-gated components, we add bidirectional link and unified virtual channel…
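The abstract says SAM guarantees connectivity "inside the maximal connected subgraph". As a rough illustration of that idea only, the sketch below finds the largest connected set of routers in a 2D mesh after some links have failed. It is not the paper's SAM algorithm: it is a centralized breadth-first search, whereas SAM is described as distributed and working from local information. The mesh topology, the function names `mesh_links` and `largest_connected_subgraph`, and the fault pattern are all assumptions made for the example.

```python
from collections import deque


def mesh_links(width, height):
    """Return the undirected links of a width x height 2D mesh (assumed topology)."""
    links = set()
    for x in range(width):
        for y in range(height):
            if x + 1 < width:
                links.add(frozenset({(x, y), (x + 1, y)}))
            if y + 1 < height:
                links.add(frozenset({(x, y), (x, y + 1)}))
    return links


def largest_connected_subgraph(nodes, links):
    """Return the largest set of nodes mutually reachable over the given links.

    Hypothetical helper: a plain centralized BFS, not the distributed
    mechanism the paper attributes to SAM.
    """
    adjacency = {node: set() for node in nodes}
    for link in links:
        a, b = tuple(link)
        # Ignore links that touch a router not in the node list (e.g. a failed router).
        if a in adjacency and b in adjacency:
            adjacency[a].add(b)
            adjacency[b].add(a)

    best = set()
    seen = set()
    for start in nodes:
        if start in seen:
            continue
        # Breadth-first search collects every router reachable from `start`.
        component = {start}
        queue = deque([start])
        while queue:
            current = queue.popleft()
            for neighbor in adjacency[current]:
                if neighbor not in component:
                    component.add(neighbor)
                    queue.append(neighbor)
        seen |= component
        if len(component) > len(best):
            best = component
    return best


if __name__ == "__main__":
    nodes = [(x, y) for x in range(3) for y in range(3)]
    # Assumed fault pattern: both links of corner router (0, 0) have failed.
    faulty = {frozenset({(0, 0), (1, 0)}), frozenset({(0, 0), (0, 1)})}
    healthy_links = mesh_links(3, 3) - faulty
    print(sorted(largest_connected_subgraph(nodes, healthy_links)))
```

In this example the corner router is cut off, so the surviving subgraph contains the other eight routers. The sketch covers reachability only; deadlock-free routing within that subgraph is a separate problem that it does not address.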
Taxonomy
Topics: Interconnection Networks and Systems · Advanced Memory and Neural Computing · Radiation Effects in Electronics
