Resource-Aware Replication on Heterogeneous Multicores: Challenges and Opportunities
Bj\"orn D\"obel, Robert Muschner, Hermann H\"artig

TL;DR
This paper discusses the development of ROMAIN, an OS service for redundant multithreading on heterogeneous multicore systems, addressing hardware unreliability and exploring adaptation challenges and opportunities.
Contribution
Introduction of ROMAIN, a novel OS service for fault-tolerant multithreading on heterogeneous multicore platforms, with analysis of adaptation challenges and potential benefits.
Findings
ROMAIN reduces execution overhead in fault-tolerant multithreading.
It addresses resource constraints in heterogeneous multicore environments.
The paper identifies key challenges and opportunities for adapting ROMAIN to diverse platforms.
Abstract
Decreasing hardware feature sizes and increasing heterogeneity in multicore hardware require software that can adapt to these platforms' properties. We implemented ROMAIN, an OS service providing redundant multithreading on top of the FIASCO.OC microkernel to address the increasing unreliability of hardware. In this paper we review challenges and opportunities for ROMAIN to adapt to such multicore platforms in order to decrease execution overhead, resource requirements, and vulnerability against faults.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Parallel Computing and Optimization Techniques · Radiation Effects in Electronics
