Characterizing and Improving the Resilience of Accelerators in Autonomous Robots
Deval Shah, Zi Yu Xue, Karthik Pattabiraman, and Tor M. Aamodt

TL;DR
This paper introduces the Collision Exposure Factor (CEF), a new metric to evaluate and improve the soft-error resilience of motion planning accelerators in autonomous robots, reducing failure rates and aiding fault characterization.
Contribution
The paper proposes CEF as a novel vulnerability metric, demonstrates its effectiveness in error mitigation, fault characterization, and provides design guidelines for resilient MPAs.
Findings
CEF correlates with safety violation probability.
CEF-aware mitigation reduces failure rates by up to 12.3x.
CEF enables fault characterization with 23,000x fewer experiments.
Abstract
Motion planning is a computationally intensive and well-studied problem in autonomous robots. However, motion planning hardware accelerators (MPA) must be soft-error resilient for deployment in safety-critical applications, and blanket application of traditional mitigation techniques is ill-suited due to cost, power, and performance overheads. We propose Collision Exposure Factor (CEF), a novel metric to assess the failure vulnerability of circuits processing spatial relationships, including motion planning. CEF is based on the insight that the safety violation probability increases with the surface area of the physical space exposed by a bit-flip. We evaluate CEF on four MPAs. We demonstrate empirically that CEF is correlated with safety violation probability, and that CEF-aware selective error mitigation provides 12.3x, 9.6x, and 4.2x lower Failures-In-Time (FIT) rate on average for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiation Effects in Electronics · Security and Verification in Computing · Physical Unclonable Functions (PUFs) and Hardware Security
