{\mu}RL: Discovering Transient Execution Vulnerabilities Using Reinforcement Learning
M. Caner Tol, Kemal Derya, Berk Sunar

TL;DR
This paper introduces a reinforcement learning-based method to efficiently discover microarchitectural vulnerabilities like Spectre and Meltdown, outperforming traditional fuzzing techniques by learning from real-time feedback across various processor architectures.
Contribution
The paper presents a novel RL framework for discovering hardware vulnerabilities, demonstrating its effectiveness on Intel processors and uncovering new transient execution leakages.
Findings
RL approach successfully identified new transient execution vulnerabilities.
Method outperforms random fuzzing in efficiency and coverage.
Detected vulnerabilities include previously unknown instruction sequences.
Abstract
We propose using reinforcement learning to address the challenges of discovering microarchitectural vulnerabilities, such as Spectre and Meltdown, which exploit subtle interactions in modern processors. Traditional methods like random fuzzing fail to efficiently explore the vast instruction space and often miss vulnerabilities that manifest under specific conditions. To overcome this, we introduce an intelligent, feedback-driven approach using RL. Our RL agents interact with the processor, learning from real-time feedback to prioritize instruction sequences more likely to reveal vulnerabilities, significantly improving the efficiency of the discovery process. We also demonstrate that RL systems adapt effectively to various microarchitectures, providing a scalable solution across processor generations. By automating the exploration process, we reduce the need for human intervention,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Security and Verification in Computing · Software Testing and Debugging Techniques
