Theta-Resonance: A Single-Step Reinforcement Learning Method for Design Space Exploration
Masood S. Mortazavi, Tiancheng Qin, Ning Yan

TL;DR
Theta-Resonance introduces a single-step reinforcement learning approach using neural networks to efficiently explore design spaces by producing optimal samples based on evaluation feedback, with robust stability and minimal evaluations.
Contribution
The paper presents a novel single-step RL method, Theta-Resonance, for design space exploration that specializes policy gradient algorithms for efficient, stable sampling in categorical, continuous, and mixed spaces.
Findings
Effective policy network architectures identified for simple SoC design space.
Synthetic space exploration problems enable comparison and improvement of DSE algorithms.
Method outlined for extending Theta-Resonance to continuous and mixed spaces.
Abstract
Given an environment (e.g., a simulator) for evaluating samples in a specified design space and a set of weighted evaluation metrics -- one can use Theta-Resonance, a single-step Markov Decision Process (MDP), to train an intelligent agent producing progressively more optimal samples. In Theta-Resonance, a neural network consumes a constant input tensor and produces a policy as a set of conditional probability density functions (PDFs) for sampling each design dimension. We specialize existing policy gradient algorithms in deep reinforcement learning (D-RL) in order to use evaluation feedback (in terms of cost, penalty or reward) to update our policy network with robust algorithmic stability and minimal design evaluations. We study multiple neural architectures (for our policy network) within the context of a simple SoC design space and propose a method of constructing synthetic space…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBIM and Construction Integration
