Theta-Resonance: A Single-Step Reinforcement Learning Method for Design   Space Exploration

Masood S. Mortazavi; Tiancheng Qin; Ning Yan

arXiv:2211.02052·cs.LG·November 18, 2022

Theta-Resonance: A Single-Step Reinforcement Learning Method for Design Space Exploration

Masood S. Mortazavi, Tiancheng Qin, Ning Yan

PDF

Open Access

TL;DR

Theta-Resonance introduces a single-step reinforcement learning approach using neural networks to efficiently explore design spaces by producing optimal samples based on evaluation feedback, with robust stability and minimal evaluations.

Contribution

The paper presents a novel single-step RL method, Theta-Resonance, for design space exploration that specializes policy gradient algorithms for efficient, stable sampling in categorical, continuous, and mixed spaces.

Findings

01

Effective policy network architectures identified for simple SoC design space.

02

Synthetic space exploration problems enable comparison and improvement of DSE algorithms.

03

Method outlined for extending Theta-Resonance to continuous and mixed spaces.

Abstract

Given an environment (e.g., a simulator) for evaluating samples in a specified design space and a set of weighted evaluation metrics -- one can use Theta-Resonance, a single-step Markov Decision Process (MDP), to train an intelligent agent producing progressively more optimal samples. In Theta-Resonance, a neural network consumes a constant input tensor and produces a policy as a set of conditional probability density functions (PDFs) for sampling each design dimension. We specialize existing policy gradient algorithms in deep reinforcement learning (D-RL) in order to use evaluation feedback (in terms of cost, penalty or reward) to update our policy network with robust algorithmic stability and minimal design evaluations. We study multiple neural architectures (for our policy network) within the context of a simple SoC design space and propose a method of constructing synthetic space…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBIM and Construction Integration