The False Dawn: Reevaluating Google's Reinforcement Learning for Chip Macro Placement
Igor L. Markov

TL;DR
This paper critically reevaluates Google's reinforcement learning approach for chip placement, revealing it underperforms compared to traditional methods and exposing issues in the original Nature publication's methodology and reporting.
Contribution
It provides a comprehensive meta-analysis that challenges the claims of Google's RL method, highlighting its inferior performance and methodological flaws.
Findings
Google RL lags behind human designers
RL is slower than Simulated Annealing and commercial software
Google's Nature paper contains significant errors and omissions
Abstract
Reinforcement learning (RL) for physical design of silicon chips in a Google 2021 Nature paper stirred controversy due to poorly documented claims that raised eyebrows and drew critical media coverage. The paper withheld critical methodology steps and most inputs needed to reproduce results. Our meta-analysis shows how two separate evaluations filled in the gaps and demonstrated that Google RL lags behind (i) human designers, (ii) a well-known algorithm (Simulated Annealing), and (iii) generally-available commercial software, while being slower; and in a 2023 open research contest, RL methods weren't in top 5. Crosschecked data indicate that the integrity of the Nature paper is substantially undermined owing to errors in conduct, analysis and reporting. Before publishing, Google rebuffed internal allegations of fraud, which still stand. We note policy implications and conclusions for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpen Source Software Innovations
