Multi-conditioned Graph Diffusion for Neural Architecture Search

Rohan Asthana; Joschua Conrad; Youssef Dawoud; Maurits Ortmanns,; Vasileios Belagiannis

arXiv:2403.06020·cs.LG·March 25, 2024·1 cites

Multi-conditioned Graph Diffusion for Neural Architecture Search

Rohan Asthana, Joschua Conrad, Youssef Dawoud, Maurits Ortmanns,, Vasileios Belagiannis

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces a differentiable graph diffusion-based neural architecture search method that efficiently generates high-performance architectures with constraints like low latency, demonstrating fast speed and strong results on multiple benchmarks.

Contribution

The paper proposes a novel, fully differentiable graph diffusion approach for neural architecture search that requires only one training process and effectively incorporates multiple constraints.

Findings

01

Achieves high-quality architectures in less than 0.2 seconds per architecture.

02

Demonstrates strong performance on six standard benchmarks.

03

Shows good generalizability and efficiency on ImageNet.

Abstract

Neural architecture search automates the design of neural network architectures usually by exploring a large and thus complex architecture search space. To advance the architecture search, we present a graph diffusion-based NAS approach that uses discrete conditional graph diffusion processes to generate high-performing neural network architectures. We then propose a multi-conditioned classifier-free guidance approach applied to graph diffusion networks to jointly impose constraints such as high accuracy and low hardware latency. Unlike the related work, our method is completely differentiable and requires only a single model training. In our evaluations, we show promising results on six standard benchmarks, yielding novel and unique architectures at a fast speed, i.e. less than 0.2 seconds per architecture. Furthermore, we demonstrate the generalisability and efficiency of our method…

Peer Reviews

Decision·ICLR 2024 Conference Withdrawn Submission

Reviewer 01Rating 3· reject, not good enoughConfidence 4

Strengths

1. This paper is well-written and easy to follow. 2. Using graph diffusion to generate architectures is a reasonable idea.

Weaknesses

1. The method is very straightforward. The method designs lack the relation with NAS problems, making the technical contribution limited. 2. There has been diffusion-based NAS method [1], but the paper does not mention the difference between the two methods. 3. All experiment are conducted on NAS benchmarks. In my opinion, NAS benchmarks provide quick feedback for developing new NAS methods, but NAS methods still need evaluation out of NAS benchmark to avoid overfitting on these benchmarks. 4. I

Reviewer 02Rating 5· marginally below the acceptance thresholdConfidence 3

Strengths

- This paper is well written and easy to follow. - This paper is the first work to introduce classifier-free diffusion model for neural architecture generation. - This paper conduct extensive experiments on various benchmarks.

Weaknesses

The reviewer's main concerns are as follows: - Lack of experimental support for the contribution - The proposed method introduces a multi-conditioned diffusion guidance technique as one of its primary contributions. However, most experiments have been conducted with a single objective, primarily focusing on accuracy. This limitation makes it challenging to have confidence in the claim about the capabilities of the proposed method. To address this issue, it is recommended to conduct addition

Reviewer 03Rating 3· reject, not good enoughConfidence 5

Strengths

- They present a multi-conditioned graph diffusion model. - They conducted experiments not only in the vision domain but also in the NLP domain.

Weaknesses

- The authors insist that the potential of classifier-free guidance in graph diffusion networks remains unexplored (and also said that the current classifier-free guidance approaches operate only on the image synthesis and are single-conditioned), but there are some works exploring classifier-free guidance in graph diffusion networks e.g., [1]. This is not an exact statement. - Many core ideas overlap with the related work [2] that the authors did not mention, such as the first adoption of dif

Code & Models

Repositories

rohanasthana/dinas
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Graph Theory and Algorithms · Advanced Graph Neural Networks

MethodsDiffusion