AGDC: Autoregressive Generation of Variable-Length Sequences with Joint Discrete and Continuous Spaces
Yeonsang Shin, Insoo Kim, Bongkeun Kim, Keonwoo Bae, Bohyung Han

TL;DR
AGDC introduces a unified autoregressive framework that jointly models discrete and continuous data, enabling high-precision sequence generation in complex domains like semiconductor design, surpassing existing discretization-based methods.
Contribution
The paper proposes AGDC, a novel hybrid model combining categorical and diffusion-based approaches with dynamic EOS adjustment and length regularization for scalable high-precision sequence generation.
Findings
AGDC outperforms baselines in semiconductor layout generation.
The model achieves high fidelity in diverse vector data.
Experiments demonstrate scalability and precision in complex domains.
Abstract
Transformer-based autoregressive models excel in data generation but are inherently constrained by their reliance on discretized tokens, which limits their ability to represent continuous values with high precision. We analyze the scalability limitations of existing discretization-based approaches for generating hybrid discrete-continuous sequences, particularly in high-precision domains such as semiconductor circuit designs, where precision loss can lead to functional failure. To address the challenge, we propose AGDC, a novel unified framework that jointly models discrete and continuous values for variable-length sequences. AGDC employs a hybrid approach that combines categorical prediction for discrete values with diffusion-based modeling for continuous values, incorporating two key technical components: an end-of-sequence (EOS) logit adjustment mechanism that uses an MLP to…
Peer Reviews
Decision·Submitted to ICLR 2026
* Joint models of discrete and continuous values in autoregressive model is useful for modeling and generating complex data. * The paper proposes a new benchmark for circuit layout. * The experiments are showing that the encoding and generation work on multiple problems, indicating the generality of the proposed approach. Out of the three parts of the experiment, the circuit layout is the most interesting case study.
* The paper is light on theoretical contributions. The technical approach seems as a combination of known models, which translates to explaining “how” the approach works, leaving “why” out. * The motivation behind EOS logit adjustment was not clear. * The results on the chip layout problem seem notable. The table shows improvement in several existing metrics in the domain. However, these metrics seem very problem specific and their motivation is not well explained for the general audience. Th
**S1.** The proposed methodology is conceptually sound and neat. **S2.** The paper introduces ContLayNet, a real-world large-scale dataset of semiconductor layout samples. **S3.** Empirical studies (quantitative and qualitative) demonstrate the effectiveness of the proposed approach, with ablation studies provided for various design choices. **S4.** The paper is overall easy to follow.
**W1.** The baselines are limited. Even though some generative models were not designed towards layout generation in particular, they may still be adaptable to this scenario. **W2.** The paper does not discuss existing papers that combine autoregressive models and diffusion models. E.g., - Chen et al. Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion. NeurIPS 2024. - Zhao et al. Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation. NeurIPS 2024. - Li
1. The paper identifies a critical and practical limitation of discretization in generative models. The proposed hybrid approach of combining autoregressive prediction with an inner loop of diffusion sampling is an elegant and novel method to preserve continuous precision. 2. The introduction of the ContLayNet benchmark and its specialized DRC-based evaluation metrics is a major contribution. This provides a much-needed and challenging testbed for a problem space that has been underserved.
1. A major concern is the inference speed. The model must run an iterative diffusion sampling process (which is itself multi-step) for every single autoregressive step. This seems computationally prohibitive and likely orders of magnitude slower than discretization-based autoregressive models, potentially limiting its practical utility. 2. It remains unclear what this paper contributes to the machine learning (generative models) community.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · VLSI and FPGA Design Techniques · Advanced Neural Network Applications
