AGDC: Autoregressive Generation of Variable-Length Sequences with Joint Discrete and Continuous Spaces

Yeonsang Shin; Insoo Kim; Bongkeun Kim; Keonwoo Bae; Bohyung Han

arXiv:2601.05680·cs.LG·January 12, 2026

AGDC: Autoregressive Generation of Variable-Length Sequences with Joint Discrete and Continuous Spaces

Yeonsang Shin, Insoo Kim, Bongkeun Kim, Keonwoo Bae, Bohyung Han

PDF

Open Access 3 Reviews

TL;DR

AGDC introduces a unified autoregressive framework that jointly models discrete and continuous data, enabling high-precision sequence generation in complex domains like semiconductor design, surpassing existing discretization-based methods.

Contribution

The paper proposes AGDC, a novel hybrid model combining categorical and diffusion-based approaches with dynamic EOS adjustment and length regularization for scalable high-precision sequence generation.

Findings

01

AGDC outperforms baselines in semiconductor layout generation.

02

The model achieves high fidelity in diverse vector data.

03

Experiments demonstrate scalability and precision in complex domains.

Abstract

Transformer-based autoregressive models excel in data generation but are inherently constrained by their reliance on discretized tokens, which limits their ability to represent continuous values with high precision. We analyze the scalability limitations of existing discretization-based approaches for generating hybrid discrete-continuous sequences, particularly in high-precision domains such as semiconductor circuit designs, where precision loss can lead to functional failure. To address the challenge, we propose AGDC, a novel unified framework that jointly models discrete and continuous values for variable-length sequences. AGDC employs a hybrid approach that combines categorical prediction for discrete values with diffusion-based modeling for continuous values, incorporating two key technical components: an end-of-sequence (EOS) logit adjustment mechanism that uses an MLP to…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 2Confidence 2

Strengths

* Joint models of discrete and continuous values in autoregressive model is useful for modeling and generating complex data. * The paper proposes a new benchmark for circuit layout. * The experiments are showing that the encoding and generation work on multiple problems, indicating the generality of the proposed approach. Out of the three parts of the experiment, the circuit layout is the most interesting case study.

Weaknesses

* The paper is light on theoretical contributions. The technical approach seems as a combination of known models, which translates to explaining “how” the approach works, leaving “why” out. * The motivation behind EOS logit adjustment was not clear. * The results on the chip layout problem seem notable. The table shows improvement in several existing metrics in the domain. However, these metrics seem very problem specific and their motivation is not well explained for the general audience. Th

Reviewer 02Rating 6Confidence 3

Strengths

**S1.** The proposed methodology is conceptually sound and neat. **S2.** The paper introduces ContLayNet, a real-world large-scale dataset of semiconductor layout samples. **S3.** Empirical studies (quantitative and qualitative) demonstrate the effectiveness of the proposed approach, with ablation studies provided for various design choices. **S4.** The paper is overall easy to follow.

Weaknesses

**W1.** The baselines are limited. Even though some generative models were not designed towards layout generation in particular, they may still be adaptable to this scenario. **W2.** The paper does not discuss existing papers that combine autoregressive models and diffusion models. E.g., - Chen et al. Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion. NeurIPS 2024. - Zhao et al. Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation. NeurIPS 2024. - Li

Reviewer 03Rating 4Confidence 2

Strengths

1. The paper identifies a critical and practical limitation of discretization in generative models. The proposed hybrid approach of combining autoregressive prediction with an inner loop of diffusion sampling is an elegant and novel method to preserve continuous precision. 2. The introduction of the ContLayNet benchmark and its specialized DRC-based evaluation metrics is a major contribution. This provides a much-needed and challenging testbed for a problem space that has been underserved.

Weaknesses

1. A major concern is the inference speed. The model must run an iterative diffusion sampling process (which is itself multi-step) for every single autoregressive step. This seems computationally prohibitive and likely orders of magnitude slower than discretization-based autoregressive models, potentially limiting its practical utility. 2. It remains unclear what this paper contributes to the machine learning (generative models) community.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · VLSI and FPGA Design Techniques · Advanced Neural Network Applications