Scaling Quantum Machine Learning without Tricks: High-Resolution and Diverse Image Generation
Jonas J\"ager, Florian J. Kiwit, Carlos A. Riofr\'io

TL;DR
This paper demonstrates a novel quantum generative model capable of producing high-resolution, diverse images across multiple datasets without relying on traditional tricks, marking a significant advancement in quantum machine learning.
Contribution
The authors introduce a quantum Wasserstein GAN that generates full-resolution images from complete datasets, surpassing previous limitations and establishing a new state-of-the-art in quantum image generation.
Findings
Achieved high-resolution image generation on MNIST and Fashion-MNIST datasets.
Extended approach to color images with Street View House Numbers dataset.
Maintained performance under quantum shot noise conditions.
Abstract
Quantum generative modeling is a rapidly evolving discipline at the intersection of quantum computing and machine learning. Contemporary quantum machine learning is generally limited to toy examples or heavily restricted datasets with few elements. This is not only due to the current limitations of available quantum hardware but also due to the absence of inductive biases arising from application-agnostic designs. Current quantum solutions must resort to tricks to scale down high-resolution images, such as relying heavily on dimensionality reduction or utilizing multiple quantum models for low-resolution image patches. Building on recent developments in classical image loading to quantum computers, we circumvent these limitations and train quantum Wasserstein GANs on the established classical MNIST and Fashion-MNIST datasets. Using the complete datasets, our system generates…
Peer Reviews
Decision·Submitted to ICLR 2026
- The paper identifies and addresses a primary challenge in QML: the inability of most quantum models to handle high-dimensional classical data directly. - The ablation between task-agnostic vs. FRQI-aligned circuits is convincing and highlights the importance of structured ansatz in QML. The ablation study in Figure 4 successfully demonstrates that a circuit ansatz specifically designed for the FRQI encoding produces higher-quality images than a generic ansatz, though this result is somewhat ex
- All results are purely classical simulations, making claims of scalability speculative. The proposed models use 11-13 qubits and 64-layer deep circuits with >10k parameters, which are far beyond the capabilities of NISQ hardware. No evidence is provided that these circuits can be trained or executed on real devices or even on moderate-scale noisy simulators. No evidence is provided that these circuits are not suffering from the barren plateau phenomenon. - The entire premise of the paper's "s
1. One key strength of this paper is the extensive experimental validation across multiple full-resolution datasets, including MNIST, Fashion-MNIST, and SVHN. 2. The authors demonstrate strong empirical performance, showcasing the scalability and robustness of their quantum generative model. 3. The work also stands out for eliminating classical post-processing and relying solely on quantum-native design, which enhances its novelty and practical relevance.
The paper lacks a clear and detailed structural presentation of the proposed framework. As a result, the overall methodology remains vague, making it difficult for readers to grasp the model’s design and workflow. The core approach builds on Quantum Wasserstein GANs, which were introduced by Chakrabarti et al. in 2019 (NeurIPS 32). However, the current work does not offer substantial architectural innovation beyond that baseline, raising concerns about novelty. Although the authors present ext
Scaling is one of the most important issue regarding to the quantum machine learning models. Though heuristic, it is good to see that the work makes some positive progress towards this point. The overmoding, i.e., more input noise modes, inspired by the classical over-parameterization, seems to be interesting. Shown as Fig.6 in the paper, the numerical result agrees with the authors' argument that increasing mode can increase the model performance. The size for QGAN models with 64 layers and 40
The weakness of this paper is that it does have any theoretical analysis to support their claim, and also the method is model-specific, and thus it is not that clear how good this method will be.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Quantum-Dot Cellular Automata
