Do Quantum Transformers Help? A Systematic VQC Architecture Comparison on Tabular Benchmarks
Chi-Sheng Chen, En-Jui Kuo

TL;DR
This paper systematically compares various variational quantum circuit architectures on tabular data benchmarks, revealing insights into their accuracy, parameter efficiency, and robustness, guiding practical deployment on near-term quantum hardware.
Contribution
It provides a comprehensive empirical evaluation of four VQC families, highlighting the effectiveness of simple fully-connected circuits and the role of normalization in quantum transformers.
Findings
FC-VQCs achieve 90-96% of attention-based VQCs' R^2 with fewer parameters.
Explicit quantum self-attention offers marginal gains but increases complexity.
Shallow VQCs (depth~3) effectively cover the Hilbert space.
Abstract
Variational quantum circuits (VQCs) are a leading approach to quantum machine learning on near-term devices, yet it remains unclear which circuit architecture yields the best accuracy-parameter trade-off on classical tabular data. We present a systematic empirical comparison of four VQC families -- multi-layer fully-connected (FC-VQC), residual (ResNet-VQC), hybrid quantum-classical transformer (QT), and fully quantum transformer (FQT) -- across five regression and classification benchmarks. Our key findings are: \textbf{(i)}~FC-VQCs achieve 90-96\% of the of attention-based VQCs while using 40-50\% fewer parameters, and consistently outperform equal-capacity MLPs (mean vs.\ MLP's on Boston Housing, 3-seed average); \textbf{(ii)}~FC-VQC's Type~4 inter-block connectivity provides partial cross-token mixing that approximates the role of attention --…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
