Model selection in hybrid quantum neural networks with applications to quantum transformer architectures
Harsh Wadhwa, Rahul Bhowmick, Naipunnya Raj, Rajiv Sangle, Ruchira V. Bhat, Krishnakumar Sabapathy

TL;DR
This paper introduces the Quantum Bias-Expressivity Toolbox ($ exttt{QBET}$), a framework for evaluating hybrid quantum-classical transformer models, enabling efficient model selection and demonstrating quantum advantages in specific tasks.
Contribution
The paper develops $ exttt{QBET}$ with novel metrics for bias and expressivity, facilitating pre-screening of quantum models without full training, and applies it to quantum transformer architectures.
Findings
Quantum self-attention can outperform classical models in certain scenarios.
$ exttt{QBET}$ effectively predicts promising quantum model variants.
Efficient evaluation reduces resource-intensive training processes.
Abstract
Quantum machine learning models generally lack principled design guidelines, often requiring full resource-intensive training across numerous choices of encodings, quantum circuit designs and initialization strategies to find effective configuration. To address this challenge, we develope the Quantum Bias-Expressivity Toolbox (), a framework for evaluating quantum, classical, and hybrid transformer architectures. In this toolbox, we introduce lean metrics for Simplicity Bias () and Expressivity (), for comparing across various models, and extend the analysis of to generative and multiclass-classification tasks. We show that enables efficient pre-screening of promising model variants obviating the need to execute complete training pipelines. In evaluations on transformer-based classification and generative tasks we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Machine Learning in Materials Science
