Is data-efficient learning feasible with quantum models?
Alona Sakhnenko, Christian B. Mendl, Jeanette M. Lorenz

TL;DR
This paper investigates the potential for quantum kernel methods to be more data-efficient than classical models, introducing new analytical tools and dataset generation techniques to explore the classical-quantum performance gap.
Contribution
It presents a method for generating classical datasets where QKMs outperform classical models in data efficiency and introduces a new analytical tool to assess the classical-quantum gap.
Findings
QKMs require less training data to achieve low error rates.
A new dataset generation method enables exploration of dataset properties.
The analytical generalization metric aligns well with empirical results.
Abstract
The importance of analyzing nontrivial datasets when testing quantum machine learning (QML) models is becoming increasingly prominent in literature, yet a cohesive framework for understanding dataset characteristics remains elusive. In this work, we concentrate on the size of the dataset as an indicator of its complexity and explores the potential for QML models to demonstrate superior data-efficiency compared to classical models, particularly through the lens of quantum kernel methods (QKMs). We provide a method for generating semi-artificial fully classical datasets, on which we show one of the first evidence of the existence of classical datasets where QKMs require less data during training. Additionally, our study introduces a new analytical tool to the QML domain, derived for classical kernel methods, which can be aimed at investigating the classical-quantum gap. Our empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
