Encoding Numerical Data for Generative Quantum Machine Learning
Michael Krebsbach, Florentin Reiter, Thomas Wellens, Hagen-Henrik Kowalski, Ali Abedi

TL;DR
This paper investigates how encoding numerical data affects the performance of quantum generative models, proposing Gray-code encoding to improve learning efficiency and accuracy.
Contribution
It introduces a Gray-code based encoding strategy for quantum generative models, reducing artificial correlations and preserving data structure, leading to better performance.
Findings
Gray-code encoding improves learning speed and accuracy
Standard binary encoding can obscure data structure
Gray-code encoding reduces artificial correlations
Abstract
Generative quantum machine learning models are trained to deduce the probability distribution underlying a given dataset, and to produce new, synthetic samples from it. The majority of such models proposed in the literature, like the Quantum Circuit Born Machine (QCBM), fundamentally work on a binary level. Real-world data, however, is often numeric, requiring the models to translate between binary and continuous representations. We analyze how this transition influences the performance of quantum models and show that it requires the models to learn correlations that are solely an artifact of the way the data is encoded, and not related to the data itself. At the same time, structure of the original data can be obscured in the binary representation, hindering generalization. To mitigate these effects, we propose a strategy based on Gray-codes that can be implemented with essentially no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Quantum many-body systems · Quantum Information and Cryptography
