Revealing the working mechanism of quantum neural networks by mutual   information

Xin Zhang; Yuexian Hou

arXiv:2404.19312·quant-ph·May 1, 2024

Revealing the working mechanism of quantum neural networks by mutual information

Xin Zhang, Yuexian Hou

PDF

Open Access

TL;DR

This paper investigates the training mechanism of quantum neural networks using mutual information, revealing how information flows during training and correlates with learning and generalization.

Contribution

It introduces a method to analyze QNNs by dividing the circuit into subsystems and tracking mutual information, providing insights into their working mechanism.

Findings

01

Mutual information I(Di : Mo) increases during training, indicating information transfer to measurement subsystem.

02

I(Mi : Mo) exhibits a two-phase behavior, first increasing then decreasing, aligning with feature fitting and generalization.

03

The analysis links mutual information dynamics to QNNs' accuracy and generalization capabilities.

Abstract

Quantum neural networks (QNNs) is a parameterized quantum circuit model, which can be trained by gradient-based optimizer, can be used for supervised learning, regression tasks, combinatorial optimization, etc. Although many works have demonstrated that QNNs have better learnability, generalizability, etc. compared to classical neural networks. However, as with classical neural networks, we still can't explain their working mechanism well. In this paper, we reveal the training mechanism of QNNs by mutual information. Unlike traditional mutual information in neural networks, due to quantum computing remains information conserved, the mutual information is trivial of the input and output of U operator. In our work, in order to observe the change of mutual information during training, we divide the quantum circuit (U operator) into two subsystems, discard subsystem (D) and measurement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications