Deep Tree Tensor Networks for Image Recognition
Chang Nie, Junfang Chen, Yajie Chen

TL;DR
This paper introduces Deep Tree Tensor Networks (DTTN), a novel architecture that captures high-order feature interactions for image recognition, outperforming existing models and offering interpretability inspired by quantum physics.
Contribution
The paper proposes DTTN, a tree-structured tensor network that efficiently models exponential feature interactions and demonstrates theoretical equivalence to polynomial and multilinear networks.
Findings
Achieves state-of-the-art performance on image recognition benchmarks.
Provides theoretical insights linking TN models to polynomial networks.
Offers a scalable, interpretable tensor network architecture for vision tasks.
Abstract
Originating in quantum physics, tensor networks (TNs) have been widely adopted as exponential machines and parameter decomposers for recognition tasks. Typical TN models, such as Matrix Product States (MPS), have not yet achieved successful application in natural image processing. When employed, they primarily serve to compress parameters within off-the-shelf networks, thus losing their distinctive capability to enhance exponential-order feature interactions. This paper introduces a novel architecture named \textit{\textbf{D}eep \textbf{T}ree \textbf{T}ensor \textbf{N}etwork} (DTTN), which captures -order multiplicative interactions across features through multilinear operations, while essentially unfolding into a \emph{tree}-like TN topology with the parameter-sharing property. DTTN is stacked with multiple antisymmetric interacting modules (AIMs), and this design facilitates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications
