Tensor-Augmented Convolutional Neural Networks: Enhancing Expressivity with Generic Tensor Kernels

Chia-Wei Hsing; Wei-Lin Tu

arXiv:2604.08072·cs.CV·April 10, 2026

Tensor-Augmented Convolutional Neural Networks: Enhancing Expressivity with Generic Tensor Kernels

Chia-Wei Hsing, Wei-Lin Tu

PDF

TL;DR

The paper introduces tensor-augmented CNNs (TACNNs), which replace traditional kernels with tensors to enhance expressivity, achieving deep learning performance with shallower, more interpretable models.

Contribution

Proposes a physically-guided shallow model, TACNN, using generic tensors to significantly improve expressivity over conventional CNNs with fewer layers.

Findings

01

TACNN achieves 93.7% accuracy on Fashion-MNIST with only two layers.

02

TACNN outperforms or matches deeper models like VGG-16 and GoogLeNet.

03

TACNN offers a more interpretable and efficient architecture.

Abstract

Convolutional Neural Networks (CNNs) excel at extracting local features hierarchically, but their performance in capturing complex correlations hinges heavily on deep architectures, which are usually computationally demanding and difficult to interpret. To address these issues, we propose a physically-guided shallow model: tensor-augmented CNN (TACNN), which replaces conventional convolution kernels with generic tensors to enhance representational capacity. This choice is motivated by the fact that an order- $N$ tensor naturally encodes an arbitrary quantum superposition state in the Hilbert space of dimension $d^{N}$ , where $d$ is the local physical dimension, thus offering substantially richer expressivity. Furthermore, in our design the convolution output of each layer becomes a multilinear form capable of capturing high-order feature correlations, thereby equipping a shallow multilayer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.