Tensor Methods in Computer Vision and Deep Learning
Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield,, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

TL;DR
This paper reviews the role of tensor methods in computer vision and deep learning, highlighting their applications in data representation, architecture design, robustness, and theoretical understanding, supported by practical Python examples.
Contribution
It provides a comprehensive review of tensor techniques in deep learning for vision, including recent advances and practical implementations with TensorLy.
Findings
Tensor methods enhance visual data analysis and deep learning architectures.
They improve robustness to noise and adversarial attacks.
Tensor techniques facilitate theoretical insights into deep networks.
Abstract
Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have become even more fundamental. Indeed, essential ingredients in modern deep learning architectures, such as convolutions and attention mechanisms, can readily be considered as tensor mappings. In effect, tensor methods are increasingly finding significant applications in deep learning, including the design of memory and compute efficient network architectures, improving robustness to random noise and adversarial attacks, and aiding the theoretical understanding of deep networks. This article…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
