Tensor Methods in Computer Vision and Deep Learning

Yannis Panagakis; Jean Kossaifi; Grigorios G. Chrysos; James Oldfield,; Mihalis A. Nicolaou; Anima Anandkumar; Stefanos Zafeiriou

arXiv:2107.03436·cs.CV·July 9, 2021

Tensor Methods in Computer Vision and Deep Learning

Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield,, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

PDF

TL;DR

This paper reviews the role of tensor methods in computer vision and deep learning, highlighting their applications in data representation, architecture design, robustness, and theoretical understanding, supported by practical Python examples.

Contribution

It provides a comprehensive review of tensor techniques in deep learning for vision, including recent advances and practical implementations with TensorLy.

Findings

01

Tensor methods enhance visual data analysis and deep learning architectures.

02

They improve robustness to noise and adversarial attacks.

03

Tensor techniques facilitate theoretical insights into deep networks.

Abstract

Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have become even more fundamental. Indeed, essential ingredients in modern deep learning architectures, such as convolutions and attention mechanisms, can readily be considered as tensor mappings. In effect, tensor methods are increasingly finding significant applications in deep learning, including the design of memory and compute efficient network architectures, improving robustness to random noise and adversarial attacks, and aiding the theoretical understanding of deep networks. This article…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.