Entropy Transformer Networks: A Learning Approach via Tangent Bundle   Data Manifold

Pourya Shamsolmoali; Masoumeh Zareapoor

arXiv:2307.12517·cs.CV·July 25, 2023

Entropy Transformer Networks: A Learning Approach via Tangent Bundle Data Manifold

Pourya Shamsolmoali, Masoumeh Zareapoor

PDF

Open Access

TL;DR

This paper introduces Entropy Transformer Networks (ESTN), a novel interpolation method on data manifolds that enhances CNN performance under scale variations by preserving gradient norms and improving accuracy.

Contribution

The paper proposes ESTN, which interpolates on data manifolds using tangent spaces and entropy regularization, addressing limitations of standard spatial transformer networks.

Findings

01

Improves accuracy on image reconstruction and classification tasks.

02

Reduces computational cost compared to existing methods.

03

Enhances gradient norm preservation during training.

Abstract

This paper focuses on an accurate and fast interpolation approach for image transformation employed in the design of CNN architectures. Standard Spatial Transformer Networks (STNs) use bilinear or linear interpolation as their interpolation, with unrealistic assumptions about the underlying data distributions, which leads to poor performance under scale variations. Moreover, STNs do not preserve the norm of gradients in propagation due to their dependency on sparse neighboring pixels. To address this problem, a novel Entropy STN (ESTN) is proposed that interpolates on the data manifold distributions. In particular, random samples are generated for each pixel in association with the tangent space of the data manifold and construct a linear approximation of their intensity values with an entropy regularizer to compute the transformer parameters. A simple yet effective technique is also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Advanced Neural Network Applications · Image Processing Techniques and Applications

MethodsMulti-Head Attention · Attention Is All You Need · Adam · Label Smoothing · Layer Normalization · Absolute Position Encodings · Linear Layer · Softmax · Dense Connections · Dropout