Entropy Transformer Networks: A Learning Approach via Tangent Bundle Data Manifold
Pourya Shamsolmoali, Masoumeh Zareapoor

TL;DR
This paper introduces Entropy Transformer Networks (ESTN), a novel interpolation method on data manifolds that enhances CNN performance under scale variations by preserving gradient norms and improving accuracy.
Contribution
The paper proposes ESTN, which interpolates on data manifolds using tangent spaces and entropy regularization, addressing limitations of standard spatial transformer networks.
Findings
Improves accuracy on image reconstruction and classification tasks.
Reduces computational cost compared to existing methods.
Enhances gradient norm preservation during training.
Abstract
This paper focuses on an accurate and fast interpolation approach for image transformation employed in the design of CNN architectures. Standard Spatial Transformer Networks (STNs) use bilinear or linear interpolation as their interpolation, with unrealistic assumptions about the underlying data distributions, which leads to poor performance under scale variations. Moreover, STNs do not preserve the norm of gradients in propagation due to their dependency on sparse neighboring pixels. To address this problem, a novel Entropy STN (ESTN) is proposed that interpolates on the data manifold distributions. In particular, random samples are generated for each pixel in association with the tangent space of the data manifold and construct a linear approximation of their intensity values with an entropy regularizer to compute the transformer parameters. A simple yet effective technique is also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Neural Network Applications · Image Processing Techniques and Applications
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Label Smoothing · Layer Normalization · Absolute Position Encodings · Linear Layer · Softmax · Dense Connections · Dropout
