Neural Networks Learn Distance Metrics

Alan Oursland

arXiv:2502.02103·cs.LG·February 5, 2025

Neural Networks Learn Distance Metrics

Alan Oursland

PDF

Open Access 1 Repo

TL;DR

This paper investigates how neural networks naturally favor distance-based representations over intensity-based ones, affecting performance, and introduces a new architecture, OffsetL2, to validate this geometric framework.

Contribution

It demonstrates the impact of representation type on neural network performance and proposes a novel distance-based architecture, OffsetL2, grounded in a new geometric framework.

Findings

01

Distance representations improve model performance

02

OffsetL2 architecture validates the geometric framework

03

Distance-based learning is crucial in neural network design

Abstract

Neural networks may naturally favor distance-based representations, where smaller activations indicate closer proximity to learned prototypes. This contrasts with intensity-based approaches, which rely on activation magnitudes. To test this hypothesis, we conducted experiments with six MNIST architectural variants constrained to learn either distance or intensity representations. Our results reveal that the underlying representation affects model performance. We develop a novel geometric framework that explains these findings and introduce OffsetL2, a new architecture based on Mahalanobis distance equations, to further validate this framework. This work highlights the importance of considering distance-based learning in neural network design.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alanoursland/neural_networks_learn_distance_metrics
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition