Learning predictable and robust neural representations by straightening   image sequences

Xueyan Niu; Cristina Savin; Eero P. Simoncelli

arXiv:2411.01777·cs.CV·January 22, 2025

Learning predictable and robust neural representations by straightening image sequences

Xueyan Niu, Cristina Savin, Eero P. Simoncelli

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a self-supervised learning method that encourages neural representations to follow straighter temporal trajectories, leading to more predictive, robust, and disentangled features in image sequence models.

Contribution

The authors propose a novel straightening objective for SSL that improves predictability, robustness, and attribute disentanglement in neural representations, inspired by primate visual system findings.

Findings

01

Learned embeddings are predictive and disentangle object attributes.

02

Models trained with straightening are more noise- and adversarial-robust.

03

Straightening can be used as a regularizer for other training methods.

Abstract

Prediction is a fundamental capability of all living organisms, and has been proposed as an objective for learning sensory representations. Recent work demonstrates that in primate visual systems, prediction is facilitated by neural representations that follow straighter temporal trajectories than their initial photoreceptor encoding, which allows for prediction by linear extrapolation. Inspired by these experimental findings, we develop a self-supervised learning (SSL) objective that explicitly quantifies and promotes straightening. We demonstrate the power of this objective in training deep feedforward neural networks on smoothly-rendered synthetic image sequences that mimic commonly-occurring properties of natural videos. The learned model contains neural embeddings that are predictive, but also factorize the geometric, photometric, and semantic attributes of objects. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xyniu1/learning-by-straightening
noneOfficial

Videos

Learning predictable and robust neural representations by straightening image sequences· slideslive

Taxonomy

TopicsNeural Networks and Applications