Learning predictable and robust neural representations by straightening image sequences
Xueyan Niu, Cristina Savin, Eero P. Simoncelli

TL;DR
This paper introduces a self-supervised learning method that encourages neural representations to follow straighter temporal trajectories, leading to more predictive, robust, and disentangled features in image sequence models.
Contribution
The authors propose a novel straightening objective for SSL that improves predictability, robustness, and attribute disentanglement in neural representations, inspired by primate visual system findings.
Findings
Learned embeddings are predictive and disentangle object attributes.
Models trained with straightening are more noise- and adversarial-robust.
Straightening can be used as a regularizer for other training methods.
Abstract
Prediction is a fundamental capability of all living organisms, and has been proposed as an objective for learning sensory representations. Recent work demonstrates that in primate visual systems, prediction is facilitated by neural representations that follow straighter temporal trajectories than their initial photoreceptor encoding, which allows for prediction by linear extrapolation. Inspired by these experimental findings, we develop a self-supervised learning (SSL) objective that explicitly quantifies and promotes straightening. We demonstrate the power of this objective in training deep feedforward neural networks on smoothly-rendered synthetic image sequences that mimic commonly-occurring properties of natural videos. The learned model contains neural embeddings that are predictive, but also factorize the geometric, photometric, and semantic attributes of objects. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications
