A Framework for Learning Invariant Physical Relations in Multimodal Sensory Processing
Du Xiaorui, Yavuzhan Erdem, Immanuel Schweizer, Cristian Axenie

TL;DR
This paper presents a neural network framework that learns invariant physical relations from multimodal sensory data in an unsupervised, stable, and noise-tolerant manner, inspired by perceptual learning in humans.
Contribution
It introduces a novel neural architecture capable of unsupervised learning of non-linear relations among multiple sensory cues without prior sensor modeling.
Findings
Successfully learned physical relations from RGB images such as light intensity, gradient, and optical flow.
Achieved stable convergence in high-dimensional multisensory data.
Demonstrated robustness to noise and missing sensor inputs.
Abstract
Perceptual learning enables humans to recognize and represent stimuli invariant to various transformations and build a consistent representation of the self and physical world. Such representations preserve the invariant physical relations among the multiple perceived sensory cues. This work is an attempt to exploit these principles in an engineered system. We design a novel neural network architecture capable of learning, in an unsupervised manner, relations among multiple sensory cues. The system combines computational principles, such as competition, cooperation, and correlation, in a neurally plausible computational substrate. It achieves that through a parallel and distributed processing architecture in which the relations among the multiple sensory quantities are extracted from time-sequenced data. We describe the core system functionality when learning arbitrary non-linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Visual perception and processing mechanisms · Advanced Vision and Imaging
