Neuro-Inspired Hierarchical Multimodal Learning
Xiongye Xiao, Gengshuo Liu, Gaurav Gupta, Defu Cao, Shixuan Li, Yaxing, Li, Tianqing Fang, Mingxi Cheng, Paul Bogdan

TL;DR
This paper introduces a neuroscience-inspired hierarchical multimodal learning model that uses an information bottleneck approach to create compact, relevant representations from multiple data sources, improving performance on perception tasks.
Contribution
The proposed ITHP model uniquely designates a primary modality and uses an information bottleneck to enhance multimodal perception, differing from traditional fusion methods.
Findings
Outperforms state-of-the-art benchmarks on MUStARD and CMU-MOSI datasets.
Creates compact, relevant latent representations for multimodal data.
Effectively balances information retention and redundancy minimization.
Abstract
Integrating and processing information from various sources or modalities are critical for obtaining a comprehensive and accurate perception of the real world. Drawing inspiration from neuroscience, we develop the Information-Theoretic Hierarchical Perception (ITHP) model, which utilizes the concept of information bottleneck. Distinct from most traditional fusion models that aim to incorporate all modalities as input, our model designates the prime modality as input, while the remaining modalities act as detectors in the information pathway. Our proposed perception model focuses on constructing an effective and compact information flow by achieving a balance between the minimization of mutual information between the latent state and the input modal state, and the maximization of mutual information between the latent states and the remaining modal states. This approach leads to compact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Domain Adaptation and Few-Shot Learning · Advanced Memory and Neural Computing
