Incorporating brain-inspired mechanisms for multimodal learning in artificial intelligence
Xiang He, Dongcheng Zhao, Yang Li, Qingqun Kong, Xin Yang, Yi Zeng

TL;DR
This paper introduces an inverse effectiveness driven multimodal fusion (IEMF) strategy inspired by brain mechanisms, improving multimodal AI performance and efficiency across various tasks and neural network types.
Contribution
It proposes a novel biologically inspired fusion method that enhances integration efficiency and reduces computational costs in multimodal neural networks.
Findings
Achieves up to 50% reduction in computational cost.
Demonstrates improved performance on audio-visual classification, continual learning, and question answering.
Shows good adaptability to both ANN and SNN architectures.
Abstract
Multimodal learning enhances the perceptual capabilities of cognitive systems by integrating information from different sensory modalities. However, existing multimodal fusion research typically assumes static integration, not fully incorporating key dynamic mechanisms found in the brain. Specifically, the brain exhibits an inverse effectiveness phenomenon, wherein weaker unimodal cues yield stronger multisensory integration benefits; conversely, when individual modal cues are stronger, the effect of fusion is diminished. This mechanism enables biological systems to achieve robust cognition even with scarce or noisy perceptual cues. Inspired by this biological mechanism, we explore the relationship between multimodal output and information from individual modalities, proposing an inverse effectiveness driven multimodal fusion (IEMF) strategy. By incorporating this strategy into neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems
