NeuralOOD: Improving Out-of-Distribution Generalization Performance with Brain-machine Fusion Learning Framework
Shuangchen Zhao, Changde Du, Hui Li, Huiguang He

TL;DR
NeuralOOD introduces a multimodal brain-machine fusion framework that leverages cognitive prior knowledge and visual data to enhance out-of-distribution generalization in computer vision tasks, outperforming existing models.
Contribution
The paper proposes a novel Brain-machine Fusion Learning framework using cross-attention and a brain transformer, integrating fMRI-based prior knowledge without requiring fMRI data collection.
Findings
Outperforms DINOv2 and baseline models on ImageNet-1k.
Achieves superior accuracy on six curated OOD datasets.
Effectively integrates multimodal knowledge for robust OOD generalization.
Abstract
Deep Neural Networks (DNNs) have demonstrated exceptional recognition capabilities in traditional computer vision (CV) tasks. However, existing CV models often suffer a significant decrease in accuracy when confronted with out-of-distribution (OOD) data. In contrast to these DNN models, human can maintain a consistently low error rate when facing OOD scenes, partly attributed to the rich prior cognitive knowledge stored in the human brain. Previous OOD generalization researches only focus on the single modal, overlooking the advantages of multimodal learning method. In this paper, we utilize the multimodal learning method to improve the OOD generalization and propose a novel Brain-machine Fusion Learning (BMFL) framework. We adopt the cross-attention mechanism to fuse the visual knowledge from CV model and prior cognitive knowledge from the human brain. Specially, we employ a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsFocus
