NeuralOOD: Improving Out-of-Distribution Generalization Performance with   Brain-machine Fusion Learning Framework

Shuangchen Zhao; Changde Du; Hui Li; Huiguang He

arXiv:2408.14950·cs.CV·August 28, 2024

NeuralOOD: Improving Out-of-Distribution Generalization Performance with Brain-machine Fusion Learning Framework

Shuangchen Zhao, Changde Du, Hui Li, Huiguang He

PDF

Open Access

TL;DR

NeuralOOD introduces a multimodal brain-machine fusion framework that leverages cognitive prior knowledge and visual data to enhance out-of-distribution generalization in computer vision tasks, outperforming existing models.

Contribution

The paper proposes a novel Brain-machine Fusion Learning framework using cross-attention and a brain transformer, integrating fMRI-based prior knowledge without requiring fMRI data collection.

Findings

01

Outperforms DINOv2 and baseline models on ImageNet-1k.

02

Achieves superior accuracy on six curated OOD datasets.

03

Effectively integrates multimodal knowledge for robust OOD generalization.

Abstract

Deep Neural Networks (DNNs) have demonstrated exceptional recognition capabilities in traditional computer vision (CV) tasks. However, existing CV models often suffer a significant decrease in accuracy when confronted with out-of-distribution (OOD) data. In contrast to these DNN models, human can maintain a consistently low error rate when facing OOD scenes, partly attributed to the rich prior cognitive knowledge stored in the human brain. Previous OOD generalization researches only focus on the single modal, overlooking the advantages of multimodal learning method. In this paper, we utilize the multimodal learning method to improve the OOD generalization and propose a novel Brain-machine Fusion Learning (BMFL) framework. We adopt the cross-attention mechanism to fuse the visual knowledge from CV model and prior cognitive knowledge from the human brain. Specially, we employ a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsFocus