Knowledge-driven Subspace Fusion and Gradient Coordination for Multi-modal Learning

Yupei Zhang; Xiaofei Wang; Fangliangzi Meng; Jin Tang; Chao Li

arXiv:2406.13979 · eess.IV · June 21, 2024

TL;DR

This paper introduces a biologically interpretable multi-modal learning framework that effectively integrates histology images and genomics data for cancer diagnosis, leveraging subspace fusion and gradient coordination to improve performance.

Contribution

It proposes a novel knowledge-driven subspace fusion scheme and a gradient coordination strategy to enhance multi-modal learning in cancer analysis.

Findings

1. Outperforms state-of-the-art methods in glioma diagnosis, grading, and survival analysis.

2. Demonstrates robustness and interpretability in integrating histology and genomics data.

3. Effective in modeling complex tumor and microenvironment interactions.

Abstract

Multi-modal learning plays a crucial role in cancer diagnosis and prognosis. Current deep-learning-based multi-modal approaches are often limited in their ability to model the complex correlations between genomics and histology data and to address the intrinsic complexity of the tumour ecosystem, where both tumour and microenvironment contribute to malignancy. We propose a biologically interpretable and robust multi-modal learning framework that efficiently integrates histology images and genomics by decomposing the feature subspace of each modality to reflect distinct tumour and microenvironment features. To enhance cross-modal interactions, we design a knowledge-driven subspace fusion scheme consisting of a cross-modal deformable attention module and a gene-guided consistency strategy. Additionally, in pursuit of dynamically optimizing the subspace knowledge, we further…
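
The abstract names a cross-modal deformable attention module, but this excerpt does not describe its internals. As a rough illustration of the general idea only, the sketch below uses plain scaled dot-product cross-attention (not the deformable variant, and not the authors' implementation), with histology tokens acting as queries over genomic tokens. All names, shapes, and values here (`cross_attention`, `histology_tokens`, `genomic_tokens`) are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Plain scaled dot-product cross-attention (illustrative assumption).

    queries: token vectors from one modality (e.g. histology patches)
    keys, values: token vectors from the other modality (e.g. gene groups)
    Returns the fused outputs and the attention weights per query.
    """
    dim = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(dim).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim)
            for k in keys
        ]
        weights = softmax(scores)
        # Weighted sum of the value vectors, one output channel at a time.
        fused = [
            sum(w * v[c] for w, v in zip(weights, values))
            for c in range(len(values[0]))
        ]
        outputs.append(fused)
        all_weights.append(weights)
    return outputs, all_weights


# Toy inputs: 2 histology tokens attend over 3 genomic tokens (made-up values).
histology_tokens = [[1.0, 0.0], [0.0, 1.0]]
genomic_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused, attn = cross_attention(histology_tokens, genomic_tokens, genomic_tokens)
```

Each row of `attn` is a probability distribution over the genomic tokens, so a histology token draws most heavily on the genomic tokens it is most similar to. A deformable variant would additionally learn where to sample; that part is deliberately left out here.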

Code & Models

Repositories

helenypzhang/subspace-multimodal-learning (PyTorch, official)

Taxonomy

Topics: Rough Sets and Fuzzy Logic · Multi-Criteria Decision Making · Text and Document Classification Technologies

Methods: Softmax · Attention Is All You Need · Deformable Attention Module