DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object   Re-Identification

Yuhao Wang; Yang Liu; Aihua Zheng; Pingping Zhang

arXiv:2412.10650·cs.CV·December 17, 2024

DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification

Yuhao Wang, Yang Liu, Aihua Zheng, Pingping Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

DeMo introduces a novel framework for multi-modal object ReID that decouples modality-specific features and adaptively balances them using a mixture of experts, leading to more robust multi-modal feature representations.

Contribution

The paper proposes a new feature learning framework with decoupled features and dynamic attention-based mixture of experts for improved multi-modal object ReID.

Findings

01

Outperforms existing methods on three multi-modal ReID benchmarks.

02

Effectively preserves modality-specific information through hierarchical decoupling.

03

Enhances robustness of features via dynamic attention in the mixture of experts.

Abstract

Multi-modal object Re-IDentification (ReID) aims to retrieve specific objects by combining complementary information from multiple modalities. Existing multi-modal object ReID methods primarily focus on the fusion of heterogeneous features. However, they often overlook the dynamic quality changes in multi-modal imaging. In addition, the shared information between different modalities can weaken modality-specific information. To address these issues, we propose a novel feature learning framework called DeMo for multi-modal object ReID, which adaptively balances decoupled features using a mixture of experts. To be specific, we first deploy a Patch-Integrated Feature Extractor (PIFE) to extract multi-granularity and multi-modal features. Then, we introduce a Hierarchical Decoupling Module (HDM) to decouple multi-modal features into non-overlapping forms, preserving the modality uniqueness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

924973292/demo
pytorchOfficial

Videos

DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification· underline

Taxonomy

TopicsAnomaly Detection Techniques and Applications

MethodsSoftmax · Attention Is All You Need · Focus