Learning Collaborative Knowledge with Multimodal Representation for Polyp Re-Identification

Suncheng Xiang; Jiale Guan; Shilun Cai; Jiacheng Ruan; Dahong Qian

arXiv:2408.05914·cs.CV·October 22, 2025

Learning Collaborative Knowledge with Multimodal Representation for Polyp Re-Identification

Suncheng Xiang, Jiale Guan, Shilun Cai, Jiacheng Ruan, Dahong Qian

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep multimodal learning framework for colonoscopic polyp re-identification, improving retrieval accuracy by leveraging visual and text modalities and collaborative fusion strategies to address domain gaps.

Contribution

It proposes a novel DMCL framework with dynamic multimodal fusion, enhancing generalization and performance in medical polyp re-identification tasks.

Findings

01

Multimodal approach outperforms unimodal models on benchmarks.

02

Collaborative fusion improves retrieval accuracy.

03

End-to-end training effectively integrates visual and text features.

Abstract

Colonoscopic Polyp Re-Identification aims to match the same polyp from a large gallery with images from different views taken using different cameras, which plays an important role in the prevention and treatment of colorectal cancer in computer-aided diagnosis. However, traditional methods for object ReID directly adopting CNN models trained on the ImageNet dataset usually produce unsatisfactory retrieval performance on colonoscopic datasets due to the large domain gap. Worsely, these solutions typically learn unimodal modal representations on the basis of visual samples, which fails to explore complementary information from other different modalities. To address this challenge, we propose a novel Deep Multimodal Collaborative Learning framework named DMCL for polyp re-identification, which can effectively encourage multimodal knowledge collaboration and reinforce generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jeremyxsc/dmcl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLaser and Thermal Forming Techniques · Applied Advanced Technologies