Learning Discriminative Visual-Text Representation for Polyp Re-Identification

Suncheng Xiang; Cang Liu; Sijia Du; Dahong Qian

arXiv:2307.10625·cs.CV·July 21, 2023

Open Access · 1 Repo

TL;DR

This paper introduces VT-ReID, a novel training approach that combines visual and semantic features with clustering to improve colonoscopic polyp re-identification, significantly outperforming existing methods.

Contribution

It presents the first use of visual-text features with clustering for polyp re-identification, enhancing representation and generalization capabilities.

Findings

1. Significant performance improvement over state-of-the-art methods

2. Effective integration of semantic features via contrastive learning

3. Novel clustering mechanism leveraging textual data

Abstract

Colonoscopic polyp re-identification aims to match a specific polyp in a large gallery captured with different cameras and views, and plays a key role in the prevention and treatment of colorectal cancer in computer-aided diagnosis. However, traditional methods focus mainly on visual representation learning and neglect the potential of semantic features during training, which can easily lead to poor generalization when the pretrained model is adapted to new scenarios. To address this, we propose a simple but effective training method named VT-ReID, which remarkably enriches the representation of polyp videos through the interchange of high-level semantic information. Moreover, we design a novel clustering mechanism to introduce prior knowledge from textual data, which leverages contrastive learning to promote better separation from…
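To make the idea of contrastive learning between visual and textual features concrete, here is a minimal sketch of a symmetric InfoNCE-style loss over paired visual and text embeddings. It is not taken from the VT-ReID paper or repository: the function names, the temperature value of 0.07, and the toy embeddings are all assumptions chosen for illustration, and the code uses only the Python standard library so that it runs without any deep learning framework.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (left unchanged if its norm is zero)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def cross_entropy_diagonal(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        row_max = max(row)  # subtract the max for numerical stability
        log_sum_exp = row_max + math.log(sum(math.exp(v - row_max) for v in row))
        total += log_sum_exp - row[i]
    return total / len(logits)


def visual_text_contrastive_loss(visual, text, temperature=0.07):
    """Symmetric contrastive loss over paired embeddings.

    visual[i] and text[i] are assumed to describe the same polyp;
    every other pairing in the batch is treated as a negative.
    The temperature is an assumed hyperparameter, not a value from the paper.
    """
    v = [l2_normalize(x) for x in visual]
    t = [l2_normalize(x) for x in text]
    # Cosine similarity between every visual and every text embedding.
    logits = [
        [sum(a * b for a, b in zip(vi, tj)) / temperature for tj in t]
        for vi in v
    ]
    logits_transposed = [list(col) for col in zip(*logits)]
    visual_to_text = cross_entropy_diagonal(logits)
    text_to_visual = cross_entropy_diagonal(logits_transposed)
    return 0.5 * (visual_to_text + text_to_visual)


# Toy embeddings (hypothetical): two polyps, two-dimensional features.
visual = [[1.0, 0.0], [0.0, 1.0]]
aligned_text = [[0.9, 0.1], [0.1, 0.9]]   # text matches its own polyp
swapped_text = [[0.1, 0.9], [0.9, 0.1]]   # text matches the other polyp

aligned_loss = visual_text_contrastive_loss(visual, aligned_text)
swapped_loss = visual_text_contrastive_loss(visual, swapped_text)
print(aligned_loss < swapped_loss)
```

The loss is small when each visual embedding is closest to its own text embedding and large when the pairing is swapped, which is the separation behaviour a contrastive objective is meant to encourage. How VT-ReID combines such a term with its clustering mechanism is not described in the truncated abstract, so that part is left out of the sketch.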

Code & Models

Repositories

jeremyxsc/vt-reid
PyTorch · Official

Taxonomy

Topics: Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization

Methods: Contrastive Learning · Focus