Learning Discriminative Visual-Text Representation for Polyp Re-Identification
Suncheng Xiang, Cang Liu, Sijia Du, Dahong Qian

TL;DR
This paper introduces VT-ReID, a novel training approach that combines visual and semantic features with clustering to improve colonoscopic polyp re-identification, significantly outperforming existing methods.
Contribution
It presents the first use of visual-text features with clustering for polyp re-identification, enhancing representation and generalization capabilities.
Findings
Significant performance improvement over state-of-the-art methods
Effective integration of semantic features via contrastive learning
Novel clustering mechanism leveraging textual data
Abstract
Colonoscopic Polyp Re-Identification aims to match a specific polyp in a large gallery with different cameras and views, which plays a key role for the prevention and treatment of colorectal cancer in the computer-aided diagnosis. However, traditional methods mainly focus on the visual representation learning, while neglect to explore the potential of semantic features during training, which may easily leads to poor generalization capability when adapted the pretrained model into the new scenarios. To relieve this dilemma, we propose a simple but effective training method named VT-ReID, which can remarkably enrich the representation of polyp videos with the interchange of high-level semantic information. Moreover, we elaborately design a novel clustering mechanism to introduce prior knowledge from textual data, which leverages contrastive learning to promote better separation from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization
MethodsContrastive Learning · Focus
