OTCXR: Rethinking Self-supervised Alignment using Optimal Transport for Chest X-ray Analysis
Vandan Gorade, Azad Singh, and Deepak Mishra

TL;DR
OTCXR introduces an optimal transport-based self-supervised learning framework for chest X-ray analysis, improving semantic alignment and feature representation by capturing both local and global contextual information.
Contribution
The paper proposes OTCXR, a novel SSL method that integrates optimal transport with a cross-viewpoint semantics module to enhance semantic invariance in chest X-ray representations.
Findings
OTCXR outperforms state-of-the-art methods on multiple chest X-ray datasets.
The framework effectively captures both local details and global context.
Enhanced representations improve thoracic disease diagnosis accuracy.
Abstract
Self-supervised learning (SSL) has emerged as a promising technique for analyzing medical modalities such as X-rays due to its ability to learn without annotations. However, conventional SSL methods face challenges in achieving semantic alignment and capturing subtle details, which limits their ability to accurately represent the underlying anatomical structures and pathological features. To address these limitations, we propose OTCXR, a novel SSL framework that leverages optimal transport (OT) to learn dense semantic invariance. By integrating OT with our innovative Cross-Viewpoint Semantics Infusion Module (CV-SIM), OTCXR enhances the model's ability to capture not only local spatial features but also global contextual dependencies across different viewpoints. This approach enriches the effectiveness of SSL in the context of chest radiographs. Furthermore, OTCXR incorporates variance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Image Retrieval and Classification Techniques · Brain Tumor Detection and Classification
MethodsFocus
