TL;DR
This paper introduces CoNNS, a novel framework that improves zero-shot classification and grounding of chest X-ray findings by suppressing noisy negatives through a hierarchical concept ontology and a concept-aware loss.
Contribution
It presents a concept-guided noisy-negative suppression method using a hierarchical ontology and relabeling strategies to enhance zero-shot medical image understanding.
Findings
Outperforms state-of-the-art models on multiple zero-shot tasks
Effectively suppresses noisy negatives to improve semantic alignment
Achieves superior accuracy in zero-shot classification and grounding
Abstract
Vision-language alignment using chest X-rays and radiology reports has emerged as an advanced paradigm for zero-shot classification and grounding of chest X-ray findings. However, standard contrastive learning typically treats radiographs and reports from different patients simply as negative pairs. This assumption introduces noisy negatives, as different patients frequently exhibit similar findings. Such noisy negatives cause semantic ambiguity and degrade performance in zero-shot understanding tasks. To address this challenge, we propose CoNNS, a concept-guided noisy-negative suppression framework. To support the negative suppression mechanism, unlike previous methods that use raw reports or templatized texts, we construct a hierarchical concept ontology using large language models. The ontology structures 41 key clinical concepts by explicitly modeling presence, attributes (location…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
