Concept-Guided Noisy Negative Suppression for Zero-Shot Classification and Grounding of Chest X-Ray Findings

Chenyu Lian; Hong-Yu Zhou; Chun-Ka Wong; Jing Qin

arXiv:2605.19374·cs.CV·May 20, 2026

Concept-Guided Noisy Negative Suppression for Zero-Shot Classification and Grounding of Chest X-Ray Findings

Chenyu Lian, Hong-Yu Zhou, Chun-Ka Wong, Jing Qin

PDF

1 Repo

TL;DR

This paper introduces CoNNS, a novel framework that improves zero-shot classification and grounding of chest X-ray findings by suppressing noisy negatives through a hierarchical concept ontology and a concept-aware loss.

Contribution

It presents a concept-guided noisy-negative suppression method using a hierarchical ontology and relabeling strategies to enhance zero-shot medical image understanding.

Findings

01

Outperforms state-of-the-art models on multiple zero-shot tasks

02

Effectively suppresses noisy negatives to improve semantic alignment

03

Achieves superior accuracy in zero-shot classification and grounding

Abstract

Vision-language alignment using chest X-rays and radiology reports has emerged as an advanced paradigm for zero-shot classification and grounding of chest X-ray findings. However, standard contrastive learning typically treats radiographs and reports from different patients simply as negative pairs. This assumption introduces noisy negatives, as different patients frequently exhibit similar findings. Such noisy negatives cause semantic ambiguity and degrade performance in zero-shot understanding tasks. To address this challenge, we propose CoNNS, a concept-guided noisy-negative suppression framework. To support the negative suppression mechanism, unlike previous methods that use raw reports or templatized texts, we construct a hierarchical concept ontology using large language models. The ontology structures 41 key clinical concepts by explicitly modeling presence, attributes (location…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DopamineLcy/conns
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.