Ruling Out to Rule In: Contrastive Hypothesis Retrieval for Medical Question Answering
Byeolhee Kim, Min-Kyung Kim, Young-Hak Kim, Tae-Joon Jeon

TL;DR
The paper introduces Contrastive Hypothesis Retrieval (CHR), a novel framework for medical question answering that explicitly models and suppresses plausible but incorrect hypotheses to improve retrieval accuracy.
Contribution
CHR is the first retrieval method to generate and contrast target and mimic hypotheses, significantly reducing hard-negative retrieval issues in medical QA.
Findings
CHR outperforms all five baselines across three benchmarks.
Up to 10.4 percentage points improvement over next-best methods.
85.2% of cases where CHR is correct but baseline isn't show no shared top-5 documents.
Abstract
Retrieval-augmented generation (RAG) grounds large language models in external medical knowledge, yet standard retrievers frequently surface hard negatives that are semantically close to the query but describe clinically distinct conditions. While existing query-expansion methods improve query representation to mitigate ambiguity, they typically focus on enriching target-relevant semantics without an explicit mechanism to selectively suppress specific, clinically plausible hard negatives. This leaves the system prone to retrieving plausible mimics that overshadow the actual diagnosis, particularly when such mimics are dominant within the corpus. We propose Contrastive Hypothesis Retrieval (CHR), a framework inspired by the process of clinical differential diagnosis. CHR generates a target hypothesis for the likely correct answer and a mimic hypothesis for the most plausible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
