TL;DR
This paper introduces a new benchmark and an asymmetric retrieval model for Chinese medical text retrieval, achieving high accuracy with low latency suitable for real-time applications.
Contribution
It presents CMedTEB, a high-quality Chinese medical text benchmark, and CARE, an asymmetric encoder architecture with a novel training strategy, advancing retrieval performance.
Findings
CARE outperforms state-of-the-art symmetric models on CMedTEB.
The benchmark CMedTEB is curated with expert validation to ensure high quality.
CARE achieves superior retrieval accuracy without increasing inference latency.
Abstract
Effective medical text retrieval requires both high accuracy and low latency. While LLM-based embedding models possess powerful retrieval capabilities, their prohibitive latency and high computational cost limit their application in real-time scenarios. Furthermore, the lack of comprehensive and high-fidelity benchmarks hinders progress in Chinese medical text retrieval. In this work, we introduce the Chinese Medical Text Embedding Benchmark (CMedTEB), a benchmark spanning three kinds of practical embedding tasks: retrieval, reranking, and semantic textual similarity (STS). Distinct from purely automated datasets, CMedTEB is curated via a rigorous multi-LLM voting pipeline validated by clinical experts, ensuring gold-standard label quality while effectively mitigating annotation noise. On this foundation, we propose the Chinese Medical Asymmetric REtriever (CARE), an asymmetric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
