Benchmarking and Enabling Efficient Chinese Medical Retrieval via Asymmetric Encoders

Angqing Jiang; Jianlyu Chen; Zhe Fang; Yongcan Wang; Xinpeng Li; Keyu Ding; Defu Lian

arXiv:2604.10937·cs.IR·April 21, 2026

Benchmarking and Enabling Efficient Chinese Medical Retrieval via Asymmetric Encoders

Angqing Jiang, Jianlyu Chen, Zhe Fang, Yongcan Wang, Xinpeng Li, Keyu Ding, Defu Lian

PDF

1 Models

TL;DR

This paper introduces a new benchmark and an asymmetric retrieval model for Chinese medical text retrieval, achieving high accuracy with low latency suitable for real-time applications.

Contribution

It presents CMedTEB, a high-quality Chinese medical text benchmark, and CARE, an asymmetric encoder architecture with a novel training strategy, advancing retrieval performance.

Findings

01

CARE outperforms state-of-the-art symmetric models on CMedTEB.

02

The benchmark CMedTEB is curated with expert validation to ensure high quality.

03

CARE achieves superior retrieval accuracy without increasing inference latency.

Abstract

Effective medical text retrieval requires both high accuracy and low latency. While LLM-based embedding models possess powerful retrieval capabilities, their prohibitive latency and high computational cost limit their application in real-time scenarios. Furthermore, the lack of comprehensive and high-fidelity benchmarks hinders progress in Chinese medical text retrieval. In this work, we introduce the Chinese Medical Text Embedding Benchmark (CMedTEB), a benchmark spanning three kinds of practical embedding tasks: retrieval, reranking, and semantic textual similarity (STS). Distinct from purely automated datasets, CMedTEB is curated via a rigorous multi-LLM voting pipeline validated by clinical experts, ensuring gold-standard label quality while effectively mitigating annotation noise. On this foundation, we propose the Chinese Medical Asymmetric REtriever (CARE), an asymmetric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
PhilipGAQ/CARE-0.3B-4B
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.