MedCT: A Clinical Terminology Graph for Generative AI Applications in Healthcare
Ye Chen, Dongdong Huang, Haoyun Xu, Cong Fu, Lin Sheng, Qingli Zhou,, Yuqiang Shen, Kai Wang

TL;DR
This paper introduces MedCT, a comprehensive Chinese clinical terminology system with associated models, enhancing clinical data standardization, reducing LLM hallucinations, and improving healthcare applications in China and beyond.
Contribution
The paper presents the first Chinese clinical terminology MedCT, along with foundation and entity linking models, enabling rapid deployment and state-of-the-art performance in clinical NLP tasks.
Findings
MedCT achieves SOTA performance in semantic matching and entity linking.
The system effectively reduces hallucination in LLM-based clinical applications.
MedCT is deployable within three months, outperforming traditional terminologies.
Abstract
We introduce the world's first clinical terminology for the Chinese healthcare community, namely MedCT, accompanied by a clinical foundation model MedBERT and an entity linking model MedLink. The MedCT system enables standardized and programmable representation of Chinese clinical data, successively stimulating the development of new medicines, treatment pathways, and better patient outcomes for the populous Chinese community. Moreover, the MedCT knowledge graph provides a principled mechanism to minimize the hallucination problem of large language models (LLMs), therefore achieving significant levels of accuracy and safety in LLM-based clinical applications. By leveraging the LLMs' emergent capabilities of generativeness and expressiveness, we were able to rapidly built a production-quality terminology system and deployed to real-world clinical field within three months, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
