Text2MDT: Extracting Medical Decision Trees from Medical Texts
Wei Zhu, Wenfeng Li, Xing Tian, Pengfei Wang, Xiaoling, Wang, Jin Chen, Yuanbin Wu, Yuan Ni, Guotong Xie

TL;DR
This paper introduces Text2MDT, a novel approach for automatically extracting medical decision trees from Chinese medical texts using large language models and a new annotated dataset, improving efficiency over manual methods.
Contribution
The work presents a new task, a Chinese dataset, and two methods—end-to-end LLM-based and pipeline—for extracting medical decision trees from texts.
Findings
End-to-end LLM method outperforms pipeline approaches.
Chain-of-thought prompting enhances LLM performance.
Lightweight models achieve comparable results to large LLMs.
Abstract
Knowledge of the medical decision process, which can be modeled as medical decision trees (MDTs), is critical to build clinical decision support systems. However, the current MDT construction methods rely heavily on time-consuming and laborious manual annotation. In this work, we propose a novel task, Text2MDT, to explore the automatic extraction of MDTs from medical texts such as medical guidelines and textbooks. We normalize the form of the MDT and create an annotated Text-to-MDT dataset in Chinese with the participation of medical experts. We investigate two different methods for the Text2MDT tasks: (a) an end-to-end framework which only relies on a GPT style large language models (LLM) instruction tuning to generate all the node information and tree structures. (b) The pipeline framework which decomposes the Text2MDT task to three subtasks. Experiments on our Text2MDT dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Biomedical Text Mining and Ontologies
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Adam · Discriminative Fine-Tuning · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Residual Connection · Attention Dropout
