DxFormer: A Decoupled Automatic Diagnostic System Based on Decoder-Encoder Transformer with Dense Symptom Representations
Wei Chen, Cheng Zhong, Jiajie Peng, Zhongyu Wei

TL;DR
DxFormer is a novel decoupled diagnostic system using a decoder-encoder Transformer architecture that improves disease prediction accuracy by explicitly separating symptom inquiry and diagnosis, effectively learning clinical experience from data.
Contribution
The paper introduces DxFormer, a decoupled Transformer-based framework that separates symptom inquiry and disease diagnosis, achieving state-of-the-art results in automatic medical diagnosis.
Findings
Achieves superior symptom recall and diagnostic accuracy.
Effectively learns clinical experience from datasets.
Outperforms existing reinforcement learning and non-RL methods.
Abstract
Diagnosis-oriented dialogue system queries the patient's health condition and makes predictions about possible diseases through continuous interaction with the patient. A few studies use reinforcement learning (RL) to learn the optimal policy from the joint action space of symptoms and diseases. However, existing RL (or Non-RL) methods cannot achieve sufficiently good prediction accuracy, still far from its upper limit. To address the problem, we propose a decoupled automatic diagnostic framework DxFormer, which divides the diagnosis process into two steps: symptom inquiry and disease diagnosis, where the transition from symptom inquiry to disease diagnosis is explicitly determined by the stopping criteria. In DxFormer, we treat each symptom as a token, and formalize the symptom inquiry and disease diagnosis to a language generation model and a sequence classification model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Residual Connection · Position-Wise Feed-Forward Layer · Dense Connections · Dropout · Softmax
