DxFormer: A Decoupled Automatic Diagnostic System Based on   Decoder-Encoder Transformer with Dense Symptom Representations

Wei Chen; Cheng Zhong; Jiajie Peng; Zhongyu Wei

arXiv:2205.03755·cs.CL·December 27, 2022

DxFormer: A Decoupled Automatic Diagnostic System Based on Decoder-Encoder Transformer with Dense Symptom Representations

Wei Chen, Cheng Zhong, Jiajie Peng, Zhongyu Wei

PDF

Open Access 1 Repo

TL;DR

DxFormer is a novel decoupled diagnostic system using a decoder-encoder Transformer architecture that improves disease prediction accuracy by explicitly separating symptom inquiry and diagnosis, effectively learning clinical experience from data.

Contribution

The paper introduces DxFormer, a decoupled Transformer-based framework that separates symptom inquiry and disease diagnosis, achieving state-of-the-art results in automatic medical diagnosis.

Findings

01

Achieves superior symptom recall and diagnostic accuracy.

02

Effectively learns clinical experience from datasets.

03

Outperforms existing reinforcement learning and non-RL methods.

Abstract

Diagnosis-oriented dialogue system queries the patient's health condition and makes predictions about possible diseases through continuous interaction with the patient. A few studies use reinforcement learning (RL) to learn the optimal policy from the joint action space of symptoms and diseases. However, existing RL (or Non-RL) methods cannot achieve sufficiently good prediction accuracy, still far from its upper limit. To address the problem, we propose a decoupled automatic diagnostic framework DxFormer, which divides the diagnosis process into two steps: symptom inquiry and disease diagnosis, where the transition from symptom inquiry to disease diagnosis is explicitly determined by the stopping criteria. In DxFormer, we treat each symptom as a token, and formalize the symptom inquiry and disease diagnosis to a language generation model and a sequence classification model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lemuria-wchen/dxformer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Residual Connection · Position-Wise Feed-Forward Layer · Dense Connections · Dropout · Softmax