Medical symptom recognition from patient text: An active learning approach for long-tailed multilabel distributions
Ali Mottaghi, Prathusha K Sarma, Xavier Amatriain, Serena Yeung,, Anitha Kannan

TL;DR
This paper presents an active learning approach for recognizing multiple medical symptoms from patient texts, addressing data scarcity and long-tailed symptom distributions to improve symptom coverage.
Contribution
It introduces a novel active learning method that leverages a learned latent space to select informative examples, enhancing multi-label symptom recognition in long-tailed data.
Findings
Improved symptom coverage with fewer labeled examples
Effective handling of long-tailed symptom distribution
Enhanced multi-label recognition accuracy
Abstract
We study the problem of medical symptoms recognition from patient text, for the purposes of gathering pertinent information from the patient (known as history-taking). A typical patient text is often descriptive of the symptoms the patient is experiencing and a single instance of such a text can be "labeled" with multiple symptoms. This makes learning a medical symptoms recognizer challenging on account of i) the lack of availability of voluminous annotated data as well as ii) the large unknown universe of multiple symptoms that a single text can map to. Furthermore, patient text is often characterized by a long tail in the data (i.e., some labels/symptoms occur more frequently than others for e.g "fever" vs "hematochezia"). In this paper, we introduce an active learning method that leverages underlying structure of a continually refined, learned latent space to select the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Algorithms · Biomedical Text Mining and Ontologies
