Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning
Liyan Xu, Julien Hogan, Rachel E. Patzer, Jinho D. Choi

TL;DR
This paper introduces a reinforcement learning method to identify and remove noise from long clinical documents, improving readmission prediction accuracy in small datasets with complex, noisy text.
Contribution
It proposes a novel reinforcement learning approach for noise extraction in long clinical texts, demonstrating effectiveness over deep learning encoders and baseline models.
Findings
Bag-of-words encoder outperforms deep learning encoders
Reinforcement learning prunes 25% of text segments
RL identifies both generic and task-specific noisy tokens
Abstract
This paper presents a reinforcement learning approach to extract noise in long clinical documents for the task of readmission prediction after kidney transplant. We face the challenges of developing robust models on a small dataset where each document may consist of over 10K tokens with full of noise including tabular text and task-irrelevant sentences. We first experiment four types of encoders to empirically decide the best document representation, and then apply reinforcement learning to remove noisy text from the long documents, which models the noise extraction process as a sequential decision problem. Our results show that the old bag-of-words encoder outperforms deep learning-based encoders on this task, and reinforcement learning is able to improve upon baseline while pruning out 25% text segments. Our analysis depicts that reinforcement learning is able to identify both typical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Domain Adaptation and Few-Shot Learning
MethodsPruning
