Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes Representation Learning
Thanh-Dung Le, Rita Noumeir, Jerome Rambaud, Guillaume Sans, and, Philippe Jouvet

TL;DR
This paper introduces an autoencoder-based method to reduce sparsity in clinical note representations, improving classification performance on small datasets with high-dimensional, sparse features.
Contribution
It proposes a novel autoencoder approach for compressing sparse clinical text features, enhancing classification accuracy and demonstrating the method within the information bottleneck framework.
Findings
Performance gains of up to 3% in classification metrics.
Achieved 92% accuracy and 91% recall in patient condition detection.
Validated the compression mechanism using the information bottleneck theory.
Abstract
When dealing with clinical text classification on a small dataset recent studies have confirmed that a well-tuned multilayer perceptron outperforms other generative classifiers, including deep learning ones. To increase the performance of the neural network classifier, feature selection for the learning representation can effectively be used. However, most feature selection methods only estimate the degree of linear dependency between variables and select the best features based on univariate statistical tests. Furthermore, the sparsity of the feature space involved in the learning representation is ignored. Goal: Our aim is therefore to access an alternative approach to tackle the sparsity by compressing the clinical representation feature space, where limited French clinical notes can also be dealt with effectively. Methods: This study proposed an autoencoder learning algorithm to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection
MethodsFeature Selection
