The Use of Autoencoders for Discovering Patient Phenotypes

Harini Suresh; Peter Szolovits; Marzyeh Ghassemi

arXiv:1703.07004 · cs.LG · March 22, 2017 · 21 cites

TL;DR

This paper explores the application of autoencoders to identify patient phenotypes by creating low-dimensional representations, comparing different autoencoder architectures on a large clinical dataset.

Contribution

The paper compares autoencoders that take fixed-length inputs of concatenated timesteps with a recurrent sequence-to-sequence autoencoder for discovering patient phenotypes from clinical time series data.

Findings

1. Autoencoders effectively capture meaningful patient phenotypes.
2. Sequence-to-sequence autoencoders outperform fixed-input models.
3. The approach scales to large clinical datasets like MIMIC III.

Abstract

We use autoencoders to create low-dimensional embeddings of underlying patient phenotypes that we hypothesize are a governing factor in determining how different patients will react to different interventions. We compare the performance of autoencoders that take fixed length sequences of concatenated timesteps as input with a recurrent sequence-to-sequence autoencoder. We evaluate our methods on around 35,500 patients from the latest MIMIC III dataset from Beth Israel Deaconess Hospital.
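
To make the fixed-length variant in the abstract concrete, the sketch below flattens windows of consecutive timesteps into single vectors and trains a small autoencoder on them. It is an illustration only, not the authors' implementation: the abstract does not give layer sizes, activations, window length, or the training procedure, so the names `concat_timesteps` and `TinyAutoencoder`, the tanh encoder, the linear decoder, the plain gradient-descent loop, and the synthetic data are all assumptions. The recurrent sequence-to-sequence variant is not shown.

```python
import numpy as np


def concat_timesteps(series, window):
    """Split a (T, F) series into non-overlapping windows of `window`
    timesteps and flatten each window into one vector of length window * F.
    Trailing timesteps that do not fill a whole window are dropped."""
    n_steps, n_features = series.shape
    n_windows = n_steps // window
    return series[: n_windows * window].reshape(n_windows, window * n_features)


class TinyAutoencoder:
    """Hypothetical one-hidden-layer autoencoder (tanh encoder, linear
    decoder) trained by full-batch gradient descent on mean squared error."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.1, (n_hidden, n_in))
        self.b2 = np.zeros(n_in)

    def encode(self, X):
        # Low-dimensional embedding of each flattened window.
        return np.tanh(X @ self.W1 + self.b1)

    def decode(self, Z):
        return Z @ self.W2 + self.b2

    def loss(self, X):
        return float(np.mean((self.decode(self.encode(X)) - X) ** 2))

    def step(self, X, lr=0.1):
        # Forward pass.
        Z = self.encode(X)
        R = self.decode(Z)
        # Backward pass for the mean squared reconstruction error.
        G = 2.0 * (R - X) / X.size
        dW2 = Z.T @ G
        db2 = G.sum(axis=0)
        dA = (G @ self.W2.T) * (1.0 - Z ** 2)  # tanh derivative
        dW1 = X.T @ dA
        db1 = dA.sum(axis=0)
        # Gradient-descent update.
        self.W1 -= lr * dW1
        self.b1 -= lr * db1
        self.W2 -= lr * dW2
        self.b2 -= lr * db2


# Synthetic stand-in for one patient's measurements: 120 timesteps, 5 signals.
rng = np.random.default_rng(0)
series = rng.normal(size=(120, 5)) + np.array([1.0, -0.5, 0.3, 0.0, 2.0])

X = concat_timesteps(series, window=6)   # shape (20, 30)
model = TinyAutoencoder(n_in=X.shape[1], n_hidden=4)

initial_loss = model.loss(X)
for _ in range(300):
    model.step(X)
final_loss = model.loss(X)

embeddings = model.encode(X)             # shape (20, 4)
```

Here each row of `embeddings` is a 4-dimensional code for one 6-timestep window. The fixed window length is what distinguishes this design from a recurrent encoder, which can consume sequences of varying length without flattening them first.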

Taxonomy

Topics: Machine Learning in Healthcare · Topic Modeling · Biomedical Text Mining and Ontologies