Applications of Machine Learning in Pharmacogenomics: Clustering Plasma Concentration-Time Curves
Jackson P. Lautier, Stella Grosser, Jessica Kim, Hyewon Kim, Junghi, Kim

TL;DR
This paper explores unsupervised machine learning clustering of plasma concentration-time curves to identify patterns and validate pharmacogenomic findings, demonstrating Euclidean distance as most effective for time series clustering.
Contribution
It introduces a novel application of time series clustering to pharmacokinetic data, validating pharmacogenomic results without genetic information and highlighting insights beyond summary statistics.
Findings
Euclidean distance is most suitable for clustering PK curves.
Unsupervised clustering validated pharmacogenomic conclusions.
Clustering revealed patterns not evident from summary statistics.
Abstract
Pharmaceutical researchers are continually searching for techniques to improve both drug development processes and patient outcomes. An area of recent interest is the potential for machine learning (ML) applications within pharmacology. One such application not yet given close study is the unsupervised clustering of plasma concentration-time curves, hereafter, pharmacokinetic (PK) curves. In this paper, we present our findings on how to cluster PK curves by their similarity. Specifically, we find clustering to be effective at identifying similar-shaped PK curves and informative for understanding patterns within each cluster of PK curves. Because PK curves are time series data objects, our approach utilizes the extensive body of research related to the clustering of time series data as a starting point. As such, we examine many dissimilarity measures between time series data objects to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Metabolomics and Mass Spectrometry Studies · Data Stream Mining Techniques
