Interpretable Machine Learning for Genomics
David S. Watson

TL;DR
This paper introduces interpretable machine learning techniques tailored for genomics, emphasizing their importance in understanding biological data and advancing precision medicine, while discussing current challenges and future directions.
Contribution
It provides a comprehensive overview of iML concepts, methodologies, and applications in genomics, highlighting recent advances and identifying open challenges for future research.
Findings
iML techniques are increasingly integrated into genomics research workflows
Current tools have limitations that hinder interpretability and applicability
Future research directions include cross-disciplinary collaboration and methodological improvements
Abstract
High-throughput technologies such as next generation sequencing allow biologists to observe cell function with unprecedented resolution, but the resulting datasets are too large and complicated for humans to understand without the aid of advanced statistical methods. Machine learning (ML) algorithms, which are designed to automatically find patterns in data, are well suited to this task. Yet these models are often so complex as to be opaque, leaving researchers with few clues about underlying mechanisms. Interpretable machine learning (iML) is a burgeoning subdiscipline of computational statistics devoted to making the predictions of ML models more intelligible to end users. This article is a gentle and critical introduction to iML, with an emphasis on genomic applications. I define relevant concepts, motivate leading methodologies, and provide a simple typology of existing approaches.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
