# A Clinical Approach to Training Effective Data Scientists

**Authors:** Kit T Rodolfa, Adolfo De Unanue, Matt Gee, Rayid Ghani

arXiv: 1905.06875 · 2020-01-28

## TL;DR

This paper proposes a practical, residency-style data science master's program emphasizing real-world problem solving through industry partnerships, aiming to improve the relevance and readiness of data science education.

## Contribution

It introduces a novel residency-based curriculum model for data science education inspired by medical training, focusing on hands-on experience with real-world problems.

## Key findings

- Enhanced practical skills through industry collaboration
- Bridging the gap between theory and real-world application
- Potential for shorter formats to augment existing programs

## Abstract

Like medicine, psychology, or education, data science is fundamentally an applied discipline, with most students who receive advanced degrees in the field going on to work on practical problems. Unlike these disciplines, however, data science education remains heavily focused on theory and methods, and practical coursework typically revolves around cleaned or simplified data sets that have little analog in professional applications. We believe that the environment in which new data scientists are trained should more accurately reflect that in which they will eventually practice and propose here a data science master's degree program that takes inspiration from the residency model used in medicine. Students in the suggested program would spend three years working on a practical problem with an industry, government, or nonprofit partner, supplemented with coursework in data science methods and theory. We also discuss how this program can also be implemented in shorter formats to augment existing professional masters programs in different disciplines. This approach to learning by doing is designed to fill gaps in our current approach to data science education and ensure that students develop the skills they need to practice data science in a professional context and under the many constraints imposed by that context.

---
Source: https://tomesphere.com/paper/1905.06875