Generative AI Spotlights the Human Core of Data Science: Implications for Education
Nathan Taback

TL;DR
Generative AI automates routine data science tasks but emphasizes the importance of human reasoning skills, suggesting curricula should focus on core human competencies and effective prompt-based interactions.
Contribution
The paper highlights the enduring human core in data science education amidst GAI automation, proposing curriculum and assessment adjustments.
Findings
GAI automates data cleaning, summarizing, visualization, modeling, and reporting.
Core human skills like problem formulation and ethical reasoning remain essential.
Educational focus should shift to developing human reasoning and judgment in data science.
Abstract
Generative AI (GAI) reveals an irreducible human core at the center of data science: advances in GAI should sharpen, rather than diminish, the focus on human reasoning in data science education. GAI can now execute many routine data science workflows, including cleaning, summarizing, visualizing, modeling, and drafting reports. Yet the competencies that matter most remain irreducibly human: problem formulation, measurement and design, causal identification, statistical and computational reasoning, ethics and accountability, and sensemaking. Drawing on Donoho's Greater Data Science framework, Nolan and Temple Lang's vision of computational literacy, and the McLuhan-Culkin insight that we shape our tools and thereafter our tools shape us, this paper traces the emergence of data science through three converging lineages: Tukey's intellectual vision of data analysis as a science, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
