Data Science: A Comprehensive Overview
Longbing Cao

TL;DR
This paper provides a comprehensive overview of data science, covering its evolution, key concepts, challenges, opportunities, and future directions in the era of big data and data economy.
Contribution
It offers the first extensive big picture survey and tutorial on data science, integrating various aspects and future perspectives in the field.
Findings
Data science has evolved from data analysis to a distinct discipline.
Major challenges include data quality, privacy, and integration.
Opportunities lie in industrialization, services, and education in data economy.
Abstract
The twenty-first century has ushered in the age of big data and data economy, in which data DNA, which carries important knowledge, insights and potential, has become an intrinsic constituent of all data-based organisms. An appropriate understanding of data DNA and its organisms relies on the new field of data science and its keystone, analytics. Although it is widely debated whether big data is only hype and buzz, and data science is still in a very early phase, significant challenges and opportunities are emerging or have been inspired by the research, innovation, business, profession, and education of data science. This paper provides a comprehensive survey and tutorial of the fundamental aspects of data science: the evolution from data analysis to data science, the data science concepts, a big picture of the era of data science, the major challenges and directions in data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Research Data Management Practices · Data Quality and Management
