Changes from Classical Statistics to Modern Statistics and Data Science
Kai Zhang, Shan Liu, and Momiao Xiong

TL;DR
This paper emphasizes the need to extend classical statistical methods from Euclidean to non-Euclidean spaces to better analyze modern complex data types in AI and data science.
Contribution
It advocates for developing a unified framework that integrates Euclidean and non-Euclidean data analysis, advancing the theoretical foundation of modern statistics and AI.
Findings
Highlighting limitations of classical Euclidean-based methods
Proposing a shift towards non-Euclidean data analysis frameworks
Encouraging integration of statistics and AI for next-generation data science
Abstract
A coordinate system is a foundation for every quantitative science, engineering, and medicine. Classical physics and statistics are based on the Cartesian coordinate system. The classical probability and hypothesis testing theory can only be applied to Euclidean data. However, modern data in the real world are from natural language processing, mathematical formulas, social networks, transportation and sensor networks, computer visions, automations, and biomedical measurements. The Euclidean assumption is not appropriate for non Euclidean data. This perspective addresses the urgent need to overcome those fundamental limitations and encourages extensions of classical probability theory and hypothesis testing , diffusion models and stochastic differential equations from Euclidean space to non Euclidean space. Artificial intelligence such as natural language processing, computer vision,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Geochemistry and Geologic Mapping · Computational Physics and Python Applications
MethodsDiffusion · Network On Network
