Statistical Foundation Behind Machine Learning and Its Impact on   Computer Vision

Lei Zhang; Heung-Yeung Shum

arXiv:2209.02691·cs.CV·September 7, 2022

Statistical Foundation Behind Machine Learning and Its Impact on Computer Vision

Lei Zhang, Heung-Yeung Shum

PDF

Open Access

TL;DR

This paper explores the statistical principles underlying machine learning, especially uniform convergence, and discusses how large-scale data and future research directions impact computer vision and model robustness.

Contribution

It provides a theoretical perspective on the foundation of machine learning and highlights the importance of structure and knowledge for future advancements.

Findings

01

Large-scale data reduces empirical and expected loss discrepancy.

02

Pre-training improves representation learning in computer vision.

03

Fundamental research needed on robustness, interpretability, and reasoning.

Abstract

This paper revisits the principle of uniform convergence in statistical learning, discusses how it acts as the foundation behind machine learning, and attempts to gain a better understanding of the essential problem that current deep learning algorithms are solving. Using computer vision as an example domain in machine learning, the discussion shows that recent research trends in leveraging increasingly large-scale data to perform pre-training for representation learning are largely to reduce the discrepancy between a practically tractable empirical loss and its ultimately desired but intractable expected loss. Furthermore, this paper suggests a few future research directions, predicts the continued increase of data, and argues that more fundamental research is needed on robustness, interpretability, and reasoning capabilities of machine learning by incorporating structure and knowledge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Neural Networks and Applications