Generalizing in the Real World with Representation Learning
Tegan Maharaj

TL;DR
This paper critically examines the assumptions and norms in machine learning, especially in deep networks, highlighting their limitations in real-world applications and proposing ways to improve generalization beyond traditional settings.
Contribution
It provides a critical analysis of current ML assumptions, identifies failures in real-world scenarios, and suggests practical approaches to enhance deep network generalization.
Findings
Deep networks often fail to generalize in out-of-distribution settings.
Current assumptions like i.i.d. data are frequently invalid in real-world applications.
Understanding why deep networks generalize remains an open challenge.
Abstract
Machine learning (ML) formalizes the problem of getting computers to learn from experience as optimization of performance according to some metric(s) on a set of data examples. This is in contrast to requiring behaviour specified in advance (e.g. by hard-coded rules). Formalization of this problem has enabled great progress in many applications with large real-world impact, including translation, speech recognition, self-driving cars, and drug discovery. But practical instantiations of this formalism make many assumptions - for example, that data are i.i.d.: independent and identically distributed - whose soundness is seldom investigated. And in making great progress in such a short time, the field has developed many norms and ad-hoc standards, focused on a relatively small range of problem settings. As applications of ML, particularly in artificial intelligence (AI) systems, become…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Stochastic Gradient Optimization Techniques · Machine Learning in Healthcare
