Generalizing in the Real World with Representation Learning

Tegan Maharaj

arXiv:2210.09925·cs.LG·October 19, 2022

Generalizing in the Real World with Representation Learning

Tegan Maharaj

PDF

Open Access 1 Repo

TL;DR

This paper critically examines the assumptions and norms in machine learning, especially in deep networks, highlighting their limitations in real-world applications and proposing ways to improve generalization beyond traditional settings.

Contribution

It provides a critical analysis of current ML assumptions, identifies failures in real-world scenarios, and suggests practical approaches to enhance deep network generalization.

Findings

01

Deep networks often fail to generalize in out-of-distribution settings.

02

Current assumptions like i.i.d. data are frequently invalid in real-world applications.

03

Understanding why deep networks generalize remains an open challenge.

Abstract

Machine learning (ML) formalizes the problem of getting computers to learn from experience as optimization of performance according to some metric(s) on a set of data examples. This is in contrast to requiring behaviour specified in advance (e.g. by hard-coded rules). Formalization of this problem has enabled great progress in many applications with large real-world impact, including translation, speech recognition, self-driving cars, and drug discovery. But practical instantiations of this formalism make many assumptions - for example, that data are i.i.d.: independent and identically distributed - whose soundness is seldom investigated. And in making great progress in such a short time, the field has developed many norms and ad-hoc standards, focused on a relatively small range of problem settings. As applications of ML, particularly in artificial intelligence (AI) systems, become…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

teganmaharaj/zoneout
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Stochastic Gradient Optimization Techniques · Machine Learning in Healthcare