Understanding the World Through Action

Sergey Levine

arXiv:2110.12543·cs.LG·October 26, 2021

Understanding the World Through Action

Sergey Levine

PDF

Open Access 1 Repo

TL;DR

This paper proposes a reinforcement learning framework that leverages large unlabeled datasets with self-supervised objectives, aiming to improve the scalability and alignment of machine learning models with downstream tasks.

Contribution

It introduces a novel reinforcement learning-based approach for utilizing unlabeled data through self-supervised objectives combined with offline RL techniques.

Findings

01

Framework aligns better with downstream tasks

02

Leverages large unlabeled datasets effectively

03

Builds on recent advances in self-supervised RL

Abstract

The recent history of machine learning research has taught us that machine learning methods can be most effective when they are provided with very large, high-capacity models, and trained on very large and diverse datasets. This has spurred the community to search for ways to remove any bottlenecks to scale. Often the foremost among such bottlenecks is the need for human effort, including the effort of curating and labeling datasets. As a result, considerable attention in recent years has been devoted to utilizing unlabeled data, which can be collected in vast quantities. However, some of the most widely used methods for training on such unlabeled data themselves require human-designed objective functions that must correlate in some meaningful way to downstream tasks. I will argue that a general, principled, and powerful framework for utilizing unlabeled data can be derived from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yifan123/arxiv_spider
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning