# Supervise Thyself: Examining Self-Supervised Representations in   Interactive Environments

**Authors:** Evan Racah, Christopher Pal

arXiv: 1906.11951 · 2019-07-01

## TL;DR

This paper investigates how self-supervised learning methods develop environment representations in visual, interactive settings, assessing their ability to encode true state information and generalize across different scenarios.

## Contribution

It provides a comparative analysis of self-supervised representations in two visual environments, highlighting their dependence on environment visuals and dynamics.

## Key findings

- Representations vary significantly with environment visuals and dynamics.
- Some self-supervised methods better capture true state information.
- Generalization of representations is environment-dependent.

## Abstract

Self-supervised methods, wherein an agent learns representations solely by observing the results of its actions, become crucial in environments which do not provide a dense reward signal or have labels. In most cases, such methods are used for pretraining or auxiliary tasks for "downstream" tasks, such as control, exploration, or imitation learning. However, it is not clear which method's representations best capture meaningful features of the environment, and which are best suited for which types of environments. We present a small-scale study of self-supervised methods on two visual environments: Flappy Bird and Sonic The Hedgehog. In particular, we quantitatively evaluate the representations learned from these tasks in two contexts: a) the extent to which the representations capture true state information of the agent and b) how generalizable these representations are to novel situations, like new levels and textures. Lastly, we evaluate these self-supervised features by visualizing which parts of the environment they focus on. Our results show that the utility of the representations is highly dependent on the visuals and dynamics of the environment.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.11951/full.md

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/1906.11951/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1906.11951/full.md

---
Source: https://tomesphere.com/paper/1906.11951