# The Visual Centrifuge: Model-Free Layered Video Representations

**Authors:** Jean-Baptiste Alayrac, Jo\~ao Carreira, Andrew Zisserman

arXiv: 1812.01461 · 2019-04-05

## TL;DR

This paper introduces a learning-based, model-free approach for layered video representations using novel 3D convolutional architectures that can separate blended videos and generalize to single videos, capturing complex scene effects.

## Contribution

The paper presents a new uncertainty-aware 3D convolutional architecture for multi-layered video modeling that relaxes previous assumptions and enables effective separation of blended scenes.

## Key findings

- Models generalize to single videos, demonstrating color constancy and reflection separation.
- Quantitative and qualitative results show improved scene understanding.
- Approach captures complex scene effects like shadows and reflections.

## Abstract

True video understanding requires making sense of non-lambertian scenes where the color of light arriving at the camera sensor encodes information about not just the last object it collided with, but about multiple mediums -- colored windows, dirty mirrors, smoke or rain. Layered video representations have the potential of accurately modelling realistic scenes but have so far required stringent assumptions on motion, lighting and shape. Here we propose a learning-based approach for multi-layered video representation: we introduce novel uncertainty-capturing 3D convolutional architectures and train them to separate blended videos. We show that these models then generalize to single videos, where they exhibit interesting abilities: color constancy, factoring out shadows and separating reflections. We present quantitative and qualitative results on real world videos.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.01461/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1812.01461/full.md

## References

50 references — full list in the complete paper: https://tomesphere.com/paper/1812.01461/full.md

---
Source: https://tomesphere.com/paper/1812.01461