A Deep Moving-camera Background Model
Guy Erez, Ron Shapira Weber, Oren Freifeld

TL;DR
DeepMCBM introduces an end-to-end deep learning approach for moving-camera background modeling that overcomes previous limitations in scalability, camera motion support, and the need for specialized initialization, achieving state-of-the-art results.
Contribution
The paper presents a novel joint alignment strategy with a spatial transformer net and an autoencoder conditioned on unwarped moments, enabling scalable, flexible, and regularization-free background modeling for moving cameras.
Findings
Supports a broad range of camera motions.
Outperforms existing methods on various videos.
Eliminates the need for specialized initialization.
Abstract
In video analysis, background models have many applications such as background/foreground separation, change detection, anomaly detection, tracking, and more. However, while learning such a model in a video captured by a static camera is a fairly-solved task, in the case of a Moving-camera Background Model (MCBM), the success has been far more modest due to algorithmic and scalability challenges that arise due to the camera motion. Thus, existing MCBMs are limited in their scope and their supported camera-motion types. These hurdles also impeded the employment, in this unsupervised task, of end-to-end solutions based on deep learning (DL). Moreover, existing MCBMs usually model the background either on the domain of a typically-large panoramic image or in an online fashion. Unfortunately, the former creates several problems, including poor scalability, while the latter prevents the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Vision and Imaging · Human Pose and Action Recognition
MethodsSpatial Transformer
