ROAM: a Rich Object Appearance Model with Application to Rotoscoping

Ondrej Miksik; Juan-Manuel Pérez-Rúa; Philip H. S. Torr; Patrick Pérez

arXiv:1612.01495 · cs.CV · December 6, 2016

Open Access

TL;DR

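This paper introduces ROAM, a rich appearance model for rotoscoping. It combines local foreground/background appearance models spread along the object's outline, a global appearance model of the enclosed object, and a set of distinctive foreground landmarks, so that an object can be tracked through a video given only a first closed outline.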

Contribution

ROAM presents a novel, rich appearance model that integrates local, global, and landmark features for improved rotoscoping, with efficient optimization and online adaptation capabilities.

Findings

01  Outperforms existing segmentation-based rotoscoping tools

02  Enables simple initialization and online adaptation

03  Validated both qualitatively and quantitatively

Abstract

Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines. While pixel-wise segmentation techniques can help for this task, professional rotoscoping tools rely on parametric curves that offer the artists a much better interactive control on the definition, editing and manipulation of the segments of interest. Sticking to this prevalent rotoscoping paradigm, we propose a novel framework to capture and track the visual aspect of an arbitrary object in a scene, given a first closed outline of this object. This model combines a collection of local foreground/background appearance models spread along the outline, a global appearance model of the enclosed object and a set of distinctive foreground landmarks. The structure of this rich appearance model allows simple initialization,…
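The three components named in the abstract can be pictured as one data structure. The following is a minimal sketch under stated assumptions, not the authors' implementation: the names `RichAppearance`, `LocalModel` and `build_model` are hypothetical, mean grayscale intensity stands in for whatever appearance features the paper actually uses, and landmark detection is left out (the list stays empty).

```python
from dataclasses import dataclass


def point_in_polygon(p, poly):
    """Even-odd ray-casting test: is point p inside the closed polygon?"""
    x, y = p
    inside = False
    n = len(poly)
    for i in range(n):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % n]
        # Only edges that straddle the horizontal line through p can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def mean(values):
    return sum(values) / len(values) if values else float("nan")


@dataclass
class LocalModel:
    """Hypothetical local model: fg/bg statistics in a window around one node."""
    node: tuple
    fg_mean: float
    bg_mean: float


@dataclass
class RichAppearance:
    """Hypothetical container for the three components listed in the abstract."""
    outline: list          # closed outline as a list of (x, y) nodes
    local_models: list     # one LocalModel per outline node
    global_fg_mean: float  # global model of the enclosed object
    landmarks: list        # foreground landmarks (not computed in this sketch)


def build_model(image, outline, radius=2):
    """Build the sketch model from a grayscale image (list of rows) and an outline."""
    h, w = len(image), len(image[0])
    # A pixel is foreground if its center lies inside the outline.
    inside = [[point_in_polygon((x + 0.5, y + 0.5), outline) for x in range(w)]
              for y in range(h)]
    global_fg = mean([image[y][x] for y in range(h) for x in range(w)
                      if inside[y][x]])
    local_models = []
    for nx, ny in outline:
        fg, bg = [], []
        for y in range(max(0, int(ny) - radius), min(h, int(ny) + radius + 1)):
            for x in range(max(0, int(nx) - radius), min(w, int(nx) + radius + 1)):
                (fg if inside[y][x] else bg).append(image[y][x])
        local_models.append(LocalModel((nx, ny), mean(fg), mean(bg)))
    return RichAppearance(outline, local_models, global_fg, [])
```

Calling `build_model(image, outline)` with a first closed outline returns one local foreground/background pair per outline node plus a single global foreground statistic, which mirrors the split between models "spread along the outline" and a model "of the enclosed object".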

Taxonomy

Topics: Visual Attention and Saliency Detection · Advanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis