Behavior Discovery and Alignment of Articulated Object Classes from Unstructured Video
Luca Del Pero, Susanna Ricco, Rahul Sukthankar, Vittorio Ferrari

TL;DR
This paper introduces an automatic system that organizes unstructured videos of articulated objects by discovering behaviors and aligning instances, leveraging novel motion representations and deformation models to handle appearance variations.
Contribution
It presents a new motion representation (PoTs) for behavior discovery and a flexible Thin Plate Spline model for accurate pixel alignment across diverse object instances.
Findings
Outperforms state-of-the-art in behavior discovery with Improved DTF descriptor.
Achieves higher accuracy in spatial alignment than SIFT Flow.
Successfully handles significant appearance variations among object instances.
Abstract
We propose an automatic system for organizing the content of a collection of unstructured videos of an articulated object class (e.g. tiger, horse). By exploiting the recurring motion patterns of the class across videos, our system: 1) identifies its characteristic behaviors; and 2) recovers pixel-to-pixel alignments across different instances. Our system can be useful for organizing video collections for indexing and retrieval. Moreover, it can be a platform for learning the appearance or behaviors of object classes from Internet video. Traditional supervised techniques cannot exploit this wealth of data directly, as they require a large amount of time-consuming manual annotations. The behavior discovery stage generates temporal video intervals, each automatically trimmed to one instance of the discovered behavior, clustered by type. It relies on our novel motion representation for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
