iMapper: Interaction-guided Joint Scene and Human Motion Mapping from Monocular Videos
Aron Monszpart, Paul Guerrero, Duygu Ceylan, Ersin Yumer, Niloy J., Mitra

TL;DR
iMapper is a data-driven approach that jointly recovers scene arrangements and human interactions from monocular videos by exploiting the correlation between human motions and object configurations, especially under occlusion.
Contribution
The paper introduces iMapper, a novel method that jointly reasons about human-object interactions and scene layout from monocular videos, improving robustness under occlusion.
Findings
Significantly outperforms state-of-the-art methods in occluded scenarios.
Effectively recovers plausible scene arrangements and human motions.
Demonstrates robustness to moderate and heavy occlusion.
Abstract
A long-standing challenge in scene analysis is the recovery of scene arrangements under moderate to heavy occlusion, directly from monocular video. While the problem remains a subject of active research, concurrent advances have been made in the context of human pose reconstruction from monocular video, including image-space feature point detection and 3D pose recovery. These methods, however, start to fail under moderate to heavy occlusion as the problem becomes severely under-constrained. We approach the problems differently. We observe that people interact similarly in similar scenes. Hence, we exploit the correlation between scene object arrangement and motions performed in that scene in both directions: first, typical motions performed when interacting with objects inform us about possible object arrangements; and second, object arrangements, in turn, constrain the possible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
