Layout-induced Video Representation for Recognizing Agent-in-Place Actions
Ruichi Yu, Hongcheng Wang, Ang Li, Jingxiao Zheng, Vlad I. Morariu,, Larry S. Davis

TL;DR
This paper proposes a novel scene layout-based video representation, LIVR, that improves recognition of agent-in-place actions in outdoor surveillance by capturing geometric relationships and generalizing across unseen layouts.
Contribution
The introduction of LIVR, a layout-induced video representation that encodes scene geometry and topology, enabling better generalization to unseen outdoor scene layouts for agent-in-place action recognition.
Findings
LIVR improves generalization to unseen layouts.
The Agent-in-Place Action dataset supports the effectiveness of LIVR.
LIVR outperforms baseline methods in recognition accuracy.
Abstract
We address the recognition of agent-in-place actions, which are associated with agents who perform them and places where they occur, in the context of outdoor home surveillance. We introduce a representation of the geometry and topology of scene layouts so that a network can generalize from the layouts observed in the training set to unseen layouts in the test set. This Layout-Induced Video Representation (LIVR) abstracts away low-level appearance variance and encodes geometric and topological relationships of places in a specific scene layout. LIVR partitions the semantic features of a video clip into different places to force the network to learn place-based feature descriptions; to predict the confidence of each action, LIVR aggregates features from the place associated with an action and its adjacent places on the scene layout. We introduce the Agent-in-Place Action dataset to show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
