OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions

Guanyu Zhou; Wenxuan Liu; Wenxin Huang; Xuemei Jia; Xian Zhong; Chia-Wen Lin

arXiv:2411.15729·cs.CV·June 10, 2025

OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions

Guanyu Zhou, Wenxuan Liu, Wenxin Huang, Xuemei Jia, Xian Zhong, Chia-Wen Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces OccludeNet, a large-scale occluded video dataset with real and synthetic occlusions, and proposes a causal model-based method to improve action recognition robustness under occlusion.

Contribution

It provides a new dataset with diverse occlusion scenarios and develops a causal inference approach for more robust action recognition in occluded scenes.

Findings

01

Occlusion impacts different action classes variably.

02

The causal model improves recognition accuracy under occlusion.

03

The dataset enables studying causal links in occluded scenes.

Abstract

The lack of occlusion data in common action recognition video datasets limits model robustness and hinders consistent performance gains. We build OccludeNet, a large-scale occluded video dataset including both real and synthetic occlusion scenes in different natural settings. OccludeNet includes dynamic occlusion, static occlusion, and multi-view interactive occlusion, addressing gaps in current datasets. Our analysis shows occlusion affects action classes differently: actions with low scene relevance and partial body visibility see larger drops in accuracy. To overcome the limits of existing occlusion-aware methods, we propose a structural causal model for occluded scenes and introduce the Causal Action Recognition (CAR) method, which uses backdoor adjustment and counterfactual reasoning. This approach strengthens key actor information and improves model robustness to occlusion. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

The-Martyr/OccludeNet-Dataset
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods