Submission to Generic Event Boundary Detection Challenge@CVPR 2022:   Local Context Modeling and Global Boundary Decoding Approach

Jiaqi Tang; Zhaoyang Liu; Jing Tan; Chen Qian; Wayne Wu; Limin Wang

arXiv:2206.15268·cs.CV·July 1, 2022·1 cites

Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach

Jiaqi Tang, Zhaoyang Liu, Jing Tan, Chen Qian, Wayne Wu, Limin Wang

PDF

Open Access

TL;DR

This paper introduces a local context modeling and global boundary decoding approach for generic event boundary detection in videos, significantly improving accuracy by over 22% F1-score on the Kinetics-GEBD dataset.

Contribution

It proposes a novel combination of local context modeling and global boundary decoding for more accurate event boundary detection in videos.

Findings

01

Achieved 85.13% F1-score on Kinetics-GEBD dataset.

02

Improved F1-score by over 22% compared to baseline.

03

Demonstrated effectiveness of combined local and global modeling.

Abstract

Generic event boundary detection (GEBD) is an important yet challenging task in video understanding, which aims at detecting the moments where humans naturally perceive event boundaries. In this paper, we present a local context modeling and global boundary decoding approach for GEBD task. Local context modeling sub-network is proposed to perceive diverse patterns of generic event boundaries, and it generates powerful video representations and reliable boundary confidence. Based on them, global boundary decoding sub-network is exploited to decode event boundaries from a global view. Our proposed method achieves 85.13% F1-score on Kinetics-GEBD testing set, which achieves a more than 22% F1-score boost compared to the baseline method. The code is available at https://github.com/JackyTown/GEBD_Challenge_CVPR2022.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Analysis and Summarization