BMN: Boundary-Matching Network for Temporal Action Proposal Generation
Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen

TL;DR
This paper introduces BMN, a boundary-matching network that efficiently generates precise temporal action proposals with reliable confidence scores, significantly improving performance on challenging datasets.
Contribution
The paper proposes the Boundary-Matching mechanism and an end-to-end network, BMN, for improved temporal action proposal generation with accurate boundaries and confidence scores.
Findings
Significant performance improvements on THUMOS-14 and ActivityNet-1.3 datasets.
Efficient end-to-end training of proposal generation and confidence scoring.
Achieves state-of-the-art temporal action detection when combined with classifiers.
Abstract
Temporal action proposal generation is an challenging and promising task which aims to locate temporal regions in real-world videos where action or event may occur. Current bottom-up proposal generation methods can generate proposals with precise boundary, but cannot efficiently generate adequately reliable confidence scores for retrieving proposals. To address these difficulties, we introduce the Boundary-Matching (BM) mechanism to evaluate confidence scores of densely distributed proposals, which denote a proposal as a matching pair of starting and ending boundaries and combine all densely distributed BM pairs into the BM confidence map. Based on BM mechanism, we propose an effective, efficient and end-to-end proposal generation method, named Boundary-Matching Network (BMN), which generates proposals with precise temporal boundaries as well as reliable confidence scores…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Human Motion and Animation
