Learning Temporal Alignment Uncertainty for Efficient Event Detection
Iman Abbasnejad, Sridha Sridharan, Simon Denman, Clinton Fookes, Simon, Lucey

TL;DR
This paper introduces a novel fixed-dimensional representation that captures temporal alignment uncertainty in video sequences, improving efficiency and accuracy in event detection tasks.
Contribution
It proposes a new temporal alignment uncertainty representation that preserves ordering and integrates with linear detection functions, enhancing event detection performance.
Findings
Significant performance improvements on multiple datasets
Effective integration with linear detection functions
Enhanced handling of temporal alignment uncertainty
Abstract
In this paper we tackle the problem of efficient video event detection. We argue that linear detection functions should be preferred in this regard due to their scalability and efficiency during estimation and evaluation. A popular approach in this regard is to represent a sequence using a bag of words (BOW) representation due to its: (i) fixed dimensionality irrespective of the sequence length, and (ii) its ability to compactly model the statistics in the sequence. A drawback to the BOW representation, however, is the intrinsic destruction of the temporal ordering information. In this paper we propose a new representation that leverages the uncertainty in relative temporal alignments between pairs of sequences while not destroying temporal ordering. Our representation, like BOW, is of a fixed dimensionality making it easily integrated with a linear detection function. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
