Activity recognition from videos with parallel hypergraph matching on GPUs
Eric Lombardi, Christian Wolf, Oya Celiktutan, B\"ulent, Sankur

TL;DR
This paper introduces a GPU-accelerated hypergraph matching method for activity recognition in videos, leveraging temporal properties for fast, exact solutions that outperform traditional approximate algorithms.
Contribution
It presents a novel parallel GPU algorithm for exact hypergraph matching tailored for activity recognition, exploiting temporal data properties for efficiency.
Findings
Achieves faster-than-real-time performance on medium-end GPUs.
Provides exact solutions to hypergraph matching problems in activity recognition.
Simplifies and regularizes graphical structures for efficient recursive minimization.
Abstract
In this paper, we propose a method for activity recognition from videos based on sparse local features and hypergraph matching. We benefit from special properties of the temporal domain in the data to derive a sequential and fast graph matching algorithm for GPUs. Traditionally, graphs and hypergraphs are frequently used to recognize complex and often non-rigid patterns in computer vision, either through graph matching or point-set matching with graphs. Most formulations resort to the minimization of a difficult discrete energy function mixing geometric or structural terms with data attached terms involving appearance features. Traditional methods solve this minimization problem approximately, for instance with spectral techniques. In this work, instead of solving the problem approximatively, the exact solution for the optimal assignment is calculated in parallel on GPUs. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Graph Theory and Algorithms · Multimodal Machine Learning Applications
