Non-Linear Temporal Subspace Representations for Activity Recognition
Anoop Cherian, Suvrit Sra, Stephen Gould, Richard Hartley

TL;DR
This paper introduces a novel kernelized rank pooling method for representing temporal evolution in multivariate time-series data, improving human activity recognition accuracy.
Contribution
It proposes a new non-linear pooling technique using kernelized rank pooling and kernelized PCA, optimized on Grassmann manifolds for better activity recognition.
Findings
Achieves state-of-the-art results on multiple action recognition datasets.
Effectively captures non-linear temporal dynamics of actions.
Demonstrates robustness across diverse feature modalities.
Abstract
Representations that can compactly and effectively capture the temporal evolution of semantic content are important to computer vision and machine learning algorithms that operate on multi-variate time-series data. We investigate such representations motivated by the task of human action recognition. Here each data instance is encoded by a multivariate feature (such as via a deep CNN) where action dynamics are characterized by their variations in time. As these features are often non-linear, we propose a novel pooling method, kernelized rank pooling, that represents a given sequence compactly as the pre-image of the parameters of a hyperplane in a reproducing kernel Hilbert space, projections of data onto which captures their temporal order. We develop this idea further and show that such a pooling scheme can be cast as an order-constrained kernelized PCA objective. We then propose to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPrincipal Components Analysis
