TL;DR
The paper introduces LP$^{2}$DH, a novel hashing framework for dynamic texture recognition that preserves locality and achieves state-of-the-art accuracy on major benchmarks.
Contribution
It proposes a joint encoding and optimization method that preserves local structure in pixel differences, improving dynamic texture recognition performance.
Findings
Achieved 99.80% accuracy on UCLA benchmark.
Outperformed existing methods on DynTex++ and YUPENN datasets.
Source code is publicly available at the provided GitHub link.
Abstract
Spatiotemporal Local Binary Pattern (STLBP) is a widely used dynamic texture descriptor, but it suffers from extremely high dimensionality. To tackle this, STLBP features are often extracted on three orthogonal planes, which sacrifice inter-plane correlation. In this work, we propose a Locality-Preserving Pixel-Difference Hashing (LPDH) framework that jointly encodes pixel differences in the full spatiotemporal neighbourhood. LPDH transforms Pixel-Difference Vectors (PDVs) into compact binary codes with maximal discriminative power. Furthermore, we incorporate a locality-preserving embedding to maintain the PDVs' local structure before and after hashing. Then, a curvilinear search strategy is utilized to jointly optimize the hashing matrix and binary codes via gradient descent on the Stiefel manifold. After hashing, dictionary learning is applied to encode the binary vectors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
