OpenCL-accelerated object classification in video streams using Spatial Pooler of Hierarchical Temporal Memory
Maciej Wielgosz, Marcin Pietro\'n

TL;DR
This paper presents an OpenCL-accelerated object classification system in video streams using Hierarchical Temporal Memory, achieving high accuracy and significant speed-up on selected computational parts, though overall acceleration is limited by data transfer overhead.
Contribution
The paper introduces an OpenCL implementation for accelerating HTM-based object classification in video streams, demonstrating substantial kernel speed-ups and analyzing performance trade-offs.
Findings
F1 score up to 0.95 achieved.
Kernel speed-up up to 632x on GPU.
Overall acceleration limited to 6.5x due to transfer times.
Abstract
We present a method to classify objects in video streams using a brain-inspired Hierarchical Temporal Memory (HTM) algorithm. Object classification is a challenging task where humans still significantly outperform machine learning algorithms due to their unique capabilities. We have implemented a system which achieves very promising performance in terms of recognition accuracy. Unfortunately, conducting more advanced experiments is very computationally demanding; some of the trials run on a standard CPU may take as long as several days for 960x540 video streams frames. Therefore we have decided to accelerate selected parts of the system using OpenCL. In particular, we seek to determine to what extent porting selected and computationally demanding parts of a core may speed up calculations. The classification accuracy of the system was examined through a series of experiments and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
