Adaptive Subsampling for ROI-based Visual Tracking: Algorithms and FPGA Implementation
Odrika Iqbal, Victor Isaac Torres Muro, Sameeksha Katoch, Andreas, Spanias, Suren Jayasuriya

TL;DR
This paper introduces adaptive subsampling algorithms for energy-efficient ROI-based visual tracking, combining object detection and prediction, implemented on FPGA, achieving competitive accuracy and power efficiency.
Contribution
It presents a novel adaptive subsampling approach for visual tracking that integrates ROI prediction with FPGA implementation, enhancing energy efficiency and real-time performance.
Findings
Coupling ECO tracker with Kalman filter yields AUC scores of 0.4568 (OTB100) and 0.3471 (LaSOT).
The FPGA implementation achieves near-real-time tracking at 19.23 FPS.
Power consumption is approximately 4 W for ECO-based and 6 W for YOLO-based algorithms.
Abstract
There is tremendous scope for improving the energy efficiency of embedded vision systems by incorporating programmable region-of-interest (ROI) readout in the image sensor design. In this work, we study how ROI programmability can be leveraged for tracking applications by anticipating where the ROI will be located in future frames and switching pixels off outside of this region. We refer to this process of ROI prediction and corresponding sensor configuration as adaptive subsampling. Our adaptive subsampling algorithms comprise an object detector and an ROI predictor (Kalman filter) which operate in conjunction to optimize the energy efficiency of the vision pipeline with the end task being object tracking. To further facilitate the implementation of our adaptive algorithms in real life, we select a candidate algorithm and map it onto an FPGA. Leveraging Xilinx Vitis AI tools, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Visual Attention and Saliency Detection · Infrared Target Detection Methodologies
MethodsThe Educational Competition Optimizer · You Only Look Once
