Hardware based Scale- and Rotation-Invariant Feature Extraction: A Retrospective Analysis and Future Directions
Shoaib Ehsan, Adrian F. Clark, Klaus D. McDonald-Maier

TL;DR
This paper reviews hardware-based solutions for real-time, scale- and rotation-invariant feature extraction in computer vision, analyzing past progress, current challenges, and future research directions.
Contribution
It provides a comprehensive retrospective analysis of hardware implementations for invariant feature extraction and outlines future research avenues in this emerging field.
Findings
Hardware solutions enable real-time performance for complex algorithms.
Current methods achieve 2-3 Hz speeds on desktop computers.
Identifies research gaps and suggests future hardware design strategies.
Abstract
Computer Vision techniques represent a class of algorithms that are highly computation and data intensive in nature. Generally, performance of these algorithms in terms of execution speed on desktop computers is far from real-time. Since real-time performance is desirable in many applications, special-purpose hardware is required in most cases to achieve this goal. Scale- and rotation-invariant local feature extraction is a low level computer vision task with very high computational complexity. The state-of-the-art algorithms that currently exist in this domain, like SIFT and SURF, suffer from slow execution speeds and at best can only achieve rates of 2-3 Hz on modern desktop computers. Hardware-based scale- and rotation-invariant local feature extraction is an emerging trend enabling real-time performance for these computationally complex algorithms. This paper takes a retrospective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · CCD and CMOS Imaging Sensors · Advanced Vision and Imaging
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
