Hardware based Scale- and Rotation-Invariant Feature Extraction: A   Retrospective Analysis and Future Directions

Shoaib Ehsan; Adrian F. Clark; Klaus D. McDonald-Maier

arXiv:1504.07962·cs.CV·April 30, 2015

Hardware based Scale- and Rotation-Invariant Feature Extraction: A Retrospective Analysis and Future Directions

Shoaib Ehsan, Adrian F. Clark, Klaus D. McDonald-Maier

PDF

Open Access

TL;DR

This paper reviews hardware-based solutions for real-time, scale- and rotation-invariant feature extraction in computer vision, analyzing past progress, current challenges, and future research directions.

Contribution

It provides a comprehensive retrospective analysis of hardware implementations for invariant feature extraction and outlines future research avenues in this emerging field.

Findings

01

Hardware solutions enable real-time performance for complex algorithms.

02

Current methods achieve 2-3 Hz speeds on desktop computers.

03

Identifies research gaps and suggests future hardware design strategies.

Abstract

Computer Vision techniques represent a class of algorithms that are highly computation and data intensive in nature. Generally, performance of these algorithms in terms of execution speed on desktop computers is far from real-time. Since real-time performance is desirable in many applications, special-purpose hardware is required in most cases to achieve this goal. Scale- and rotation-invariant local feature extraction is a low level computer vision task with very high computational complexity. The state-of-the-art algorithms that currently exist in this domain, like SIFT and SURF, suffer from slow execution speeds and at best can only achieve rates of 2-3 Hz on modern desktop computers. Hardware-based scale- and rotation-invariant local feature extraction is an emerging trend enabling real-time performance for these computationally complex algorithms. This paper takes a retrospective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · CCD and CMOS Imaging Sensors · Advanced Vision and Imaging

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings