Efficient Implementation of a Recognition System Using the Cortex   Ventral Stream Model

Ahmad W. Bitar (Ahmad Wasfi Bitar); Mohammad M. Mansour; Ali Chehab

arXiv:1711.07827·cs.CV·November 22, 2017

Efficient Implementation of a Recognition System Using the Cortex Ventral Stream Model

Ahmad W. Bitar (Ahmad Wasfi Bitar), Mohammad M. Mansour, Ali Chehab

PDF

TL;DR

This paper presents an optimized implementation of the HMAX visual recognition model that improves accuracy and reduces computational complexity through targeted enhancements at multiple processing layers.

Contribution

It introduces specific optimizations for the HMAX model layers, including illumination removal, Gabor filter convolution, and efficient prototype generation, enhancing performance on visual recognition tasks.

Findings

01

Over 10% accuracy improvement at S1 layer

02

Reduced computational complexity overall

03

Slight accuracy increases at C1 and S2 layers

Abstract

In this paper, an efficient implementation for a recognition system based on the original HMAX model of the visual cortex is proposed. Various optimizations targeted to increase accuracy at the so-called layers S1, C1, and S2 of the HMAX model are proposed. At layer S1, all unimportant information such as illumination and expression variations are eliminated from the images. Each image is then convolved with 64 separable Gabor filters in the spatial domain. At layer C1, the minimum scales values are exploited to be embedded into the maximum ones using the additive embedding space. At layer S2, the prototypes are generated in a more efficient way using Partitioning Around Medoid (PAM) clustering algorithm. The impact of these optimizations in terms of accuracy and computational complexity was evaluated on the Caltech101 database, and compared with the baseline performance using support…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.