Object Recognition by Using Multi-level Feature Point Extraction

Yang Cheng; Timeo Dubois

arXiv:1710.10522·cs.CV·October 31, 2017

Object Recognition by Using Multi-level Feature Point Extraction

Yang Cheng, Timeo Dubois

PDF

Open Access

TL;DR

This paper introduces a real-time object recognition method using multi-level feature point extraction integrated with a Naive Bayesian classifier, achieving high accuracy and efficiency across diverse conditions.

Contribution

It presents a novel multi-level feature analysis approach combined with a Naive Bayesian framework for robust, scalable, and real-time object recognition.

Findings

01

Performs well with large variation in illumination and perspective

02

Achieves high accuracy on challenging datasets

03

Operates efficiently on large-resolution images

Abstract

In this paper, we present a novel approach for object recognition in real-time by employing multilevel feature analysis and demonstrate the practicality of adapting feature extraction into a Naive Bayesian classification framework that enables simple, efficient, and robust performance. We also show the proposed method scales well as the number of level-classes grows. To effectively understand the patches surrounding a keypoint, the trained classifier uses hundreds of simple binary features and models class posterior probabilities. In addition, the classification process is computationally cheap under the assumed independence between arbitrary sets of features. Even though for some particular scenarios, this assumption can be invalid. We demonstrate that the efficient classifier nevertheless performs remarkably well on image datasets with a large variation in the illumination environment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings