Object Recognition by Using Multi-level Feature Point Extraction
Yang Cheng, Timeo Dubois

TL;DR
This paper introduces a real-time object recognition method using multi-level feature point extraction integrated with a Naive Bayesian classifier, achieving high accuracy and efficiency across diverse conditions.
Contribution
It presents a novel multi-level feature analysis approach combined with a Naive Bayesian framework for robust, scalable, and real-time object recognition.
Findings
Performs well with large variation in illumination and perspective
Achieves high accuracy on challenging datasets
Operates efficiently on large-resolution images
Abstract
In this paper, we present a novel approach for object recognition in real-time by employing multilevel feature analysis and demonstrate the practicality of adapting feature extraction into a Naive Bayesian classification framework that enables simple, efficient, and robust performance. We also show the proposed method scales well as the number of level-classes grows. To effectively understand the patches surrounding a keypoint, the trained classifier uses hundreds of simple binary features and models class posterior probabilities. In addition, the classification process is computationally cheap under the assumed independence between arbitrary sets of features. Even though for some particular scenarios, this assumption can be invalid. We demonstrate that the efficient classifier nevertheless performs remarkably well on image datasets with a large variation in the illumination environment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
