# Look Further to Recognize Better: Learning Shared Topics and   Category-Specific Dictionaries for Open-Ended 3D Object Recognition

**Authors:** S. Hamidreza Kasaei

arXiv: 1907.12924 · 2019-07-31

## TL;DR

This paper introduces an open-ended 3D object recognition method that learns shared visual topics and category-specific dictionaries, improving scalability and accuracy in real-world, long-term robotic applications.

## Contribution

It proposes a novel unsupervised approach that simultaneously learns general and specific object representations incrementally from new views.

## Key findings

- Significant improvements over state-of-the-art in scalability and classification accuracy.
- Effective learning from very few training examples.
- Best computational performance achieved with Bag-of-Words and LDA variants.

## Abstract

Service robots are expected to operate effectively in human-centric environments for long periods of time. In such realistic scenarios, fine-grained object categorization is as important as basic-level object categorization. We tackle this problem by proposing an open-ended object recognition approach which concurrently learns both the object categories and the local features for encoding objects. In this work, each object is represented using a set of general latent visual topics and category-specific dictionaries. The general topics encode the common patterns of all categories, while the category-specific dictionary describes the content of each category in details. The proposed approach discovers both sets of general and specific representations in an unsupervised fashion and updates them incrementally using new object views. Experimental results show that our approach yields significant improvements over the previous state-of-the-art approaches concerning scalability and object classification performance. Moreover, our approach demonstrates the capability of learning from very few training examples in a real-world setting. Regarding computation time, the best result was obtained with a Bag-of-Words method followed by a variant of the Latent Dirichlet Allocation approach.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.12924/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1907.12924/full.md

## References

20 references — full list in the complete paper: https://tomesphere.com/paper/1907.12924/full.md

---
Source: https://tomesphere.com/paper/1907.12924