# Bridging between Computer and Robot Vision through Data Augmentation: a   Case Study on Object Recognition

**Authors:** Antonio D'Innocente, Fabio Maria Carlucci, Mirco Colosi, Barbara, Caputo

arXiv: 1705.02139 · 2017-05-08

## TL;DR

This paper introduces a data augmentation layer that simulates robot vision conditions to improve object recognition performance of deep networks, achieving up to 7% accuracy increase across multiple benchmarks.

## Contribution

It presents a novel data augmentation technique tailored for robot vision, bridging the gap with traditional image datasets and enhancing recognition accuracy.

## Key findings

- Up to 7% improvement in object recognition accuracy
- Effective across three benchmark datasets
- Compatible with any convolutional deep architecture

## Abstract

Despite the impressive progress brought by deep network in visual object recognition, robot vision is still far from being a solved problem. The most successful convolutional architectures are developed starting from ImageNet, a large scale collection of images of object categories downloaded from the Web. This kind of images is very different from the situated and embodied visual experience of robots deployed in unconstrained settings. To reduce the gap between these two visual experiences, this paper proposes a simple yet effective data augmentation layer that zooms on the object of interest and simulates the object detection outcome of a robot vision system. The layer, that can be used with any convolutional deep architecture, brings to an increase in object recognition performance of up to 7\%, in experiments performed over three different benchmark databases. Upon acceptance of the paper, our robot data augmentation layer will be made publicly available.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.02139/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1705.02139/full.md

## References

15 references — full list in the complete paper: https://tomesphere.com/paper/1705.02139/full.md

---
Source: https://tomesphere.com/paper/1705.02139