Automatic Ground Truths: Projected Image Annotations for Omnidirectional   Vision

Victor Stamatescu; Peter Barsznica; Manjung Kim; Kin K. Liu; Mark; McKenzie; Will Meakin; Gwilyn Saunders; Sebastien C. Wong; Russell S. A.; Brinkworth

arXiv:1709.03697·cs.CV·September 13, 2017

Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision

Victor Stamatescu, Peter Barsznica, Manjung Kim, Kin K. Liu, Mark, McKenzie, Will Meakin, Gwilyn Saunders, Sebastien C. Wong, Russell S. A., Brinkworth

PDF

TL;DR

This paper introduces a new omnidirectional video dataset with automatically annotated object positions, facilitating training and evaluation of scene understanding algorithms in spherical imagery.

Contribution

It provides a novel dataset with automatically generated ground truth annotations for omnidirectional vision, along with calibration tools and error estimation methods.

Findings

01

Dataset enables improved training of object detection algorithms

02

Automated annotations reduce manual labeling effort

03

Software tools facilitate calibration and comparison

Abstract

We present a novel data set made up of omnidirectional video of multiple objects whose centroid positions are annotated automatically. Omnidirectional vision is an active field of research focused on the use of spherical imagery in video analysis and scene understanding, involving tasks such as object detection, tracking and recognition. Our goal is to provide a large and consistently annotated video data set that can be used to train and evaluate new algorithms for these tasks. Here we describe the experimental setup and software environment used to capture and map the 3D ground truth positions of multiple objects into the image. Furthermore, we estimate the expected systematic error on the mapped positions. In addition to final data products, we release publicly the software tools and raw data necessary to re-calibrate the camera and/or redo this mapping. The software also provides a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.