Embodied Navigation at the Art Gallery

Roberto Bigazzi; Federico Landi; Silvia Cascianelli; Marcella Cornia,; Lorenzo Baraldi; Rita Cucchiara

arXiv:2204.09069·cs.CV·April 16, 2024

Embodied Navigation at the Art Gallery

Roberto Bigazzi, Federico Landi, Silvia Cascianelli, Marcella Cornia,, Lorenzo Baraldi, Rita Cucchiara

PDF

Open Access 1 Repo

TL;DR

This paper introduces ArtGallery3D, a new complex 3D environment of an art museum for embodied navigation, providing a challenging benchmark that reveals limitations of current methods and aims to foster future research.

Contribution

The paper presents a novel, richly detailed art museum environment for navigation benchmarks, with annotated points of interest and complex trajectories, expanding beyond existing indoor datasets.

Findings

01

Existing navigation methods struggle in the new environment.

02

Trajectories are more complex and longer than in previous datasets.

03

The environment highlights the need for improved navigation algorithms.

Abstract

Embodied agents, trained to explore and navigate indoor photorealistic environments, have achieved impressive results on standard datasets and benchmarks. So far, experiments and evaluations have involved domestic and working scenes like offices, flats, and houses. In this paper, we build and release a new 3D space with unique characteristics: the one of a complete art museum. We name this environment ArtGallery3D (AG3D). Compared with existing 3D scenes, the collected space is ampler, richer in visual features, and provides very sparse occupancy information. This feature is challenging for occupancy-based agents which are usually trained in crowded domestic environments with plenty of occupancy information. Additionally, we annotate the coordinates of the main points of interest inside the museum, such as paintings, statues, and other items. Thanks to this manual process, we deliver a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aimagelab/ag3d
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Video Surveillance and Tracking Methods · Multimodal Machine Learning Applications