Benchmarking Interaction, Beyond Policy: a Reproducible Benchmark for Collaborative Instance Object Navigation

Edoardo Zorzi; Francesco Taioli; Yiming Wang; Marco Cristani; Alessandro Farinelli; Alberto Castellini; Loris Bazzani

arXiv:2604.00265·cs.CV·April 2, 2026

Benchmarking Interaction, Beyond Policy: a Reproducible Benchmark for Collaborative Instance Object Navigation

Edoardo Zorzi, Francesco Taioli, Yiming Wang, Marco Cristani, Alessandro Farinelli, Alberto Castellini, Loris Bazzani

PDF

1 Repo

TL;DR

This paper introduces QAsk-Nav, a reproducible benchmark for collaborative object navigation that evaluates navigation and question-asking separately, enabling better assessment of interactive capabilities.

Contribution

It presents a new benchmark with a question-asking protocol, an enhanced navigation protocol, and a large dataset, along with a lightweight model that outperforms existing methods.

Findings

01

QAsk-Nav enables separate evaluation of navigation and question-asking.

02

Light-CoNav is 3x smaller and 70x faster than existing models.

03

Light-CoNav outperforms state-of-the-art approaches in unseen environments.

Abstract

We propose Question-Asking Navigation (QAsk-Nav), the first reproducible benchmark for Collaborative Instance Object Navigation (CoIN) that enables an explicit, separate assessment of embodied navigation and collaborative question asking. CoIN tasks an embodied agent with reaching a target specified in free-form natural language under partial observability, using only egocentric visual observations and interactive natural-language dialogue with a human, where the dialogue can help to resolve ambiguity among visually similar object instances. Existing CoIN benchmarks are primarily focused on navigation success and offer no support for consistent evaluation of collaborative interaction. To address this limitation, QAsk-Nav provides (i) a lightweight question-asking protocol scored independently of navigation, (ii) an enhanced navigation protocol with realistic, diverse, high-quality…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://benchmarking-interaction.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.