IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios
Yifan Li, Yuhang Chen, Anh Dao, Lichi Li, Zhongyi Cai, Zhen Tan, Tianlong Chen, Yu Kong

TL;DR
IndustryEQA introduces a novel benchmark for evaluating embodied question answering agents specifically in safety-critical industrial warehouse scenarios, emphasizing safety, perception, and reasoning capabilities in realistic environments.
Contribution
It is the first benchmark dedicated to industrial EQA, featuring high-fidelity videos, safety annotations, and a comprehensive evaluation framework for industrial applications.
Findings
Benchmark includes 1344 QA pairs from diverse warehouse scenarios.
Baseline models show significant room for improvement in safety and reasoning tasks.
IndustryEQA promotes development of robust, safety-aware embodied agents for industrial use.
Abstract
Existing Embodied Question Answering (EQA) benchmarks primarily focus on household environments, often overlooking safety-critical aspects and reasoning processes pertinent to industrial settings. This drawback limits the evaluation of agent readiness for real-world industrial applications. To bridge this, we introduce IndustryEQA, the first benchmark dedicated to evaluating embodied agent capabilities within safety-critical warehouse scenarios. Built upon the NVIDIA Isaac Sim platform, IndustryEQA provides high-fidelity episodic memory videos featuring diverse industrial assets, dynamic human agents, and carefully designed hazardous situations inspired by real-world safety guidelines. The benchmark includes rich annotations covering six categories: equipment safety, human safety, object recognition, attribute recognition, temporal understanding, and spatial understanding. Besides, it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems
