Memory-Centric Embodied Question Answering
Mingliang Zhai, Zhi Gao, Yuwei Wu, Yunde Jia

TL;DR
MemoryEQA introduces a memory-centric framework for embodied question answering, enhancing exploration and reasoning by integrating memory throughout the process, leading to significant performance improvements.
Contribution
The paper proposes a novel memory-centric EQA framework with structured memory management and adaptive retrieval, improving upon existing methods that underutilize memory.
Findings
9.9% performance gain on MT-HM3D benchmark
Effective memory utilization improves task success rates
Constructed a new benchmark with 1,587 question-answer pairs
Abstract
Embodied Question Answering (EQA) requires agents to autonomously explore and comprehend the environment to answer context-dependent questions. Typically, an EQA framework consists of four components: a planner, a memory module, a stopping module, and an answering module. However, the memory module is utilized inefficiently in existing methods, as the information it stores is leveraged solely for the answering module. Such a design may result in redundant or inadequate exploration, leading to a suboptimal success rate. To solve this problem, we propose MemoryEQA, an EQA framework centered on memory, which establishes mechanisms for memory storage, update, and retrieval, allowing memory information to contribute throughout the entire exploration process. Specifically, we convert the observation into structured textual representations, which are stored in a vector library following a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching and Learning Methods
