A Spoken Dialogue System for Spatial Question Answering in a Physical Blocks World
Georgiy Platonov, Benjamin Kane, Aaron Gindi, Lenhart K. Schubert

TL;DR
This paper presents a comprehensive spoken dialogue system for spatial question answering in a physical blocks world, integrating vision, speech, dialogue management, and spatial reasoning.
Contribution
It introduces a holistic system combining semantic parsing, dialogue management, and spatial constraint solving for robust spatial question answering.
Findings
Effective interpretation of spatial questions via semantic parsing.
Successful integration of vision, speech, and reasoning components.
System provides answers aligned with human perception.
Abstract
The blocks world is a classic toy domain that has long been used to build and test spatial reasoning systems. Despite its relative simplicity, tackling this domain in its full complexity requires the agent to exhibit a rich set of functional capabilities, ranging from vision to natural language understanding. There is currently a resurgence of interest in solving problems in such limited domains using modern techniques. In this work we tackle spatial question answering in a holistic way, using a vision system, speech input and output mediated by an animated avatar, a dialogue system that robustly interprets spatial queries, and a constraint solver that derives answers based on 3-D spatial modeling. The contributions of this work include a semantic parser that maps spatial questions into logical forms consistent with a general approach to meaning representation, a dialog manager based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Multimodal Machine Learning Applications · Natural Language Processing Techniques
MethodsTest
