Generalized Grounding Graphs: A Probabilistic Framework for Understanding Grounded Commands
Thomas Kollar, Stefanie Tellex, Matthew Walter, Albert Huang, Abraham, Bachrach, Sachi Hemachandra, Emma Brunskill, Ashis Banerjee, Deb Roy, Seth, Teller, Nicholas Roy

TL;DR
This paper introduces Generalized Grounding Graphs, a probabilistic framework that enables robots to interpret and act upon natural language commands by effectively associating linguistic elements with perceptual features in diverse, real-world environments.
Contribution
The authors propose a novel probabilistic graphical model that dynamically adapts to linguistic parse structures, improving scalability and robustness in grounding natural language commands for robots.
Findings
Robots can learn word meanings from natural language commands.
The framework enables robots to follow commands across various platforms.
Effective association of language with perceptual data demonstrated.
Abstract
Many task domains require robots to interpret and act upon natural language commands which are given by people and which refer to the robot's physical surroundings. Such interpretation is known variously as the symbol grounding problem, grounded semantics and grounded language acquisition. This problem is challenging because people employ diverse vocabulary and grammar, and because robots have substantial uncertainty about the nature and contents of their surroundings, making it difficult to associate the constitutive language elements (principally noun phrases and spatial relations) of the command text to elements of those surroundings. Symbolic models capture linguistic structure but have not scaled successfully to handle the diverse language produced by untrained users. Existing statistical approaches can better handle diversity, but have not to date modeled complex linguistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
