Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen, Qinfeng Shi

TL;DR
This paper introduces ACK, a novel model that leverages commonsense knowledge graphs from ConceptNet to improve visual and language understanding for remote object grounding in vision-and-language navigation tasks, achieving state-of-the-art results.
Contribution
The paper proposes an augmented commonsense knowledge model that integrates a refined knowledge graph into navigation agents, enhancing action prediction and data alignment in unseen environments.
Findings
Achieves state-of-the-art performance on REVERIE benchmark.
Effectively incorporates commonsense knowledge to improve navigation accuracy.
Outperforms baseline models significantly.
Abstract
The vision-and-language navigation (VLN) task necessitates an agent to perceive the surroundings, follow natural language instructions, and act in photo-realistic unseen environments. Most of the existing methods employ the entire image or object features to represent navigable viewpoints. However, these representations are insufficient for proper action prediction, especially for the REVERIE task, which uses concise high-level instructions, such as ''Bring me the blue cushion in the master bedroom''. To address enhancing representation, we propose an augmented commonsense knowledge model (ACK) to leverage commonsense information as a spatio-temporal knowledge graph for improving agent navigation. Specifically, the proposed approach involves constructing a knowledge base by retrieving commonsense information from ConceptNet, followed by a refinement module to remove noisy and irrelevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Management and Algorithms · Robotics and Automated Systems · Web Data Mining and Analysis
MethodsBalanced Selection
