Interactive Text2Pickup Network for Natural Language based Human-Robot Collaboration
Hyemin Ahn, Sungjoon Choi, Nuri Kim, Geonho Cha, Songhwai Oh

TL;DR
This paper introduces the Interactive Text2Pickup network, which improves human-robot collaboration by understanding ambiguous commands, asking clarifying questions, and accurately estimating object positions for pickup tasks.
Contribution
The paper presents a novel interactive network that handles ambiguous human commands by generating questions and refining object localization for improved robot pickup accuracy.
Findings
Achieves 98.49% accuracy with unambiguous commands.
Increases pickup accuracy by 1.94 times with ambiguous commands after interaction.
Demonstrates effective human-robot communication in object pickup tasks.
Abstract
In this paper, we propose the Interactive Text2Pickup (IT2P) network for human-robot collaboration which enables an effective interaction with a human user despite the ambiguity in user's commands. We focus on the task where a robot is expected to pick up an object instructed by a human, and to interact with the human when the given instruction is vague. The proposed network understands the command from the human user and estimates the position of the desired object first. To handle the inherent ambiguity in human language commands, a suitable question which can resolve the ambiguity is generated. The user's answer to the question is combined with the initial command and given back to the network, resulting in more accurate estimation. The experiment results show that given unambiguous commands, the proposed method can estimate the position of the requested object with an accuracy of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling
