InstructRobot: A Model-Free Framework for Mapping Natural Language Instructions into Robot Motion
Iury Cleveston, Alana C. Santana, Paula D. P. Costa, Ricardo R. Gudwin, Alexandre S. Simões, Esther L. Colombini

TL;DR
InstructRobot is a novel, model-free framework that translates natural language instructions into robot motions using reinforcement learning, capable of handling complex robots without large datasets or detailed kinematic models.
Contribution
It introduces a reinforcement learning-based approach that jointly learns language representations and inverse kinematics, enabling flexible, dataset-free robot instruction mapping.
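To make "jointly learns language representations and inverse kinematics" concrete, here is a minimal sketch, not the authors' implementation. The source does not name the reinforcement learning algorithm or the network, so this sketch substitutes a plain REINFORCE update on a linear Gaussian policy. The toy vocabulary, the embedding size, the reward (negative squared distance to a target joint configuration), and the names JointPolicy and train_step are all hypothetical. The only point it illustrates is that a single reward signal updates both the word embeddings and the instruction-to-joint mapping, with no kinematic model supplied.

```python
import numpy as np

N_JOINTS = 26   # matches the 26-joint robot mentioned in the source
EMBED_DIM = 8   # hypothetical embedding size
VOCAB = {"raise": 0, "lower": 1, "left": 2, "right": 3, "arm": 4}  # hypothetical toy vocabulary


class JointPolicy:
    """Linear Gaussian policy over joint deltas, conditioned on instruction and joint state."""

    def __init__(self, rng, sigma=0.1):
        self.rng = rng
        self.sigma = sigma
        # Language representation: one learnable vector per word.
        self.embed = rng.normal(0.0, 0.1, size=(len(VOCAB), EMBED_DIM))
        # Motion mapping: stands in for the learned inverse kinematics.
        self.W = rng.normal(0.0, 0.1, size=(N_JOINTS, EMBED_DIM + N_JOINTS))

    def act(self, tokens, joints):
        ids = [VOCAB[t] for t in tokens]
        sentence = self.embed[ids].mean(axis=0)        # mean-pooled instruction vector
        x = np.concatenate([sentence, joints])
        mean = self.W @ x
        action = mean + self.sigma * self.rng.normal(size=N_JOINTS)
        return action, mean, x, ids

    def update(self, advantage, action, mean, x, ids, lr=1e-2):
        # Gradient of log N(action | mean, sigma^2 I) with respect to the mean.
        g_mean = (action - mean) / self.sigma ** 2
        # Backpropagate into the sentence vector before W is modified.
        g_sentence = self.W[:, :EMBED_DIM].T @ g_mean
        self.W += lr * advantage * np.outer(g_mean, x)
        # Mean pooling: each used word receives 1/len(ids) of the sentence gradient.
        np.add.at(self.embed, ids, lr * advantage * g_sentence / len(ids))


def train_step(policy, tokens, joints, target, baseline=0.0):
    """One REINFORCE step against a hypothetical reaching reward."""
    action, mean, x, ids = policy.act(tokens, joints)
    reward = -float(np.sum((joints + action - target) ** 2))
    policy.update(reward - baseline, action, mean, x, ids)
    return reward
```

Calling train_step(policy, ["raise", "arm"], joints, target) repeatedly, with a running average of rewards as the baseline, would be the usual way to drive such a loop. A realistic setup would replace the linear policy with a neural network and the toy reward with a simulator.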
Findings
Successfully applied to a 26-joint robot in manipulation tasks
Demonstrates robustness and adaptability in realistic environments
Operates without large datasets or prior kinematic knowledge
Abstract
The ability to communicate with robots using natural language is a significant step forward in human-robot interaction. However, accurately translating verbal commands into physical actions remains challenging. Current approaches require large datasets to train the models and are limited to robots with a maximum of 6 degrees of freedom. To address these issues, we propose a framework called InstructRobot that maps natural language instructions into robot motion without requiring the construction of large datasets or prior knowledge of the robot's kinematics model. InstructRobot employs a reinforcement learning algorithm that enables joint learning of language representations and the inverse kinematics model, simplifying the entire learning process. The proposed framework is validated using a complex robot with 26 revolute joints in object manipulation tasks,…
Taxonomy
Topics: Natural Language Processing Techniques · Multimodal Machine Learning Applications · Speech and dialogue systems
