Preferenced Oracle Guided Multi-mode Policies for Dynamic Bipedal Loco-Manipulation
Prashanth Ravichandar, Lokesh Krishna, Nikhil Sobanbabu, and Quan Nguyen

TL;DR
This paper introduces OGMP, a unified policy learning approach for dynamic bipedal loco-manipulation tasks, enabling smooth transitions between modes like running, dribbling, and kicking, demonstrated across various robots.
Contribution
It proposes a novel multi-mode policy framework guided by hybrid automatons as oracles, with a preference reward to learn seamless mode transitions for complex loco-manipulation tasks.
Findings
Successful application to soccer and box-moving tasks
Single policy effective across different robot morphologies
Enhanced performance through preference-guided mode transitions
Abstract
Dynamic loco-manipulation calls for effective whole-body control and contact-rich interactions with the object and the environment. Existing learning-based control synthesis relies on training low-level skill policies and explicitly switching with a high-level policy or a hand-designed finite state machine, leading to quasi-static behaviors. In contrast, dynamic tasks such as soccer require the robot to run towards the ball, decelerate to an optimal approach to dribble, and eventually kick a goal - a continuum of smooth motion. To this end, we propose Preferenced Oracle Guided Multi-mode Policies (OGMP) to learn a single policy mastering all the required modes and preferred sequence of transitions to solve uni-object loco-manipulation tasks. We design hybrid automatons as oracles to generate references with continuous dynamics and discrete mode jumps to perform a guided policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Hydraulic and Pneumatic Systems · Structural Health Monitoring Techniques
