Efficiently Generating Expressive Quadruped Behaviors via   Language-Guided Preference Learning

Jaden Clark; Joey Hejna; Dorsa Sadigh

arXiv:2502.03717·cs.RO·April 2, 2025

Efficiently Generating Expressive Quadruped Behaviors via Language-Guided Preference Learning

Jaden Clark, Joey Hejna, Dorsa Sadigh

PDF

Open Access

TL;DR

This paper presents LGPL, a novel method combining pre-trained language models and preference learning to efficiently generate expressive quadruped behaviors aligned with human expectations, requiring fewer interactions.

Contribution

Introducing LGPL, which leverages LLM priors to guide preference learning, significantly improving sample efficiency in generating expressive robot behaviors.

Findings

01

LGPL learns behaviors with as few as four queries.

02

Outperforms purely language-based and traditional preference methods.

03

Achieves rapid, accurate, and expressive quadruped behaviors.

Abstract

Expressive robotic behavior is essential for the widespread acceptance of robots in social environments. Recent advancements in learned legged locomotion controllers have enabled more dynamic and versatile robot behaviors. However, determining the optimal behavior for interactions with different users across varied scenarios remains a challenge. Current methods either rely on natural language input, which is efficient but low-resolution, or learn from human preferences, which, although high-resolution, is sample inefficient. This paper introduces a novel approach that leverages priors generated by pre-trained LLMs alongside the precision of preference learning. Our method, termed Language-Guided Preference Learning (LGPL), uses LLMs to generate initial behavior samples, which are then refined through preference-based feedback to learn behaviors that closely align with human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Video Analysis and Summarization

MethodsALIGN