On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression
Zichang Ge, Changyu Chen, Arunesh Sinha, Pradeep Varakantham

TL;DR
This paper introduces a novel trajectory embedding method that captures multiple skills from state-action sequences without reward labels, improving generalization and performance across imitation, classification, clustering, and regression tasks.
Contribution
The paper presents a new embedding approach for trajectories that encodes multiple abilities and demonstrates superior downstream task performance without relying on reward signals.
Findings
Embeddings effectively represent skills across diverse tasks.
The method outperforms traditional trajectory encoding approaches.
Latent space exhibits properties like behavior control and additive structure.
Abstract
In real-world sequential decision making tasks like autonomous driving, robotics, and healthcare, learning from observed state-action trajectories is critical for tasks like imitation, classification, and clustering. For example, self-driving cars must replicate human driving behaviors, while robots and healthcare systems benefit from modeling decision sequences, whether or not they come from expert data. Existing trajectory encoding methods often focus on specific tasks or rely on reward signals, limiting their ability to generalize across domains and tasks. Inspired by the success of embedding models like CLIP and BERT in static domains, we propose a novel method for embedding state-action trajectories into a latent space that captures the skills and competencies in the dynamic underlying decision-making processes. This method operates without the need for reward labels, enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Neural Networks and Applications · Machine Learning and Data Classification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Layer Normalization · Dense Connections · Linear Warmup With Linear Decay · Adam · Residual Connection · Dropout · Softmax · WordPiece
