COLLAGE: Adaptive Fusion-based Retrieval for Augmented Policy Learning
Sateesh Kumar, Shivin Dass, Georgios Pavlakos, Roberto Mart\'in-Mart\'in

TL;DR
COLLAGE introduces an adaptive fusion method for selecting relevant demonstrations in few-shot imitation learning, improving policy performance by combining multiple retrieval cues and importance sampling.
Contribution
It proposes a flexible, feature-agnostic retrieval and weighting scheme that enhances demonstration selection for better policy learning in few-shot settings.
Findings
Outperforms state-of-the-art methods by 5.1% in simulation
Achieves 16.6% improvement in real-world tasks
Effective across multiple datasets and task types
Abstract
In this work, we study the problem of data retrieval for few-shot imitation learning: selecting data from a large dataset to train a performant policy for a specific task, given only a few target demonstrations. Prior methods retrieve data using a single-feature distance heuristic, assuming that the best demonstrations are those that most closely resemble the target examples in visual, semantic, or motion space. However, this approach captures only a subset of the relevant information and can introduce detrimental demonstrations, e.g., retrieving data from unrelated tasks due to similar scene layouts, or selecting similar motions from tasks with divergent goals. We present COLLAGE, a method for COLLective data AGgrEgation in few-shot imitation learning that uses an adaptive late fusion mechanism to guide the selection of relevant demonstrations based on a task-specific combination of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
