SIRL: Similarity-based Implicit Representation Learning
Andreea Bobu, Yi Liu, Rohin Shah, Daniel S. Brown, Anca D. Dragan

TL;DR
This paper introduces SIRL, a method that learns task-relevant representations by querying user-defined similarities between behaviors, improving generalization over traditional self-supervised approaches.
Contribution
It proposes a novel similarity-based learning approach that leverages user input to identify causal features, enhancing representation learning for robotic reward functions.
Findings
Learned representations are more generalizable than self-supervised methods.
Similarity queries effectively isolate task-relevant features.
Method improves robustness to spurious correlations.
Abstract
When robots learn reward functions using high capacity models that take raw state directly as input, they need to both learn a representation for what matters in the task -- the task ``features" -- as well as how to combine these features into a single objective. If they try to do both at once from input designed to teach the full reward function, it is easy to end up with a representation that contains spurious correlations in the data, which fails to generalize to new settings. Instead, our ultimate goal is to enable robots to identify and isolate the causal features that people actually care about and use when they represent states and behavior. Our idea is that we can tune into this representation by asking users what behaviors they consider similar: behaviors will be similar if the features that matter are similar, even if low-level behavior is different; conversely, behaviors will…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
