Flow-Enabled Generalization to Human Demonstrations in Few-Shot Imitation Learning

Runze Tang; Penny Sweetser

arXiv:2602.10594·cs.RO·March 2, 2026

Flow-Enabled Generalization to Human Demonstrations in Few-Shot Imitation Learning

Runze Tang, Penny Sweetser

PDF

Open Access

TL;DR

This paper introduces SFCrP, a novel approach combining scene flow prediction and flow-conditioned policies to enable robots to learn from human demonstrations, improving generalization and reducing demonstration requirements.

Contribution

The paper presents SFCrP, a new method that leverages scene flow prediction and flow-conditioned policies for better generalization in imitation learning from human videos.

Findings

01

Outperforms state-of-the-art baselines in real-world tasks

02

Shows strong spatial and instance generalization to unseen scenarios

03

Reduces the number of robot demonstrations needed

Abstract

Imitation Learning (IL) enables robots to learn complex skills from demonstrations without explicit task modeling, but it typically requires large amounts of demonstrations, creating significant collection costs. Prior work has investigated using flow as an intermediate representation to enable the use of human videos as a substitute, thereby reducing the amount of required robot demonstrations. However, most prior work has focused on the flow, either on the object or on specific points of the robot/hand, which cannot describe the motion of interaction. Meanwhile, relying on flow to achieve generalization to scenarios observed only in human videos remains limited, as flow alone cannot capture precise motion details. Furthermore, conditioning on scene observation to produce precise actions may cause the flow-conditioned policy to overfit to training tasks and weaken the generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Reinforcement Learning in Robotics