Cross-Domain Demo-to-Code via Neurosymbolic Counterfactual Reasoning
Jooyoung Kim, Wonje Choi, Younguk Song, Honguk Woo

TL;DR
This paper presents NeSyCR, a neurosymbolic framework that improves cross-domain robotic programming by reasoning about procedural differences, leading to significant success rate improvements in manipulation tasks.
Contribution
NeSyCR introduces a novel neurosymbolic counterfactual reasoning approach for verifiable cross-domain adaptation in video-instructed robotic programming.
Findings
Achieves 31.14% higher task success than baseline.
Effectively adapts to both simulated and real-world tasks.
Provides verifiable procedural revisions for domain mismatches.
Abstract
Recent advances in Vision-Language Models (VLMs) have enabled video-instructed robotic programming, allowing agents to interpret video demonstrations and generate executable control code. We formulate video-instructed robotic programming as a cross-domain adaptation problem, where perceptual and physical differences between demonstration and deployment induce procedural mismatches. However, current VLMs lack the procedural understanding needed to reformulate causal dependencies and achieve task-compatible behavior under such domain shifts. We introduce NeSyCR, a neurosymbolic counterfactual reasoning framework that enables verifiable adaptation of task procedures, providing a reliable synthesis of code policies. NeSyCR abstracts video demonstrations into symbolic trajectories that capture the underlying task procedure. Given deployment observations, it derives counterfactual states that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Reinforcement Learning in Robotics
