CAPER: Constrained and Procedural Reasoning for Robotic Scientific Experiments
Jinghan Yang, Jingyi Hou, Xinbo Yu, Wei He, Yifan Wu

TL;DR
CAPER is a framework that enhances robotic scientific experiments by explicitly enforcing procedural constraints and structured reasoning, leading to improved robustness, controllability, and data efficiency in complex manipulation tasks.
Contribution
It introduces a responsibility-separated planning and control pipeline that enforces procedural correctness and robustness in robotic scientific experiments.
Findings
Improves success rate and procedural correctness in experiments.
Enhances robustness and data efficiency in low-data regimes.
Demonstrates effectiveness on scientific workflow and manipulation benchmarks.
Abstract
Robotic assistance in scientific laboratories requires procedurally correct long-horizon manipulation, reliable execution under limited supervision, and robustness in low-demonstration regimes. Such conditions greatly challenge end-to-end vision-language-action (VLA) models, whose assumptions of recoverable errors and data-driven policy learning often break down in protocol-sensitive experiments. We propose CAPER, a framework for Constrained And ProcEdural Reasoning for robotic scientific experiments, which explicitly restricts where learning and reasoning occur in the planning and control pipeline. Rather than strengthening end-to-end policies, CAPER enforces a responsibility-separated structure: task-level reasoning generates procedurally valid action sequences under explicit constraints, mid-level multimodal grounding realizes subtasks without delegating spatial decision-making to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Robot Manipulation and Learning
