OrigamiBench: An Interactive Environment to Synthesize Flat-Foldable Origamis
Naaisha Agarwal, Yihan Wu, Yichang Jian, Yikuan Hu, Nishad Mansoor, Mohan Li, Yifei Peng, Wang-Zhou Dai, Yao-Xiang Ding, Emanuele Sansone

TL;DR
OrigamiBench is an interactive environment that evaluates AI models' ability to understand and plan complex physical folding tasks, integrating visual perception, reasoning, and sequential decision-making.
Contribution
The paper introduces OrigamiBench, a novel benchmark that combines visual and symbolic reasoning in a structured folding task for AI evaluation.
Findings
Scaling model size does not improve causal reasoning.
Models struggle to generate multi-step folding strategies.
Visual and language representations are weakly integrated.
Abstract
Building AI systems that can plan, act, and create in the physical world requires more than pattern recognition. Such systems must understand the causal mechanisms and constraints governing physical processes in order to guide sequential decisions. This capability relies on internal representations, analogous to an internal language model, that relate observations, actions, and resulting environmental changes. However, many existing benchmarks treat visual perception and programmatic reasoning as separate problems, focusing either on visual recognition or on symbolic tasks. The domain of origami provides a natural testbed that integrates these modalities. Constructing shapes through folding operations requires visual perception, reasoning about geometric and physical constraints, and sequential planning, while remaining sufficiently structured for systematic evaluation. We introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Materials and Mechanics · Modular Robots and Swarm Intelligence · Architecture and Computational Design
