Loading paper
ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities | Tomesphere