Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos
Md Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang, Fu-Jen Chu, Kris, Kitani, Gedas Bertasius, Xitong Yang

TL;DR
VidAssist introduces a novel framework leveraging large language models for zero/few-shot goal-oriented planning in instructional videos, effectively addressing dataset bias and enhancing generalization in procedural task understanding.
Contribution
The paper presents VidAssist, a new approach that uses LLMs and a breadth-first search algorithm for goal-oriented planning, enabling zero/few-shot performance in instructional video tasks.
Findings
Outperforms previous state-of-the-art by +7.7% in VPA
Achieves +4.81% improvement in PP tasks
Effective in zero-shot and few-shot settings
Abstract
Goal-oriented planning, or anticipating a series of actions that transition an agent from its current state to a predefined objective, is crucial for developing intelligent assistants aiding users in daily procedural tasks. The problem presents significant challenges due to the need for comprehensive knowledge of temporal and hierarchical task structures, as well as strong capabilities in reasoning and planning. To achieve this, prior work typically relies on extensive training on the target dataset, which often results in significant dataset bias and a lack of generalization to unseen tasks. In this work, we introduce VidAssist, an integrated framework designed for zero/few-shot goal-oriented planning in instructional videos. VidAssist leverages large language models (LLMs) as both the knowledge base and the assessment tool for generating and evaluating action plans, thus overcoming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Games and Gamification · Online and Blended Learning · Educational Tools and Methods
MethodsBalanced Selection
