MoGraphGPT: Creating Interactive Scenes Using Modular LLM and Graphical Control
Hui Ye, Chufeng Xiao, Jiaye Leng, Pengfei Xu, Hongbo Fu

TL;DR
MoGraphGPT combines modular large language models with graphical controls to simplify and improve the creation of complex 2D interactive scenes without coding, offering precise, independent element editing and enhanced visual integration.
Contribution
This paper introduces a novel modular LLM framework with graphical interface for interactive scene creation, addressing errors and control limitations of traditional LLM-based code generation.
Findings
Significantly improves ease of scene creation
Enhances controllability and refinement of visual elements
Outperforms baseline in usability and precision
Abstract
Creating interactive scenes often involves complex programming tasks. Although large language models (LLMs) like ChatGPT can generate code from natural language, their output is often error-prone, particularly when scripting interactions among multiple elements. The linear conversational structure limits the editing of individual elements, and lacking graphical and precise control complicates visual integration. To address these issues, we integrate an element-level modularization technique that processes textual descriptions for individual elements through separate LLM modules, with a central module managing interactions among elements. This modular approach allows for refining each element independently. We design a graphical user interface, MoGraphGPT , which combines modular LLMs with enhanced graphical control to generate codes for 2D interactive scenes. It enables direct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Topic Modeling
