GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design
Wen-Fan Wang, Ting-Ying Lee, Chien-Ting Lu, Che-Wei Hsu, Nil Ponsa Campany\`a, Yu Chen, Mike Y. Chen, Bing-Yu Chen

TL;DR
GenTune improves controllability and traceability in AI-assisted environment design by enabling designers to directly link image elements to prompt labels for precise global edits, enhancing workflow efficiency and output quality.
Contribution
This paper introduces GenTune, a novel system that allows designers to trace image elements back to prompts for targeted editing, addressing challenges of prompt complexity and global consistency in AI-driven design workflows.
Findings
Significantly improved prompt-image comprehension and refinement quality
Enhanced efficiency and user satisfaction in image editing tasks
Effective real-world application demonstrated in studio settings
Abstract
Environment designers in the entertainment industry create imaginative 2D and 3D scenes for games, films, and television, requiring both fine-grained control of specific details and consistent global coherence. Designers have increasingly integrated generative AI into their workflows, often relying on large language models (LLMs) to expand user prompts for text-to-image generation, then iteratively refining those prompts and applying inpainting. However, our formative study with 10 designers surfaced two key challenges: (1) the lengthy LLM-generated prompts make it difficult to understand and isolate the keywords that must be revised for specific visual elements; and (2) while inpainting supports localized edits, it can struggle with global consistency and correctness. Based on these insights, we present GenTune, an approach that enhances human--AI collaboration by clarifying how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
