Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Tiancheng Shen, Jun Hao Liew, Long Mai, Lu Qi, Jiashi Feng, Jiaya Jia

TL;DR
This paper introduces a new task called Image Editing Recommendation (IER) that automatically generates diverse, creative editing instructions from an image and a vague user prompt, supported by a multimodal framework called Creativity-VLA.
Contribution
The paper proposes the IER task, creates a dedicated dataset, and develops Creativity-VLA with a novel token-for-localization mechanism for improved edit-instruction generation.
Findings
Creativity-VLA effectively generates relevant, creative editing instructions.
The model outperforms baselines in relevance and diversity.
The token-for-localization mechanism enhances local editing support.
Abstract
Advances in text-based image generation and editing have revolutionized content creation, enabling users to create impressive content from imaginative text prompts. However, existing methods are not designed to work well with the oversimplified prompts that are often encountered in typical scenarios when users start their editing with only vague or abstract purposes in mind. Those scenarios demand elaborate ideation efforts from the users to bridge the gap between such vague starting points and the detailed creative ideas needed to depict the desired results. In this paper, we introduce the task of Image Editing Recommendation (IER). This task aims to automatically generate diverse creative editing instructions from an input image and a simple prompt representing the users' under-specified editing purpose. To this end, we introduce Creativity-Vision Language Assistant~(Creativity-VLA),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Recommender Systems and Techniques
