Controlling Your Image via Simplified Vector Graphics
Lanqing Guo, Xi Liu, Yufei Wang, Zhihao Li, Siyu Huang

TL;DR
This paper introduces a novel method for controllable image generation using hierarchical vector graphics, enabling intuitive editing of shapes, colors, and objects with high fidelity.
Contribution
We propose a layer-wise controllable generation framework that parses images into semantic-aligned vector graphics and translates edits into photorealistic outputs.
Findings
Effective image editing and object manipulation demonstrated
High-quality photorealistic outputs from vector graphic edits
Versatile applications in content creation and fine-grained control
Abstract
Recent advances in image generation have achieved remarkable visual quality, while a fundamental challenge remains: Can image generation be controlled at the element level, enabling intuitive modifications such as adjusting shapes, altering colors, or adding and removing objects? In this work, we address this challenge by introducing layer-wise controllable generation through simplified vector graphics (VGs). Our approach first efficiently parses images into hierarchical VG representations that are semantic-aligned and structurally coherent. Building on this representation, we design a novel image synthesis framework guided by VGs, allowing users to freely modify elements and seamlessly translate these edits into photorealistic outputs. By leveraging the structural and semantic features of VGs in conjunction with noise prediction, our method provides precise control over geometry,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
