TL;DR
This paper introduces a semantic-guided inpainting network that effectively manipulates complex urban scenes by removing and inserting objects while maintaining semantic consistency, addressing challenges of cluttered and ambiguous scenes.
Contribution
The work presents a novel deep learning model that leverages semantic segmentation and a new decoder block to improve inpainting quality in complex urban scenes.
Findings
Successfully manipulates urban scenes with coherent object insertion and removal
Outperforms existing methods on Cityscapes and Indian Driving datasets
Achieves semantically consistent inpainting results
Abstract
Manipulating images of complex scenes to reconstruct, insert and/or remove specific object instances is a challenging task. Complex scenes contain multiple semantics and objects, which are frequently cluttered or ambiguous, thus hampering the performance of inpainting models. Conventional techniques often rely on structural information such as object contours in multi-stage approaches that generate unreliable results and boundaries. In this work, we propose a novel deep learning model to alter a complex urban scene by removing a user-specified portion of the image and coherently inserting a new object (e.g. car or pedestrian) in that scene. Inspired by recent works on image inpainting, our proposed method leverages the semantic segmentation to model the content and structure of the image, and learn the best shape and location of the object to insert. To generate reliable results, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInpainting
