Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image
Jianshun Zeng, Wang Li, Yanjie Lv, Shuai Gao, YuChu Qin

TL;DR
This paper introduces a multi-scale semantic prior feature guided deep neural network for street-view image inpainting, improving the generation of plausible, privacy-preserving urban scenes by leveraging rich semantic priors and attention mechanisms.
Contribution
The paper proposes a novel DNN architecture with a semantic prior prompter and cascaded LPT modules, enhancing global context understanding and structure restoration in street-view image inpainting.
Findings
Outperforms state-of-the-art methods on Apolloscapes and Cityscapes datasets.
Achieves approximately 9.5% improvement in MAE and 41.07% in LPIPS.
Effectively prevents hallucinated objects, ensuring reliable urban scene generation.
Abstract
Street-view image has been widely applied as a crucial mobile mapping data source. The inpainting of street-view images is a critical step for street-view image processing, not only for the privacy protection, but also for the urban environment mapping applications. This paper presents a novel Deep Neural Network (DNN), multi-scale semantic prior Feature guided image inpainting Network (MFN) for inpainting street-view images, which generate static street-view images without moving objects (e.g., pedestrians, vehicles). To enhance global context understanding, a semantic prior prompter is introduced to learn rich semantic priors from large pre-trained model. We design the prompter by stacking multiple Semantic Pyramid Aggregation (SPA) modules, capturing a broad range of visual feature patterns. A semantic-enhanced image generator with a decoder is proposed that incorporates a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Remote Sensing and Land Use · Traffic Prediction and Management Techniques
MethodsInpainting · Masked autoencoder
