UniSER: A Foundation Model for Unified Soft Effects Removal

Jingdong Zhang; Lingzhi Zhang; Qing Liu; Mang Tik Chiu; Connelly Barnes; Yizhou Wang; Haoran You; Xiaoyang Liu; Yuqian Zhou; Zhe Lin; Eli Shechtman; Sohrab Amirghodsi; Xin Li; Wenping Wang; Xiaohang Zhan

arXiv:2511.14183·cs.CV·April 29, 2026

UniSER: A Foundation Model for Unified Soft Effects Removal

Jingdong Zhang, Lingzhi Zhang, Qing Liu, Mang Tik Chiu, Connelly Barnes, Yizhou Wang, Haoran You, Xiaoyang Liu, Yuqian Zhou, Zhe Lin, Eli Shechtman, Sohrab Amirghodsi, Xin Li, Wenping Wang, Xiaohang Zhan

PDF

TL;DR

UniSER is a versatile foundation model designed to effectively remove various soft effects like haze, reflections, and shadows from images within a unified framework, leveraging a large dataset and diffusion transformer architecture.

Contribution

The paper introduces UniSER, a novel unified model for soft effects removal, trained on a large dataset, outperforming specialized and generalist models in robustness and fidelity.

Findings

01

UniSER outperforms existing models in soft effects removal tasks.

02

The model demonstrates high robustness and generalization in real-world scenarios.

03

A new large-scale dataset with physically-plausible data enhances training effectiveness.

Abstract

Digital images are often degraded by soft effects such as lens flare, haze, shadows, and reflections, which reduce aesthetics even though the underlying pixels remain partially visible. The prevailing works address these degradations in isolation, developing highly specialized, specialist models that lack scalability and fail to exploit the shared underlying essences of these restoration problems. Meanwhile, although recent large-scale generalist models (e.g., GPT-4o, Flux Kontext, Nano Banana) offer powerful text-driven editing capabilities, they heavily rely on detailed prompts and often fail to achieve robust removal on such fine-grained tasks while preserving the scene's identity. Leveraging the common essence of soft effects, i.e., semi-transparent occlusions, we introduce a foundational versatile model UniSER, capable of addressing diverse degradations caused by soft effects…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.