Omni-IML: Towards Unified Image Manipulation Localization
Chenfan Qu, Yiwu Zhong, Fengjun Guo, Lianwen Jin

TL;DR
Omni-IML introduces a unified model for diverse image manipulation localization tasks, leveraging adaptive encoding, dynamic decoding, and anomaly highlighting, supported by a new large dataset with natural language descriptions.
Contribution
The paper presents Omni-IML, the first generalist model for multiple IML tasks, with novel components for adaptive processing and a new dataset for tampered image interpretation.
Findings
Achieves state-of-the-art performance across four IML tasks.
Demonstrates effective generalization in multi-task image forensics.
Provides a new dataset with natural language descriptions of tampered regions.
Abstract
Existing Image Manipulation Localization (IML) methods mostly rely heavily on task-specific designs, making them perform well only on the target IML task, while joint training on multiple IML tasks causes significant performance degradation, hindering real applications. To this end, we propose Omni-IML, the first generalist model designed to unify IML across diverse tasks. Specifically, Omni-IML achieves generalization through three key components: (1) a Modal Gate Encoder, which adaptively selects the optimal encoding modality per sample, (2) a Dynamic Weight Decoder, which dynamically adjusts decoder filters to the task at hand, and (3) an Anomaly Enhancement module that leverages box supervision to highlight the tampered regions and facilitate the learning of task-agnostic features. Beyond localization, to support interpretation of the tampered images, we construct Omni-273k, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
