AIM-Bench: Benchmarking and Improving Affective Image Manipulation via Fine-Grained Hierarchical Control

Shi Chen; Xuecheng Wu; Heli Sun; Yunyun Shi; Xinyi Yin; Fengjian Xue; Jinheng Xie; Dingkang Yang; Hao Wang; Junxiao Xue; Liang He

arXiv:2604.10454·cs.CV·April 14, 2026

AIM-Bench: Benchmarking and Improving Affective Image Manipulation via Fine-Grained Hierarchical Control

Shi Chen, Xuecheng Wu, Heli Sun, Yunyun Shi, Xinyi Yin, Fengjian Xue, Jinheng Xie, Dingkang Yang, Hao Wang, Junxiao Xue, Liang He

PDF

TL;DR

This paper introduces AIM-Bench, a comprehensive benchmark for affective image manipulation that includes a new dataset, evaluation metrics, and insights into current model limitations and improvements.

Contribution

It presents the first dedicated benchmark for AIM, a balanced dataset AIM-40k, and a hierarchical evaluation framework to advance affective image editing research.

Findings

01

Current models struggle with positivity bias in AIM tasks.

02

The AIM-40k dataset improves model performance by 9.15%.

03

Hierarchical human-in-the-loop workflow enhances sample quality.

Abstract

Affective Image Manipulation (AIM) aims to evoke specific emotions through targeted editing. Current image editing benchmarks primarily focus on object-level modifications in general scenarios, lacking the fine-grained granularity to capture affective dimensions. To bridge this gap, we introduce the first benchmark designed for AIM termed AIM-Bench. This benchmark is built upon a dual-path affective modeling scheme that integrates the Mikels emotion taxonomy with the Valence-Arousal-Dominance framework, enabling high-level semantic and fine-grained continuous manipulation. Through a hierarchical human-in-the-loop workflow, we finally curate 800 high-quality samples covering 8 emotional categories and 5 editing types. To effectively assess performance, we also design a composite evaluation suite combining rule-based and model-based metrics to holistically assess instruction consistency,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.