Object-IR: Leveraging Object Consistency and Mesh Deformation for Self-Supervised Image Retargeting

Tianli Liao; Ran Wang; Siqing Zhang; Lei Li; Guangen Liu; Chenyang Zhao; Heling Cao; Peng Li

arXiv:2510.27236·cs.CV·November 3, 2025

Object-IR: Leveraging Object Consistency and Mesh Deformation for Self-Supervised Image Retargeting

Tianli Liao, Ran Wang, Siqing Zhang, Lei Li, Guangen Liu, Chenyang Zhao, Heling Cao, Peng Li

PDF

Open Access

TL;DR

Object-IR introduces a self-supervised mesh warping approach for image retargeting that preserves important objects and geometric features, outperforming existing methods in quality and efficiency.

Contribution

It proposes a novel self-supervised learning framework for image retargeting using mesh deformation guided by object consistency and geometric constraints, eliminating the need for annotated datasets.

Findings

01

Achieves state-of-the-art performance on RetargetMe benchmark.

02

Maintains real-time processing speed (~0.009s for 1024x683 images).

03

Effectively preserves semantic objects and geometric structure in retargeted images.

Abstract

Eliminating geometric distortion in semantically important regions remains an intractable challenge in image retargeting. This paper presents Object-IR, a self-supervised architecture that reformulates image retargeting as a learning-based mesh warping optimization problem, where the mesh deformation is guided by object appearance consistency and geometric-preserving constraints. Given an input image and a target aspect ratio, we initialize a uniform rigid mesh at the output resolution and use a convolutional neural network to predict the motion of each mesh grid and obtain the deformed mesh. The retargeted result is generated by warping the input image according to the rigid mesh in the input image and the deformed mesh in the output resolution. To mitigate geometric distortion, we design a comprehensive objective function incorporating a) object-consistent loss to ensure that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Generative Adversarial Networks and Image Synthesis · Image Enhancement Techniques