CHIMERA: Adaptive Cache Injection and Semantic Anchor Prompting for Zero-shot Image Morphing with Morphing-oriented Metrics
Dahyeon Kye, Jeahun Sung, Minkyu Jeon, Jihyong Oh

TL;DR
CHIMERA is a zero-shot diffusion-based image morphing framework that enhances stability and semantic consistency through adaptive feature reuse, semantic anchoring, and a novel morphing metric, outperforming prior methods.
Contribution
The paper introduces CHIMERA, combining inversion-guided denoising with adaptive feature reuse and semantic anchoring, enabling stable, semantically coherent zero-shot image morphing without retraining.
Findings
Produces smoother, more consistent morphs than prior methods
Effective across diverse diffusion models without retraining
Improves intermediate semantic coherence and stability
Abstract
Recent diffusion-based image morphing methods typically interpolate inverted latents and reuse limited conditioning signals, which often yields unstable intermediates for heterogeneous endpoint pairs. In particular, (i) feature reuse is usually partial or non-adaptive, leading to abrupt structural changes or over-smoothing, and (ii) text conditions are commonly obtained independently per endpoint and then interpolated, which can introduce incompatible semantics. We present CHIMERA, a novel zero-shot diffusion morphing framework that addresses both issues via inversion-guided denoising with complementary feature reuse and text conditioning. Adaptive Cache Injection (ACI) caches a broader set of multi-scale diffusion features during DDIM inversion, going beyond Key-Value-only reuse, and re-injects them with layer- and timestep-aware scheduling to stabilize denoising and enable gradual fusion. Semantic Anchor Prompting…
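The two ingredients named in the abstract, interpolating inverted latents and re-injecting cached features on a layer- and timestep-aware schedule, can be illustrated with a small sketch. This is not the authors' implementation: spherical interpolation (slerp) is used here only as a common choice for blending latents, and `injection_weight` and `inject` are hypothetical helpers whose schedule (stronger injection at early denoising steps and shallow layers) is an assumption made for illustration. Latents and features are plain lists of floats to keep the example self-contained.

```python
import math


def slerp(z0, z1, alpha):
    """Spherical interpolation between two non-zero latent vectors (lists of floats)."""
    dot = sum(a * b for a, b in zip(z0, z1))
    n0 = math.sqrt(sum(a * a for a in z0))
    n1 = math.sqrt(sum(b * b for b in z1))
    cos_omega = max(-1.0, min(1.0, dot / (n0 * n1)))
    omega = math.acos(cos_omega)
    s = math.sin(omega)
    if s < 1e-6:
        # Vectors are (anti)parallel: fall back to linear interpolation.
        return [(1.0 - alpha) * a + alpha * b for a, b in zip(z0, z1)]
    w0 = math.sin((1.0 - alpha) * omega) / s
    w1 = math.sin(alpha * omega) / s
    return [w0 * a + w1 * b for a, b in zip(z0, z1)]


def injection_weight(layer, num_layers, step, num_steps):
    """Hypothetical schedule (an assumption, not the paper's rule).

    step 0 is the first (noisiest) denoising step; layer 0 is the shallowest.
    The weight decays linearly over steps and is halved at the deepest layer.
    """
    layer_frac = layer / max(num_layers - 1, 1)
    step_frac = step / max(num_steps - 1, 1)
    return (1.0 - step_frac) * (1.0 - 0.5 * layer_frac)


def inject(current, cache_a, cache_b, alpha, weight):
    """Blend the two endpoint caches by alpha, then mix into the current feature."""
    fused = [(1.0 - alpha) * a + alpha * b for a, b in zip(cache_a, cache_b)]
    return [(1.0 - weight) * c + weight * f for c, f in zip(current, fused)]


# Usage: midpoint latent of two orthogonal unit vectors, and one injection step.
z_mid = slerp([1.0, 0.0], [0.0, 1.0], 0.5)
w = injection_weight(layer=0, num_layers=4, step=0, num_steps=10)
feat = inject([0.0, 0.0], [1.0, 1.0], [3.0, 3.0], alpha=0.5, weight=w)
```

In this toy setup the morph position `alpha` controls both the latent blend and the cache blend, while the schedule decides how strongly the cached features override the features currently being denoised.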
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Face recognition and analysis · 3D Shape Modeling and Analysis
