RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representations

Yanhao Ge; Shanyan Guan; Weihao Wang; Ying Tai; Mingyu You

arXiv:2605.15908·cs.CV·May 18, 2026

RaPD: Resolution-Agnostic Pixel Diffusion via Semantics-Enriched Implicit Representations

Yanhao Ge, Shanyan Guan, Weihao Wang, Ying Tai, Mingyu You

PDF

TL;DR

RaPD introduces a resolution-agnostic generative model that performs diffusion in a continuous neural image field, enabling high-quality, scale-aware image synthesis at arbitrary resolutions.

Contribution

It proposes a novel diffusion approach in a continuous neural field with semantic guidance and coordinate-based rendering, improving resolution scalability and generation quality.

Findings

01

Superior generation quality demonstrated across resolutions

02

Enables arbitrary resolution rendering with fixed diffusion cost

03

Bridges the gap between continuous rendering and discrete latent spaces

Abstract

Natural images are continuous, yet most generative models synthesize them on discrete grids, limiting resolution-flexible generation. Continuous neural fields enable resolution-free rendering, but prior methods introduce continuity only at the decoding stage as an interpolation module, leaving the generative latent space discretized and reconstruction-oriented. We propose RaPD (Resolution-agnostic Pixel Diffusion), which performs diffusion in a continuous Neural Image Field (NIF) latent space. RaPD bridges this reconstruction-generation gap with Semantic Representation Guidance for generation-aware latent learning and a Coordinate-Queried Attention Renderer for coordinate-conditioned, scale-aware rendering. A single denoised latent can be rendered at arbitrary resolutions by changing only the query coordinates, keeping diffusion cost fixed. Experiments demonstrate superior generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.