Point-MF: One-step Point Cloud Generation from a Single Image via Mean Flows

Yuta Baba; Keiji Yanai

arXiv:2604.24586·cs.CV·April 28, 2026

Point-MF: One-step Point Cloud Generation from a Single Image via Mean Flows

Yuta Baba, Keiji Yanai

PDF

TL;DR

Point-MF introduces a fast, one-step point cloud reconstruction method from a single image using mean flows and a specialized transformer, achieving high quality with minimal inference time.

Contribution

The paper presents a novel one-step point cloud reconstruction framework that operates directly in point-cloud space using mean flows, eliminating the need for multiple denoising iterations.

Findings

01

Achieves high-quality point cloud reconstruction with millisecond latency.

02

Outperforms multi-step diffusion models in speed while maintaining quality.

03

Stabilizes large-step generation with a new auxiliary loss.

Abstract

Single-image point cloud reconstruction must infer complete 3D geometry, including occluded parts, from a single RGB image. While diffusion-based reconstructors achieve high accuracy, they typically require many denoising iterations, resulting in slow and expensive inference. We propose Point-MF, a Mean-Flow-based framework for low-NFE single-image point cloud reconstruction that couples a Mean-Flow-compatible architecture with an auxiliary loss. Specifically, Point-MF operates directly in point-cloud space to learn the mean velocity field and enables one-step reconstruction with a single network function evaluation (1-NFE), without relying on VAE-based latent representations. To make Mean Flow effective under large interval jumps, Point-MF employs a Diffusion Transformer tailored to the Mean-Flow setting, conditioned on frozen DINOv3 image features via a lightweight token adapter and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.