M^3ashy: Multi-Modal Material Synthesis via Hyperdiffusion

Chenliang Zhou; Zheyuan Hu; Alejandro Sztrajman; Yancheng Cai; Yaru Liu; Cengiz Oztireli

arXiv:2411.12015 · cs.GR · March 24, 2026

TL;DR

M^3ashy is a multi-modal hyperdiffusion framework for high-quality synthesis of real-world materials, leveraging neural fields and enabling flexible control via material type, language, or images.

Contribution

M^3ashy introduces a hyperdiffusion model for BRDF synthesis with multi-modal conditioning, and contributes two new material datasets and two BRDF distributional metrics for evaluation.

Findings

1. Effective reconstruction of complex real-world materials.
2. Flexible synthesis conditioned on multiple modalities.
3. Demonstrated superiority through extensive experiments.

Abstract

High-quality material synthesis is essential for replicating complex surface properties to create realistic scenes. Despite advances in the generation of material appearance based on analytic models, the synthesis of real-world measured BRDFs remains largely unexplored. To address this challenge, we propose M^3ashy, a novel multi-modal material synthesis framework based on hyperdiffusion. M^3ashy enables high-quality reconstruction of complex real-world materials by leveraging neural fields as a compact continuous representation of BRDFs. Furthermore, our multi-modal conditional hyperdiffusion model allows for flexible material synthesis conditioned on material type, natural language descriptions, or reference images, providing greater user control over material generation. To support future research, we contribute two new material datasets and introduce two BRDF distributional metrics…
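To make the idea of a neural field as a compact BRDF representation more concrete, here is a minimal sketch in plain Python. It is not the authors' architecture: the layer sizes, the input parameterization (two unit direction vectors), the activations, and the names `LAYER_SIZES`, `init_weights`, and `eval_brdf` are all assumptions chosen for illustration. The point it shows is that one material becomes one flat weight vector, and a hyperdiffusion model would treat such vectors as its data samples.

```python
import math
import random

# Hypothetical sizes: 6 inputs (incoming + outgoing unit directions),
# one hidden layer of 16 units, 3 outputs (RGB reflectance).
LAYER_SIZES = [6, 16, 3]


def num_weights(sizes):
    """Total number of weights and biases in the MLP."""
    return sum((n_in + 1) * n_out for n_in, n_out in zip(sizes, sizes[1:]))


def init_weights(sizes, seed=0):
    """Draw a flat weight vector (random here; a fitted one would encode a material)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 0.3) for _ in range(num_weights(sizes))]


def eval_brdf(flat_weights, wi, wo, sizes=LAYER_SIZES):
    """Evaluate the field: (incoming dir, outgoing dir) -> positive RGB values."""
    x = list(wi) + list(wo)
    offset = 0
    last = len(sizes) - 2
    for layer, (n_in, n_out) in enumerate(zip(sizes, sizes[1:])):
        out = []
        for _ in range(n_out):
            row = flat_weights[offset:offset + n_in]
            bias = flat_weights[offset + n_in]
            offset += n_in + 1
            out.append(sum(w * v for w, v in zip(row, x)) + bias)
        if layer < last:
            x = [max(0.0, v) for v in out]  # ReLU on hidden layers
        else:
            x = [math.log1p(math.exp(v)) for v in out]  # softplus keeps outputs positive
    return x


# One "material" is one flat vector of network parameters.
weights = init_weights(LAYER_SIZES)
rgb = eval_brdf(weights, wi=(0.0, 0.0, 1.0), wo=(0.0, 0.6, 0.8))
print(len(weights), rgb)
```

Under these assumptions, fitting such a network to each measured material yields a set of weight vectors, and a generative model over those vectors can then be conditioned on material type, text, or an image, as the abstract describes.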

Code & Models

Datasets

Peter2023HuggingFace/NeuMERL
dataset · 134 downloads

Videos

M3ashy: Multi-Modal Material Synthesis via Hyperdiffusion