Language-Guided Multimodal Texture Authoring via Generative Models

Wanli Qian; Aiden Chang; Shihan Lu; Michael Gu; Heather Culbertson

arXiv:2604.06489·cs.HC·April 9, 2026

Language-Guided Multimodal Texture Authoring via Generative Models

Wanli Qian, Aiden Chang, Shihan Lu, Michael Gu, Heather Culbertson

PDF

TL;DR

This paper introduces a language-guided system that generates multimodal textures, combining haptic and visual outputs from natural language prompts, streamlining texture design and enabling intuitive, prompt-based control.

Contribution

It presents a novel multimodal texture authoring framework that links haptic and visual modalities through shared language-aligned latent representations, facilitating semantic consistency and user-friendly design.

Findings

01

Participants rated textures consistent with prompts for roughness, hardness, and slipperiness.

02

The system enables immediate visual and tactile feedback from a single natural language prompt.

03

Language effectively controls both haptic and visual texture attributes, supporting a prompt-first workflow.

Abstract

Authoring realistic haptic textures typically requires low-level parameter tuning and repeated trial-and-error, limiting speed, transparency, and creative reach. We present a language-driven authoring system that turns natural-language prompts into multimodal textures: two coordinated haptic channels - sliding vibrations via force/speed-conditioned autoregressive (AR) models and tapping transients - and a text-prompted visual preview from a diffusion model. A shared, language-aligned latent links modalities so a single prompt yields semantically consistent haptic and visual signals; designers can write goals (e.g., "gritty but cushioned surface," "smooth and hard metal surface") and immediately see and feel the result through a 3D haptic device. To verify that the learned latent encodes perceptually meaningful structure, we conduct an anchor-referenced, attribute-wise evaluation for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.