Cross-Modal Generation: From Commodity WiFi to High-Fidelity mmWave and RFID Sensing

Zhixiong Yang; Long Jing; Yao Li; Shuli Cheng; Guoxuan Chi; Chenyu Wen

arXiv:2604.16558·cs.LG·April 21, 2026

Cross-Modal Generation: From Commodity WiFi to High-Fidelity mmWave and RFID Sensing

Zhixiong Yang, Long Jing, Yao Li, Shuli Cheng, Guoxuan Chi, Chenyu Wen

PDF

TL;DR

This paper introduces RF-CMG, a diffusion-based method that synthesizes high-fidelity RF data for scarce modalities like mmWave and RFID using abundant WiFi data, improving generative quality and downstream tasks.

Contribution

RF-CMG decouples cross-modal generation into high-frequency guidance and low-frequency constraints, enabling high-quality synthesis with limited target modality data.

Findings

01

RF-CMG outperforms existing generative models in RFID and mmWave signal synthesis.

02

Synthetic data from RF-CMG improves gesture recognition accuracy.

03

The method effectively suppresses structural biases during generation.

Abstract

AIGC has shown remarkable success in CV and NLP, and has recently demonstrated promising potential in the wireless domain. However, significant data imbalance exists across RF modalities, with abundant WiFi data but scarce mmWave and RFID data due to high acquisition cost. This makes it difficult to train high-quality generative models for these data-scarce modalities. In this work, we propose RF-CMG, a diffusion-based cross-modal generative method that leverages data-rich WiFi signals to synthesize high-fidelity RF data for scarce modalities including mmWave and RFID. The key insight of RF-CMG is to decouple cross-modal generation into high-frequency guidance and low-frequency constraint, which respectively learn high-frequency distribution from limited target modality data and preserve the underlying physical structure via low-frequency constraints during generation. On this basis, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.