Dual-Control Frequency-Aware Diffusion Model for Depth-Dependent Optical Microrobot Microscopy Image Generation

Lan Wei; Zongcai Tan; Kangyi Lu; Jian-Qing Zheng; Dandan Zhang

arXiv:2604.11680·cs.RO·April 14, 2026

Dual-Control Frequency-Aware Diffusion Model for Depth-Dependent Optical Microrobot Microscopy Image Generation

Lan Wei, Zongcai Tan, Kangyi Lu, Jian-Qing Zheng, Dandan Zhang

PDF

TL;DR

This paper introduces Du-FreqNet, a frequency-aware diffusion model that synthesizes depth-dependent microscopy images of optical microrobots, improving data augmentation for 3D perception tasks.

Contribution

The work presents a dual-control, frequency-aware diffusion framework with adaptive frequency loss for physically consistent microscopy image generation.

Findings

01

Achieves 20.7% higher SSIM than baseline models.

02

Generalizes well to unseen poses and depths.

03

Enhances downstream 3D pose and depth estimation accuracy.

Abstract

Optical microrobots actuated by optical tweezers (OT) are important for cell manipulation and microscale assembly, but their autonomous operation depends on accurate 3D perception. Developing such perception systems is challenging because large-scale, high-quality microscopy datasets are scarce, owing to complex fabrication processes and labor-intensive annotation. Although generative AI offers a promising route for data augmentation, existing generative adversarial network (GAN)-based methods struggle to reproduce key optical characteristics, particularly depth-dependent diffraction and defocus effects. To address this limitation, we propose Du-FreqNet, a dual-control, frequency-aware diffusion model for physically consistent microscopy image synthesis. The framework features two independent ControlNet branches to encode microrobot 3D point clouds and depth-specific mesh layers,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.