A Dataset and Benchmarks for Deep Learning-Based Optical Microrobot Pose and Depth Perception
Lan Wei, Dandan Zhang

TL;DR
This paper introduces the OTMR dataset, a large collection of images for training and benchmarking deep learning models to accurately perceive the pose and depth of optical microrobots under microscopes, addressing key challenges in biomedicine applications.
Contribution
The paper presents the first publicly available dataset for microrobot perception, benchmarks multiple deep learning models, and demonstrates the impact of dataset size and architecture on performance.
Findings
Vision Transformer achieves highest pose classification accuracy.
Deeper architectures improve depth regression performance.
Larger training datasets significantly enhance model accuracy.
Abstract
Optical microrobots, manipulated via optical tweezers (OT), have broad applications in biomedicine. However, reliable pose and depth perception remain fundamental challenges due to the transparent or low-contrast nature of the microrobots, as well as the noisy and dynamic conditions of the microscale environments in which they operate. An open dataset is crucial for enabling reproducible research, facilitating benchmarking, and accelerating the development of perception models tailored to microscale challenges. Standardised evaluation enables consistent comparison across algorithms, ensuring objective benchmarking and facilitating reproducible research. Here, we introduce the OpTical MicroRobot dataset (OTMR), the first publicly available dataset designed to support microrobot perception under the optical microscope. OTMR contains 232,881 images spanning 18 microrobot types and 176…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications
