MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments
Medha Sawhney, Bhas Karmarkar, Eric J. Leaman, Arka Daw, Anuj, Karpatne, Bahareh Behkam

TL;DR
MEMTrack is a deep learning pipeline that accurately detects and tracks microrobots in dense, low-contrast environments, outperforming manual annotation and enabling advanced microrobot control.
Contribution
This work introduces MEMTrack, a novel deep learning-based tracking system specifically designed for challenging dense and low-contrast environments, with publicly available source code.
Findings
Achieves 77% precision and 48% recall in collagen environments.
Tracks bacteria speeds accurately with no significant difference from manual tracking.
Outperforms human annotators in challenging conditions.
Abstract
Tracking microrobots is challenging, considering their minute size and high speed. As the field progresses towards developing microrobots for biomedical applications and conducting mechanistic studies in physiologically relevant media (e.g., collagen), this challenge is exacerbated by the dense surrounding environments with feature size and shape comparable to microrobots. Herein, we report Motion Enhanced Multi-level Tracker (MEMTrack), a robust pipeline for detecting and tracking microrobots using synthetic motion features, deep learning-based object detection, and a modified Simple Online and Real-time Tracking (SORT) algorithm with interpolation for tracking. Our object detection approach combines different models based on the object's motion pattern. We trained and validated our model using bacterial micro-motors in collagen (tissue phantom) and tested it in collagen and aqueous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular Communication and Nanonetworks · Micro and Nano Robotics · Cell Image Analysis Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
