ROMA: Run-Time Object Detection To Maximize Real-Time Accuracy
JunKyu Lee, Blesson Varghese, Hans Vandierendonck

TL;DR
This paper introduces ROMA, a run-time model that dynamically selects the best detector among multiple YOLOv4 models to maximize real-time object detection accuracy under varying video content and latency conditions.
Contribution
ROMA is a novel run-time accuracy variation model that optimally selects detectors in real time without label information, improving accuracy in dynamic scenarios.
Findings
ROMA improves real-time detection accuracy by 4-37%.
ROMA outperforms individual detectors and existing runtime techniques.
The model is validated on NVIDIA Jetson Nano with MOT datasets.
Abstract
This paper analyzes the effects of dynamically varying video contents and detection latency on the real-time detection accuracy of a detector and proposes a new run-time accuracy variation model, ROMA, based on the findings from the analysis. ROMA is designed to select an optimal detector out of a set of detectors in real time without label information to maximize real-time object detection accuracy. ROMA utilizing four YOLOv4 detectors on an NVIDIA Jetson Nano shows real-time accuracy improvements by 4 to 37% for a scenario of dynamically varying video contents and detection latency consisting of MOT17Det and MOT20Det datasets, compared to individual YOLOv4 detectors and two state-of-the-art runtime techniques.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
ROMA: Run-Time Object Detection To Maximize Real-Time Accuracy· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · CCD and CMOS Imaging Sensors
MethodsCommunication--Guide||How Do I Communicate to Expedia? · BNB Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Softmax · 1x1 Convolution · Bottom-up Path Augmentation · Global Average Pooling · Feature Pyramid Network · k-Means Clustering
