Small Object Detection for Near Real-Time Egocentric Perception in a Manual Assembly Scenario
Hooman Tavakoli, Snehal Walunj, Parsha Pahlevannejad, Christiane, Plociennik, and Martin Ruskowski

TL;DR
This paper presents a near real-time small object detection method for egocentric AR in industrial assembly, leveraging synthetic data from CAD models and a two-stage YOLOv4 approach to improve detection accuracy.
Contribution
It introduces a pipeline that uses CAD-based synthetic training data and a two-stage detection process tailored for egocentric AR scenarios.
Findings
Effective small object detection in near real-time
Utilizes synthetic data from CAD models for training
Two-stage detection improves accuracy in industrial AR
Abstract
Detecting small objects in video streams of head-worn augmented reality devices in near real-time is a huge challenge: training data is typically scarce, the input video stream can be of limited quality, and small objects are notoriously hard to detect. In industrial scenarios, however, it is often possible to leverage contextual knowledge for the detection of small objects. Furthermore, CAD data of objects are typically available and can be used to generate synthetic training data. We describe a near real-time small object detection pipeline for egocentric perception in a manual assembly scenario: We generate a training data set based on CAD data and realistic backgrounds in Unity. We then train a YOLOv4 model for a two-stage detection process: First, the context is recognized, then the small object of interest is detected. We evaluate our pipeline on the augmented reality device…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Robot Manipulation and Learning · Manufacturing Process and Optimization
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Feature Pyramid Network · Grid Sensitive · (TravEL!!Guide)How Do I File a Claim with Expedia? · Softmax · 1x1 Convolution · k-Means Clustering · Sigmoid Activation · DropBlock · Cosine Annealing
