Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose Estimation
Elena Govi, Davide Sapienza, Carmelo Scribano, Tobia Poppi, Giorgia, Franchini, Paola Ard\`on, Micaela Verucchi, Marko Bertogna

TL;DR
This paper investigates how artificial markers in datasets influence 6-DoF pose estimation accuracy, revealing biases in neural network focus and proposing methods to mitigate marker-induced effects for more robust real-world applications.
Contribution
The study uncovers the bias introduced by ArUco markers in the Linemod dataset and proposes a new dataset and mitigation strategies to improve pose estimation robustness.
Findings
Markers significantly influence neural network focus during pose estimation.
Presence of markers can bias the model, reducing real-world applicability.
Data augmentation can help mitigate marker-induced bias.
Abstract
In recent years, there has been a growing trend of using data-driven methods in industrial settings. These kinds of methods often process video images or parts, therefore the integrity of such images is crucial. Sometimes datasets, e.g. consisting of images, can be sophisticated for various reasons. It becomes critical to understand how the manipulation of video and images can impact the effectiveness of a machine learning method. Our case study aims precisely to analyze the Linemod dataset, considered the state of the art in 6D pose estimation context. That dataset presents images accompanied by ArUco markers; it is evident that such markers will not be available in real-world contexts. We analyze how the presence of the markers affects the pose estimation accuracy, and how this bias may be mitigated through data augmentation and other methods. Our work aims to show how the presence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Advanced Neural Network Applications · 3D Surveying and Cultural Heritage
