TL;DR
This paper introduces a new unsupervised domain adaptation method for single-stage artwork recognition in cultural sites, leveraging synthetic data and feature alignment to improve detection performance across real and synthetic images.
Contribution
The study proposes DA-RetinaNet, a novel domain adaptation approach based on RetinaNet and feature alignment, specifically designed for artwork detection in cultural sites.
Findings
DA-RetinaNet outperforms other methods on the new dataset and Cityscapes.
Single-stage detectors show robustness to domain shifts in this context.
The created dataset supports further research in unsupervised domain adaptation for artwork recognition.
Abstract
Recognizing artworks in a cultural site using images acquired from the user's point of view (First Person Vision) allows to build interesting applications for both the visitors and the site managers. However, current object detection algorithms working in fully supervised settings need to be trained with large quantities of labeled data, whose collection requires a lot of times and high costs in order to achieve good performance. Using synthetic data generated from the 3D model of the cultural site to train the algorithms can reduce these costs. On the other hand, when these models are tested with real images, a significant drop in performance is observed due to the differences between real and synthetic images. In this study we consider the problem of Unsupervised Domain Adaptation for object detection in cultural sites. To address this problem, we created a new dataset containing both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocal Loss · 1x1 Convolution · Convolution · Feature Pyramid Network · RetinaNet
