An Unsupervised Domain Adaptation Scheme for Single-Stage Artwork   Recognition in Cultural Sites

Giovanni Pasqualino; Antonino Furnari; Giovanni Signorello and; Giovanni Maria Farinella

arXiv:2008.01882·cs.CV·December 23, 2020

An Unsupervised Domain Adaptation Scheme for Single-Stage Artwork Recognition in Cultural Sites

Giovanni Pasqualino, Antonino Furnari, Giovanni Signorello and, Giovanni Maria Farinella

PDF

1 Repo

TL;DR

This paper introduces a new unsupervised domain adaptation method for single-stage artwork recognition in cultural sites, leveraging synthetic data and feature alignment to improve detection performance across real and synthetic images.

Contribution

The study proposes DA-RetinaNet, a novel domain adaptation approach based on RetinaNet and feature alignment, specifically designed for artwork detection in cultural sites.

Findings

01

DA-RetinaNet outperforms other methods on the new dataset and Cityscapes.

02

Single-stage detectors show robustness to domain shifts in this context.

03

The created dataset supports further research in unsupervised domain adaptation for artwork recognition.

Abstract

Recognizing artworks in a cultural site using images acquired from the user's point of view (First Person Vision) allows to build interesting applications for both the visitors and the site managers. However, current object detection algorithms working in fully supervised settings need to be trained with large quantities of labeled data, whose collection requires a lot of times and high costs in order to achieve good performance. Using synthetic data generated from the 3D model of the cultural site to train the algorithms can reduce these costs. On the other hand, when these models are tested with real images, a significant drop in performance is observed due to the differences between real and synthetic images. In this study we consider the problem of Unsupervised Domain Adaptation for object detection in cultural sites. To address this problem, we created a new dataset containing both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fpv-iplab/DA-RetinaNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocal Loss · 1x1 Convolution · Convolution · Feature Pyramid Network · RetinaNet