The Cross-Depiction Problem: Computer Vision Algorithms for Recognising   Objects in Artwork and in Photographs

Hongping Cai; Qi Wu; Tadeo Corradi; Peter Hall

arXiv:1505.00110·cs.CV·May 4, 2015·31 cites

The Cross-Depiction Problem: Computer Vision Algorithms for Recognising Objects in Artwork and in Photographs

Hongping Cai, Qi Wu, Tadeo Corradi, Peter Hall

PDF

Open Access 1 Repo

TL;DR

This paper investigates the challenge of recognizing objects across different depiction styles like photographs and artwork, benchmarking various methods and highlighting the importance of spatial relations in improving robustness.

Contribution

It provides a comprehensive benchmark of classification, domain adaptation, and deep learning methods for the cross-depiction problem, revealing their limitations and emphasizing the role of spatial relations.

Findings

01

Deep learning methods perform poorly on diverse depiction styles.

02

Models with strong spatial relation understanding are more robust.

03

Most methods struggle with inhomogeneous databases.

Abstract

The cross-depiction problem is that of recognising visual objects regardless of whether they are photographed, painted, drawn, etc. It is a potentially significant yet under-researched problem. Emulating the remarkable human ability to recognise objects in an astonishingly wide variety of depictive forms is likely to advance both the foundations and the applications of Computer Vision. In this paper we benchmark classification, domain adaptation, and deep learning methods; demonstrating that none perform consistently well in the cross-depiction problem. Given the current interest in deep learning, the fact such methods exhibit the same behaviour as all but one other method: they show a significant fall in performance over inhomogeneous databases compared to their peak performance, which is always over data comprising photographs only. Rather, we find the methods that have strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BathVisArtData/PeopleArt
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning