Art Authentication with Vision Transformers
Ludovica Schaerf, Carina Popovici, Eric Postma

TL;DR
This paper evaluates the effectiveness of Vision Transformers, specifically Swin Transformers, in art authentication tasks, comparing their performance to EfficientNet on datasets of Van Gogh paintings and imitations.
Contribution
It demonstrates that Vision Transformers can outperform traditional CNNs like EfficientNet in detecting art forgeries, especially with imitation datasets.
Findings
Swin Transformers achieved over 85% accuracy on imitation datasets.
EfficientNet performed better overall on the contrast set with diverse artworks.
Vision Transformers show promise in improving computer-based art authentication.
Abstract
In recent years, Transformers, initially developed for language, have been successfully applied to visual tasks. Vision Transformers have been shown to push the state-of-the-art in a wide range of tasks, including image classification, object detection, and semantic segmentation. While ample research has shown promising results in art attribution and art authentication tasks using Convolutional Neural Networks, this paper examines if the superiority of Vision Transformers extends to art authentication, improving, thus, the reliability of computer-based authentication of artworks. Using a carefully compiled dataset of authentic paintings by Vincent van Gogh and two contrast datasets, we compare the art authentication performances of Swin Transformers with those of EfficientNet. Using a standard contrast set containing imitations and proxies (works by painters with styles closely related…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis · Visual Attention and Saliency Detection · Generative Adversarial Networks and Image Synthesis
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Attention Is All You Need · Pointwise Convolution · Layer Normalization · Absolute Position Encodings · Depthwise Convolution · Batch Normalization · Sigmoid Activation · Label Smoothing · Depthwise Separable Convolution
