Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings

Athanasios Efthymiou; Stevan Rudinac; Monika Kackovic; Marcel Worring; Nachoem Wijnberg

arXiv:2105.08190·cs.CV·May 27, 2025

Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings

Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Marcel Worring, Nachoem Wijnberg

PDF

TL;DR

ArtSAGENet combines GNNs and CNNs for multimodal fine art analysis, outperforming traditional methods by capturing relational dependencies and requiring less data and computation.

Contribution

The paper introduces ArtSAGENet, a novel architecture that effectively integrates GNNs and CNNs for improved fine art analysis using multi-task learning.

Findings

01

GNN architectures outperform CNN baselines in art tasks

02

Multi-task learning benefits fine art analysis

03

Relational dependencies improve art representation

Abstract

We propose ArtSAGENet, a novel multimodal architecture that integrates Graph Neural Networks (GNNs) and Convolutional Neural Networks (CNNs), to jointly learn visual and semantic-based artistic representations. First, we illustrate the significant advantages of multi-task learning for fine art analysis and argue that it is conceptually a much more appropriate setting in the fine art domain than the single-task alternatives. We further demonstrate that several GNN architectures can outperform strong CNN baselines in a range of fine art analysis tasks, such as style classification, artist attribution, creation period estimation, and tag prediction, while training them requires an order of magnitude less computational time and only a small amount of labeled data. Finally, through extensive experimentation we show that our proposed ArtSAGENet captures and encodes valuable relational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.