MOSAIC: Multimodal Multistakeholder-aware Visual Art Recommendation

Bereket A. Yilma; Luis A. Leiva

arXiv:2407.21758·cs.IR·August 1, 2024

MOSAIC: Multimodal Multistakeholder-aware Visual Art Recommendation

Bereket A. Yilma, Luis A. Leiva

PDF

Open Access

TL;DR

MOSAIC is a multimodal, multistakeholder-aware visual art recommendation system that balances user preferences with stakeholder interests using advanced AI models, improving diversity and relevance.

Contribution

The paper introduces MOSAIC, a novel approach combining CLIP and BLIP architectures with joint optimization for stakeholder-aware art recommendations.

Findings

01

Popularity significantly influences user preferences.

02

Representativeness has a minimal effect on user perception.

03

MOSAIC benefits multiple art stakeholders beyond visitors.

Abstract

Visual art (VA) recommendation is complex, as it has to consider the interests of users (e.g. museum visitors) and other stakeholders (e.g. museum curators). We study how to effectively account for key stakeholders in VA recommendations while also considering user-centred measures such as novelty, serendipity, and diversity. We propose MOSAIC, a novel multimodal multistakeholder-aware approach using state-of-the-art CLIP and BLIP backbone architectures and two joint optimisation objectives: popularity and representative selection of paintings across different categories. We conducted an offline evaluation using preferences elicited from 213 users followed by a user study with 100 crowdworkers. We found a strong effect of popularity, which was positively perceived by users, and a minimal effect of representativeness. MOSAIC's impact extends beyond visitors, benefiting various art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAesthetic Perception and Analysis · Image Retrieval and Classification Techniques · Digital Media and Visual Art

MethodsBLIP: Bootstrapping Language-Image Pre-training · Contrastive Language-Image Pre-training