RoMemes: A multimodal meme corpus for the Romanian language
Vasile P\u{a}i\c{s}, Sara Ni\c{t}\u{a}, Alexandru-Iulius Jerpelea,, Luca Pan\u{a}, Eric Curea

TL;DR
This paper introduces RoMemes, a curated multimodal meme dataset in Romanian, along with baseline algorithms, highlighting the need for improved AI tools to understand internet memes.
Contribution
The paper presents the first Romanian multimodal meme dataset with multiple annotations and baseline algorithms to demonstrate its usability.
Findings
Baseline algorithms show potential but need improvement for meme understanding.
The dataset enables future research in multimodal AI for Romanian memes.
Results highlight challenges in processing internet memes with current AI tools.
Abstract
Memes are becoming increasingly more popular in online media, especially in social networks. They usually combine graphical representations (images, drawings, animations or video) with text to convey powerful messages. In order to extract, process and understand the messages, AI applications need to employ multimodal algorithms. In this paper, we introduce a curated dataset of real memes in the Romanian language, with multiple annotation levels. Baseline algorithms were employed to demonstrate the usability of the dataset. Results indicate that further research is needed to improve the processing capabilities of AI tools when faced with Internet memes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Communication and Language
