UPB at SemEval-2020 Task 8: Joint Textual and Visual Modeling in a Multi-Task Learning Architecture for Memotion Analysis
George-Alexandru Vlad, George-Eduard Zaharia, Dumitru-Clementin, Cercel, Costin-Gabriel Chiru, Stefan Trausan-Matu

TL;DR
This paper presents a multimodal multi-task learning system combining textual and visual data for Memotion analysis, achieving top rankings in the SemEval-2020 competition.
Contribution
It introduces a novel joint architecture using ALBERT and VGG-16 for effective meme analysis, surpassing baseline performance in all subtasks.
Findings
Achieved 1st place in Subtask B with 0.5183 macro F1-score.
Outperformed baseline results significantly across all subtasks.
Demonstrated the effectiveness of multimodal multi-task learning for meme sentiment analysis.
Abstract
Users from the online environment can create different ways of expressing their thoughts, opinions, or conception of amusement. Internet memes were created specifically for these situations. Their main purpose is to transmit ideas by using combinations of images and texts such that they will create a certain state for the receptor, depending on the message the meme has to send. These posts can be related to various situations or events, thus adding a funny side to any circumstance our world is situated in. In this paper, we describe the system developed by our team for SemEval-2020 Task 8: Memotion Analysis. More specifically, we introduce a novel system to analyze these posts, a multimodal multi-task learning architecture that combines ALBERT for text encoding with VGG-16 for image representation. In this manner, we show that the information behind them can be properly revealed. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Adam · Layer Normalization · Dense Connections · WordPiece · Attention Is All You Need · LAMB · Multi-Head Attention · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection
