Multi-modal application: Image Memes Generation

Zhiyuan Liu; Chuanzheng Sun; Yuxin Jiang; Shiqi Jiang; Mei Ming

arXiv:2112.01651·cs.CV·December 6, 2021

Multi-modal application: Image Memes Generation

Zhiyuan Liu, Chuanzheng Sun, Yuxin Jiang, Shiqi Jiang, Mei Ming

PDF

Open Access

TL;DR

This paper presents an end-to-end AI system that generates image memes by selecting appropriate templates based on input sentences and creating matching captions, advancing meme creation automation.

Contribution

It introduces a novel encoder-decoder architecture that combines emotion-based template selection with caption generation for meme creation.

Findings

01

The system effectively generates relevant memes from input sentences.

02

The model achieves high accuracy in template selection based on emotional content.

03

Code and models are publicly available for further research.

Abstract

Meme is an interesting word. Internet memes offer unique insights into the changes in our perception of the world, the media and our own lives. If you surf the Internet for long enough, you will see it somewhere on the Internet. With the rise of social media platforms and convenient image dissemination, Image Meme has gained fame. Image memes have become a kind of pop culture and they play an important role in communication over social media, blogs, and open messages. With the development of artificial intelligence and the widespread use of deep learning, Natural Language Processing (NLP) and Computer Vision (CV) can also be used to solve more problems in life, including meme generation. An Internet meme commonly takes the form of an image and is created by combining a meme template (image) and a caption (natural language sentence). In our project, we propose an end-to-end…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining