TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li, Yale Song, Liangliang Cao, Joel Tetreault, Larry, Goldberg, Alejandro Jaimes, Jiebo Luo

TL;DR
This paper introduces TGIF, a large dataset of animated GIFs with natural language descriptions, serving as a benchmark for GIF understanding and captioning, and demonstrates baseline models and their potential for related tasks.
Contribution
The paper presents TGIF, a new high-quality dataset for animated GIF captioning, along with validation methods, statistical analysis, baseline models, and insights into transfer learning for movie description.
Findings
High correlation between visual content and descriptions in TGIF.
Baseline models achieve promising results on GIF captioning.
Fine-tuned models improve automatic movie description.
Abstract
With the recent popularity of animated GIFs on social media, there is need for ways to index them with rich metadata. To advance research on animated GIF understanding, we collected a new dataset, Tumblr GIF (TGIF), with 100K animated GIFs from Tumblr and 120K natural language descriptions obtained via crowdsourcing. The motivation for this work is to develop a testbed for image sequence description systems, where the task is to generate natural language descriptions for animated GIFs or video clips. To ensure a high quality dataset, we developed a series of novel quality controls to validate free-form text input from crowdworkers. We show that there is unambiguous association between visual content and natural language descriptions in our dataset, making it an ideal benchmark for the visual content captioning task. We perform extensive statistical analyses to compare our dataset to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
TGIF: A New Dataset and Benchmark on Animated GIF Description· youtube
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
