# Good News, Everyone! Context driven entity-aware captioning for news   images

**Authors:** Ali Furkan Biten, Lluis Gomez, Mar\c{c}al Rusi\~nol, Dimosthenis, Karatzas

arXiv: 1904.01475 · 2019-04-03

## TL;DR

This paper introduces a novel news image captioning method that leverages contextual information from news articles to produce more interpretative captions, along with a new large dataset and state-of-the-art results.

## Contribution

The work presents a new context-aware captioning model for news images and introduces the 'GoodNews' dataset, advancing beyond descriptive captioning to interpretative scene understanding.

## Key findings

- State-of-the-art performance on news image captioning
- Effective integration of article context into caption generation
- Ability to extend vocabulary with out-of-vocabulary entities

## Abstract

Current image captioning systems perform at a merely descriptive level, essentially enumerating the objects in the scene and their relations. Humans, on the contrary, interpret images by integrating several sources of prior knowledge of the world. In this work, we aim to take a step closer to producing captions that offer a plausible interpretation of the scene, by integrating such contextual information into the captioning pipeline. For this we focus on the captioning of images used to illustrate news articles. We propose a novel captioning method that is able to leverage contextual information provided by the text of news articles associated with an image. Our model is able to selectively draw information from the article guided by visual cues, and to dynamically extend the output dictionary to out-of-vocabulary named entities that appear in the context source. Furthermore we introduce `GoodNews', the largest news image captioning dataset in the literature and demonstrate state-of-the-art results.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.01475/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1904.01475/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/1904.01475/full.md

---
Source: https://tomesphere.com/paper/1904.01475