Picture What you Read

Ignazio Gallo; Shah Nawaz; Alessandro Calefati; Riccardo La Grassa; Nicola Landro

arXiv:1909.05663·cs.CV·September 13, 2019

TL;DR

This paper explores using convolutional neural networks to generate realistic images from text descriptions, enhancing visualization and comprehension of textual content.

Contribution

It introduces a CNN-based model capable of generating images conditioned on natural language descriptions, demonstrating its effectiveness through various experiments.

Findings

01 The model can produce realistic images from text descriptions.

02 Experiments indicate that the generated images reflect the semantic content of the input text.

03 The approach advances visualization techniques in NLP and image synthesis.

Abstract

Visualization refers to our ability to create an image in our head based on the text we read or the words we hear. It is one of the many skills that make reading comprehension possible. Convolutional Neural Networks (CNNs) are an excellent tool for recognizing and classifying text documents. In addition, they can generate images conditioned on natural language. In this work, we use the capabilities of CNNs to generate realistic images that represent the text and illustrate its semantic concept. We conducted various experiments to highlight the capacity of the proposed model to generate images representative of the text descriptions it receives as input.

Code & Models

Repositories

https://www.kaggle.com/ignazio/train-and-test-a-cnn (Official)
