Creativity: Generating Diverse Questions using Variational Autoencoders

Unnat Jain; Ziyu Zhang; Alexander Schwing

arXiv:1704.03493·cs.CV·April 13, 2017·34 cites

Creativity: Generating Diverse Questions using Variational Autoencoders

Unnat Jain, Ziyu Zhang, Alexander Schwing

PDF

Open Access

TL;DR

This paper presents a novel method combining variational autoencoders and LSTMs to generate diverse, plausible questions from images, enhancing AI's creative question generation capabilities.

Contribution

It introduces a creative algorithm that leverages VAEs and LSTMs to produce a wide variety of questions from a single image, addressing the need for diversity.

Findings

01

Able to generate a large set of varying questions from one image

02

Demonstrates the effectiveness of combining VAEs with LSTMs for diversity

03

Enhances AI's ability to produce creative and plausible questions

Abstract

Generating diverse questions for given images is an important task for computational education, entertainment and AI assistants. Different from many conventional prediction techniques is the need for algorithms to generate a diverse set of plausible questions, which we refer to as "creativity". In this paper we propose a creative algorithm for visual question generation which combines the advantages of variational autoencoders with long short-term memory networks. We demonstrate that our framework is able to generate a large set of varying questions given a single input image.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Video Analysis and Summarization · Advanced Image and Video Retrieval Techniques