mAnI: Movie Amalgamation using Neural Imitation

Naveen Panwar; Shreya Khare; Neelamadhav Gantayat; Rahul Aralikatte,; Senthil Mani; Anush Sankaran

arXiv:1708.04923·cs.CL·August 17, 2017

mAnI: Movie Amalgamation using Neural Imitation

Naveen Panwar, Shreya Khare, Neelamadhav Gantayat, Rahul Aralikatte,, Senthil Mani, Anush Sankaran

PDF

Open Access

TL;DR

This paper introduces mAnI, a neural approach to visualize book content by stitching relevant movie frames, exploring dialog, visual, and hybrid models to match book sentences with movie visuals.

Contribution

It pioneers the use of deep learning models to visualize book content through movie frames, comparing dialog, visual, and hybrid approaches.

Findings

01

Hybrid model outperforms dialog-only and visual-only models.

02

Proven effectiveness on the MovieBook dataset.

03

First step towards automated book-to-movie visualization.

Abstract

Cross-modal data retrieval has been the basis of various creative tasks performed by Artificial Intelligence (AI). One such highly challenging task for AI is to convert a book into its corresponding movie, which most of the creative film makers do as of today. In this research, we take the first step towards it by visualizing the content of a book using its corresponding movie visuals. Given a set of sentences from a book or even a fan-fiction written in the same universe, we employ deep learning models to visualize the input by stitching together relevant frames from the movie. We studied and compared three different types of setting to match the book with the movie content: (i) Dialog model: using only the dialog from the movie, (ii) Visual model: using only the visual content from the movie, and (iii) Hybrid model: using the dialog and the visual content from the movie. Experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Video Analysis and Summarization · Human Pose and Action Recognition