Machine-in-the-Loop Rewriting for Creative Image Captioning

Vishakh Padmakumar; He He

arXiv:2111.04193·cs.CL·May 10, 2022

Machine-in-the-Loop Rewriting for Creative Image Captioning

Vishakh Padmakumar, He He

PDF

Open Access 1 Repo

TL;DR

This paper introduces a rewriting model that collaborates with humans to enhance creative image captions by locally modifying drafts, resulting in more descriptive and figurative captions, and outperforms baseline models in user studies.

Contribution

The paper presents a novel rewriting approach that allows humans to retain control while improving creative image captions through targeted text modifications.

Findings

01

The model is rated more helpful than baseline infilling models.

02

Users produce more descriptive captions with the model.

03

Collaborative captions are more figurative and expressive.

Abstract

Machine-in-the-loop writing aims to enable humans to collaborate with models to complete their writing tasks more effectively. Prior work has found that providing humans a machine-written draft or sentence-level continuations has limited success since the generated text tends to deviate from humans' intention. To allow the user to retain control over the content, we train a rewriting model that, when prompted, modifies specified spans of text within the user's original draft to introduce descriptive and figurative elements locally in the text. We evaluate the model on its ability to collaborate with humans on the task of creative image captioning. On a user study through Amazon Mechanical Turk, our model is rated to be more helpful than a baseline infilling language model. In addition, third-party evaluation shows that users write more descriptive and figurative captions when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vishakhpk/mil-creative-captioning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Educational Games and Gamification · Topic Modeling