Teaching Machines to Describe Images via Natural Language Feedback

Huan Ling; Sanja Fidler

arXiv:1706.00130·cs.CL·June 6, 2017·32 cites

Teaching Machines to Describe Images via Natural Language Feedback

Huan Ling, Sanja Fidler

PDF

Open Access

TL;DR

This paper introduces a method for training image captioning models using natural language feedback from humans, which improves learning efficiency and caption quality over traditional methods.

Contribution

It presents a hierarchical phrase-based captioning model trained with policy gradients that incorporates human natural language feedback as a learning signal.

Findings

01

Model outperforms captioning with only human-written captions

02

Natural language feedback provides a stronger learning signal

03

Hierarchical model effectively integrates descriptive feedback

Abstract

Robots will eventually be part of every household. It is thus critical to enable algorithms to learn from and be guided by non-expert users. In this paper, we bring a human in the loop, and enable a human teacher to give feedback to a learning agent in the form of natural language. We argue that a descriptive sentence can provide a much stronger learning signal than a numeric reward in that it can easily point to where the mistakes are and how to correct them. We focus on the problem of image captioning in which the quality of the output can easily be judged by non-experts. We propose a hierarchical phrase-based captioning model trained with policy gradients, and design a feedback network that provides reward to the learner by conditioning on the human-provided feedback. We show that by exploiting descriptive feedback our model learns to perform better than when given independently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques