Training an adaptive dialogue policy for interactive learning of   visually grounded word meanings

Yanchao Yu; Arash Eshghi; Oliver Lemon

arXiv:1709.10426·cs.CL·October 2, 2017

Training an adaptive dialogue policy for interactive learning of visually grounded word meanings

Yanchao Yu, Arash Eshghi, Oliver Lemon

PDF

TL;DR

This paper introduces a multi-modal dialogue system that interactively learns visually grounded word meanings from a human tutor, optimizing dialogue strategies to improve learning efficiency and accuracy.

Contribution

It develops an adaptive dialogue policy that balances classifier accuracy and tutoring costs, integrating semantic parsing with visual classifiers in an interactive setting.

Findings

01

Dialogue initiative affects learning performance

02

Expressing confidence levels improves learning efficiency

03

Handling elliptical and incremental turns enhances system robustness

Abstract

We present a multi-modal dialogue system for interactive learning of perceptually grounded word meanings from a human tutor. The system integrates an incremental, semantic parsing/generation framework - Dynamic Syntax and Type Theory with Records (DS-TTR) - with a set of visual classifiers that are learned throughout the interaction and which ground the meaning representations that it produces. We use this system in interaction with a simulated human tutor to study the effects of different dialogue policies and capabilities on the accuracy of learned meanings, learning rates, and efforts/costs to the tutor. We show that the overall performance of the learning agent is affected by (1) who takes initiative in the dialogues; (2) the ability to express/use their confidence level about visual attributes; and (3) the ability to process elliptical and incrementally constructed dialogue turns.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.