Edina: Building an Open Domain Socialbot with Self-dialogues

Ben Krause; Marco Damonte; Mihai Dobre; Daniel Duma; Joachim Fainberg,; Federico Fancellu; Emmanuel Kahembwe; Jianpeng Cheng; Bonnie Webber

arXiv:1709.09816·cs.CL·September 29, 2017·25 cites

Edina: Building an Open Domain Socialbot with Self-dialogues

Ben Krause, Marco Damonte, Mihai Dobre, Daniel Duma, Joachim Fainberg,, Federico Fancellu, Emmanuel Kahembwe, Jianpeng Cheng, Bonnie Webber

PDF

Open Access

TL;DR

Edina is a social chatbot developed for the Amazon Alexa Prize that uses innovative self-dialogues from Mechanical Turk data to train neural networks and improve response quality through a hybrid rule-based and machine learning architecture.

Contribution

The paper introduces a novel self-dialogue data collection technique and a hybrid architecture combining rule-based and neural network components for socialbots.

Findings

01

Self-dialogues are natural and efficient for data collection.

02

Confidence scores from soft rules correlate with reply quality.

03

Hybrid approach improves response coverage and quality.

Abstract

We present Edina, the University of Edinburgh's social bot for the Amazon Alexa Prize competition. Edina is a conversational agent whose responses utilize data harvested from Amazon Mechanical Turk (AMT) through an innovative new technique we call self-dialogues. These are conversations in which a single AMT Worker plays both participants in a dialogue. Such dialogues are surprisingly natural, efficient to collect and reflective of relevant and/or trending topics. These self-dialogues provide training data for a generative neural network as well as a basis for soft rules used by a matching score component. Each match of a soft rule against a user utterance is associated with a confidence score which we show is strongly indicative of reply quality, allowing this component to self-censor and be effectively integrated with other components. Edina's full architecture features a rule-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Speech and dialogue systems