Improved Deep Learning Baselines for Ubuntu Corpus Dialogs

Rudolf Kadlec; Martin Schmid; Jan Kleindienst

arXiv:1510.03753 · cs.CL · November 4, 2015 · 92 cites

TL;DR

This paper independently re-evaluates previously reported models for next utterance ranking on the Ubuntu Dialog Corpus, compares LSTM, Bi-LSTM, and CNN architectures, and reaches a state-of-the-art result by averaging the predictions of multiple models.

Contribution

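It provides an independent evaluation of previously reported models using an in-house implementation on the same data, compares LSTM, Bi-LSTM, and CNN architectures, and builds an ensemble that averages the predictions of multiple models to improve performance.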

Findings

01. Ensemble models outperform individual models.

02. Achieved state-of-the-art ranking performance.

03. Evaluated multiple neural network architectures.

Abstract

This paper presents results of our experiments for the next utterance ranking on the Ubuntu Dialog Corpus -- the largest publicly available multi-turn dialog corpus. First, we use an in-house implementation of previously reported models to do an independent evaluation using the same data. Second, we evaluate the performances of various LSTMs, Bi-LSTMs and CNNs on the dataset. Third, we create an ensemble by averaging predictions of multiple models. The ensemble further improves the performance and it achieves a state-of-the-art result for the next utterance ranking on this dataset. Finally, we discuss our future plans using this corpus.
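
The ensembling step described in the abstract, averaging the predictions of multiple models, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the three example models, and all scores are hypothetical, and each model is assumed to output one relevance score per candidate response for a given dialog context.

```python
from statistics import mean


def ensemble_scores(model_scores):
    """Average per-candidate scores across models.

    model_scores: list of lists, one inner list per model, each holding
    one score per candidate response (hypothetical input format).
    """
    if not model_scores:
        raise ValueError("need scores from at least one model")
    n_candidates = len(model_scores[0])
    if any(len(scores) != n_candidates for scores in model_scores):
        raise ValueError("every model must score the same candidates")
    # zip(*...) groups the scores that different models gave to the same candidate.
    return [mean(per_candidate) for per_candidate in zip(*model_scores)]


def rank_candidates(scores):
    """Return candidate indices ordered from highest to lowest score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


# Hypothetical scores for three candidate responses (illustrative values only).
lstm_scores = [0.60, 0.70, 0.20]
bilstm_scores = [0.80, 0.50, 0.10]
cnn_scores = [0.70, 0.40, 0.30]

averaged = ensemble_scores([lstm_scores, bilstm_scores, cnn_scores])
ranking = rank_candidates(averaged)
print(ranking)
```

In this made-up example the LSTM alone would rank candidate 1 first, while the averaged scores put candidate 0 on top, which illustrates how averaging can smooth over a single model's error.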

Taxonomy

Topics: Topic Modeling · Speech and dialogue systems · ICT in Developing Communities