A Bi-Encoder LSTM Model For Learning Unstructured Dialogs

Danny Brahman; Pooran S. Negi; Mohammad Mahoor

arXiv:2104.12269 · cs.CL · January 28, 2026 · 1 citation

TL;DR

This paper introduces a Bi-Encoder LSTM model for learning unstructured multi-turn dialogs, improving response selection accuracy in retrieval-based chatbots using large dialog datasets.

Contribution

The paper proposes an LSTM-based architecture for dialog response selection and shows, on the Ubuntu Dialog Corpus Version 2, higher Recall@1, Recall@2 and Recall@5 than the benchmark model.

Findings

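01 Achieved 0.8%, 1.0% and 0.3% higher Recall@1, Recall@2 and Recall@5, respectively, than the benchmark model.

02 Evaluated several similarity functions, model hyper-parameters and word embeddings on the proposed architecture.

03 Validated the model on large-scale dialog data (Ubuntu Dialog Corpus Version 2).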
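
As a rough illustration of what comparing similarity functions can look like in a retrieval setting, the sketch below scores candidate response vectors against a context vector and ranks them. This page does not name the similarity functions the authors tried, so dot product and cosine similarity are assumptions chosen because they are common; the function names and toy vectors are hypothetical and are not taken from the paper or its repository.

```python
import math

# Hypothetical helpers: the similarity functions used in the paper are not
# listed on this page, so dot product and cosine are assumed for illustration.

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    norm = math.sqrt(dot(u, u)) * math.sqrt(dot(v, v))
    return dot(u, v) / norm if norm else 0.0

def rank_candidates(context_vec, candidate_vecs, similarity):
    """Return candidate indices ordered from most to least similar."""
    scores = [similarity(context_vec, c) for c in candidate_vecs]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

# Toy encodings standing in for the outputs of a context encoder and a
# response encoder.
context = [1.0, 0.0]
candidates = [[0.0, 1.0], [2.0, 0.0], [1.0, 1.0]]
dot_ranking = rank_candidates(context, candidates, dot)
cosine_ranking = rank_candidates(context, candidates, cosine)
```

Swapping the `similarity` argument changes only the scoring step, which is what makes it straightforward to compare several functions on the same encoders.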

Abstract

Creating a data-driven model that is trained on a large dataset of unstructured dialogs is a crucial step in developing retrieval-based chatbot systems. This paper presents a Long Short-Term Memory (LSTM) based architecture that learns unstructured multi-turn dialogs and provides results on the task of selecting the best response from a collection of given responses. Ubuntu Dialog Corpus Version 2 was used as the corpus for training. We show that our model achieves 0.8%, 1.0% and 0.3% higher accuracy for Recall@1, Recall@2 and Recall@5, respectively, than the benchmark model. We also show results from experiments with several similarity functions, model hyper-parameters and word embeddings on the proposed architecture.
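
The Recall@k figures quoted in the abstract can be read as follows: for each dialog context the model scores every candidate response, and the example counts as a hit if the correct response is among the k highest-scoring candidates. The sketch below is a minimal version of that computation, assuming exactly one correct response per example and, as a hypothetical convention, that it sits at index 0 of each score list. The scores are made-up toy values, not results from the paper.

```python
def recall_at_k(score_lists, k, true_index=0):
    """Fraction of examples whose correct response ranks in the top k.

    score_lists: one list of candidate scores per example.
    true_index:  position of the correct response in each list
                 (assumed to be 0 here; a convention chosen for this sketch).
    """
    hits = 0
    for scores in score_lists:
        # Candidate indices sorted by descending score.
        ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
        if true_index in ranked[:k]:
            hits += 1
    return hits / len(score_lists)

# Three toy examples with three candidates each.
toy_scores = [
    [0.9, 0.1, 0.3],  # correct response ranked 1st
    [0.2, 0.8, 0.5],  # correct response ranked 3rd
    [0.4, 0.6, 0.1],  # correct response ranked 2nd
]
r1 = recall_at_k(toy_scores, k=1)
r2 = recall_at_k(toy_scores, k=2)
r3 = recall_at_k(toy_scores, k=3)
```

Recall@k can only stay the same or grow as k increases, which is why gains at Recall@5 tend to be smaller than gains at Recall@1 when the baseline is already close to its ceiling.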

Code & Models

Repositories

DiwanshuShekhar/bi_encoder_lstm
tf · Official

Taxonomy

Topics: Topic Modeling · Speech and dialogue systems · Sentiment Analysis and Opinion Mining