Auto-tagging of Short Conversational Sentences using Natural Language   Processing Methods

\c{S}\"ukr\"u Ozan; D. Emre Ta\c{s}ar

arXiv:2106.04959·cs.CL·June 10, 2021

Auto-tagging of Short Conversational Sentences using Natural Language Processing Methods

\c{S}\"ukr\"u Ozan, D. Emre Ta\c{s}ar

PDF

1 Repo

TL;DR

This paper develops and compares transformer-based models for auto-tagging short conversational sentences in a specific domain, aiming to enhance chatbot dialogue generation.

Contribution

It introduces a dataset of manually tagged conversational sentences and evaluates multiple models, achieving the best results with BERT for domain-specific auto-tagging.

Findings

01

BERT outperforms other models in auto-tagging accuracy

02

Manually tagged dataset of 14,000 sentences created for this task

03

Models are publicly available for replication and further research

Abstract

In this study, we aim to find a method to auto-tag sentences specific to a domain. Our training data comprises short conversational sentences extracted from chat conversations between company's customer representatives and web site visitors. We manually tagged approximately 14 thousand visitor inputs into ten basic categories, which will later be used in a transformer-based language model with attention mechanisms for the ultimate goal of developing a chatbot application that can produce meaningful dialogue. We considered three different state-of-the-art models and reported their auto-tagging capabilities. We achieved the best performance with the bidirectional encoder representation from transformers (BERT) model. Implementation of the models used in these experiments can be cloned from our GitHub repository and tested for similar auto-tagging problems without much effort.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adresgezgini/NLP4AT
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.