Question Embeddings Based on Shannon Entropy: Solving intent   classification task in goal-oriented dialogue system

Aleksandr Perevalov; Daniil Kurushin; Rustam Faizrakhmanov; Farida; Khabibrakhmanova

arXiv:1904.00785·cs.CL·April 2, 2019

Question Embeddings Based on Shannon Entropy: Solving intent classification task in goal-oriented dialogue system

Aleksandr Perevalov, Daniil Kurushin, Rustam Faizrakhmanov, Farida, Khabibrakhmanova

PDF

1 Repo

TL;DR

This paper introduces a novel question embedding method based on Shannon entropy to improve intent classification in goal-oriented dialogue systems, especially effective with small datasets.

Contribution

The paper proposes a Shannon entropy-based question embedding approach that outperforms traditional methods in low-data scenarios for intent classification.

Findings

01

Proposed entropy-based embeddings outperform traditional models.

02

Method performs well with small datasets.

03

Experimental results show improved accuracy in intent detection.

Abstract

Question-answering systems and voice assistants are becoming major part of client service departments of many organizations, helping them to reduce the labor costs of staff. In many such systems, there is always natural language understanding module that solves intent classification task. This task is complicated because of its case-dependency - every subject area has its own semantic kernel. The state of art approaches for intent classification are different machine learning and deep learning methods that use text vector representations as input. The basic vector representation models such as Bag of words and TF-IDF generate sparse matrixes, which are becoming very big as the amount of input data grows. Modern methods such as word2vec and FastText use neural networks to evaluate word embeddings with fixed dimension size. As we are developing a question-answering system for students and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Perevalov/intent_classifier
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLogistic Regression · fastText