TL;DR
This paper introduces a novel question embedding method based on Shannon entropy to improve intent classification in goal-oriented dialogue systems, especially effective with small datasets.
Contribution
The paper proposes a Shannon entropy-based question embedding approach that outperforms traditional methods in low-data scenarios for intent classification.
Findings
Proposed entropy-based embeddings outperform traditional models.
Method performs well with small datasets.
Experimental results show improved accuracy in intent detection.
Abstract
Question-answering systems and voice assistants are becoming major part of client service departments of many organizations, helping them to reduce the labor costs of staff. In many such systems, there is always natural language understanding module that solves intent classification task. This task is complicated because of its case-dependency - every subject area has its own semantic kernel. The state of art approaches for intent classification are different machine learning and deep learning methods that use text vector representations as input. The basic vector representation models such as Bag of words and TF-IDF generate sparse matrixes, which are becoming very big as the amount of input data grows. Modern methods such as word2vec and FastText use neural networks to evaluate word embeddings with fixed dimension size. As we are developing a question-answering system for students and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLogistic Regression · fastText
