LIDSNet: A Lightweight on-device Intent Detection model using Deep   Siamese Network

Vibhav Agarwal; Sudeep Deepak Shivnikar; Sourav Ghosh; Himanshu Arora,; Yashwant Saini

arXiv:2110.15717·cs.CL·January 31, 2022

LIDSNet: A Lightweight on-device Intent Detection model using Deep Siamese Network

Vibhav Agarwal, Sudeep Deepak Shivnikar, Sourav Ghosh, Himanshu Arora,, Yashwant Saini

PDF

TL;DR

LIDSNet is a lightweight, accurate intent detection model designed for on-device deployment, utilizing a Deep Siamese Network with character-level features and transfer learning to achieve high accuracy with minimal parameters.

Contribution

The paper introduces LIDSNet, a novel lightweight intent detection model that combines a Deep Siamese Network with character-level features and transfer learning for efficient on-device NLP applications.

Findings

01

Achieves 98.00% accuracy on SNIPS dataset

02

Contains under 0.59 million parameters

03

Is 41 times lighter and 30 times faster than MobileBERT on a Galaxy S20

Abstract

Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, there is a need for deploying intent detection model on device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained environment. To this end, we propose LIDSNet, a novel lightweight on-device intent detection model, which accurately predicts the message intent by utilizing a Deep Siamese Network for learning better sentence representations. We use character-level features to enrich the sentence-level representations and empirically demonstrate the advantage of transfer learning by utilizing pre-trained embeddings. Furthermore, to investigate the efficacy of the modules in our architecture, we conduct an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · Multi-Head Attention · Softmax · Residual Connection · Dense Connections