LIDSNet: A Lightweight on-device Intent Detection model using Deep Siamese Network
Vibhav Agarwal, Sudeep Deepak Shivnikar, Sourav Ghosh, Himanshu Arora,, Yashwant Saini

TL;DR
LIDSNet is a lightweight, accurate intent detection model designed for on-device deployment, utilizing a Deep Siamese Network with character-level features and transfer learning to achieve high accuracy with minimal parameters.
Contribution
The paper introduces LIDSNet, a novel lightweight intent detection model that combines a Deep Siamese Network with character-level features and transfer learning for efficient on-device NLP applications.
Findings
Achieves 98.00% accuracy on SNIPS dataset
Contains under 0.59 million parameters
Is 41 times lighter and 30 times faster than MobileBERT on a Galaxy S20
Abstract
Intent detection is a crucial task in any Natural Language Understanding (NLU) system and forms the foundation of a task-oriented dialogue system. To build high-quality real-world conversational solutions for edge devices, there is a need for deploying intent detection model on device. This necessitates a light-weight, fast, and accurate model that can perform efficiently in a resource-constrained environment. To this end, we propose LIDSNet, a novel lightweight on-device intent detection model, which accurately predicts the message intent by utilizing a Deep Siamese Network for learning better sentence representations. We use character-level features to enrich the sentence-level representations and empirically demonstrate the advantage of transfer learning by utilizing pre-trained embeddings. Furthermore, to investigate the efficacy of the modules in our architecture, we conduct an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · Multi-Head Attention · Softmax · Residual Connection · Dense Connections
