Post-Training Quantization of Generative and Discriminative LSTM Text Classifiers: A Study of Calibration, Class Balance, and Robustness
Md Mushfiqur Rahaman, Elliot Chang, Tasmiah Haque, Srinjoy Das

TL;DR
This paper compares the effects of post-training quantization on generative and discriminative LSTM text classifiers, focusing on calibration, class imbalance, and robustness in edge computing scenarios.
Contribution
It provides a comprehensive analysis of how PTQ impacts generative versus discriminative LSTM classifiers, highlighting the importance of calibration data and class balance.
Findings
Discriminative classifiers are more robust to quantization noise.
Generative classifiers are sensitive to calibration data and class imbalance.
Class imbalance during calibration degrades generative classifier performance at low bitwidths.
Abstract
Text classification plays a pivotal role in edge computing applications like industrial monitoring, health diagnostics, and smart assistants, where low latency and high accuracy are both key requirements. Generative classifiers, in particular, have been shown to exhibit robustness to out-of-distribution and noisy data, which is an extremely critical consideration for deployment in such real-time edge environments. However, deploying such models on edge devices faces computational and memory constraints. Post Training Quantization (PTQ) reduces model size and compute costs without retraining, making it ideal for edge deployment. In this work, we present a comprehensive comparative study of generative and discriminative Long Short Term Memory (LSTM)-based text classification models with PTQ using the Brevitas quantization library. We evaluate both types of classifier models across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
