System Demo for Transfer Learning across Vision and Text using Domain Specific CNN Accelerator for On-Device NLP Applications
Baohua Sun, Lin Yang, Michael Lin, Wenhan Zhang, Patrick Dong, Charles, Young, Jason Dong

TL;DR
This paper demonstrates the implementation of transfer learning for text classification and sentiment analysis on mobile devices using power-efficient CNN-DSA chips, achieving state-of-the-art results with low power consumption.
Contribution
It introduces a novel application of CNN-DSA chips for on-device NLP tasks using transfer learning from vision models, with optimized network compression techniques.
Findings
Achieved power consumption below 300mW on mobile devices.
Successfully classified English sentences into 14 ontologies.
Performed sentiment analysis on Chinese reviews.
Abstract
Power-efficient CNN Domain Specific Accelerator (CNN-DSA) chips are currently available for wide use in mobile devices. These chips are mainly used in computer vision applications. However, the recent work of Super Characters method for text classification and sentiment analysis tasks using two-dimensional CNN models has also achieved state-of-the-art results through the method of transfer learning from vision to text. In this paper, we implemented the text classification and sentiment analysis applications on mobile devices using CNN-DSA chips. Compact network representations using one-bit and three-bits precision for coefficients and five-bits for activations are used in the CNN-DSA chip with power consumption less than 300mW. For edge devices under memory and compute constraints, the network is further compressed by approximating the external Fully Connected (FC) layers within the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
