Green CWS: Extreme Distillation and Efficient Decode Method Towards Industrial Application
Yulan Hu, Yong Liu

TL;DR
This paper introduces a lightweight Transformer-based Chinese Word Segmentation framework with an enhanced decoding method, achieving high accuracy and efficiency suitable for industrial low-resource scenarios.
Contribution
It proposes a novel distillation and decoding approach combining a lightweight Transformer model with an improved CRF method for efficient CWS.
Findings
Achieves 14% of the inference time of BERT-based models.
Outperforms traditional decoding methods in low-resource settings.
Maintains high segmentation accuracy across multiple datasets.
Abstract
Benefiting from the strong ability of the pre-trained model, the research on Chinese Word Segmentation (CWS) has made great progress in recent years. However, due to massive computation, large and complex models are incapable of empowering their ability for industrial use. On the other hand, for low-resource scenarios, the prevalent decode method, such as Conditional Random Field (CRF), fails to exploit the full information of the training data. This work proposes a fast and accurate CWS framework that incorporates a light-weighted model and an upgraded decode method (PCRF) towards industrially low-resource CWS scenarios. First, we distill a Transformer-based student model as an encoder, which not only accelerates the inference speed but also combines open knowledge and domain-specific knowledge. Second, the perplexity score to evaluate the language model is fused into the CRF module to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Conditional Random Field
