Green CWS: Extreme Distillation and Efficient Decode Method Towards   Industrial Application

Yulan Hu; Yong Liu

arXiv:2111.09078·cs.AI·November 18, 2021

Green CWS: Extreme Distillation and Efficient Decode Method Towards Industrial Application

Yulan Hu, Yong Liu

PDF

Open Access

TL;DR

This paper introduces a lightweight Transformer-based Chinese Word Segmentation framework with an enhanced decoding method, achieving high accuracy and efficiency suitable for industrial low-resource scenarios.

Contribution

It proposes a novel distillation and decoding approach combining a lightweight Transformer model with an improved CRF method for efficient CWS.

Findings

01

Achieves 14% of the inference time of BERT-based models.

02

Outperforms traditional decoding methods in low-resource settings.

03

Maintains high segmentation accuracy across multiple datasets.

Abstract

Benefiting from the strong ability of the pre-trained model, the research on Chinese Word Segmentation (CWS) has made great progress in recent years. However, due to massive computation, large and complex models are incapable of empowering their ability for industrial use. On the other hand, for low-resource scenarios, the prevalent decode method, such as Conditional Random Field (CRF), fails to exploit the full information of the training data. This work proposes a fast and accurate CWS framework that incorporates a light-weighted model and an upgraded decode method (PCRF) towards industrially low-resource CWS scenarios. First, we distill a Transformer-based student model as an encoder, which not only accelerates the inference speed but also combines open knowledge and domain-specific knowledge. Second, the perplexity score to evaluate the language model is fused into the CRF module to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Conditional Random Field