Exploiting Word Semantics to Enrich Character Representations of Chinese Pre-trained Models
Wenbiao Li, Rui Sun, Yunfang Wu

TL;DR
This paper introduces a method to incorporate lexical semantics and word structure into Chinese character-based pre-trained models, improving their performance on various NLP tasks by leveraging word-level information.
Contribution
It proposes a novel approach to integrate word semantics into character representations, including a word-to-character alignment and ensemble segmentation to enhance Chinese NLP models.
Findings
Achieves superior performance on multiple Chinese NLP tasks.
Effectively incorporates word semantics into character-based models.
Improves robustness against segmentation errors.
Abstract
Most of the Chinese pre-trained models adopt characters as basic units for downstream tasks. However, these models ignore the information carried by words and thus lead to the loss of some important semantics. In this paper, we propose a new method to exploit word structure and integrate lexical semantics into character representations of pre-trained models. Specifically, we project a word's embedding into its internal characters' embeddings according to the similarity weight. To strengthen the word boundary information, we mix the representations of the internal characters within a word. After that, we apply a word-to-character alignment attention mechanism to emphasize important characters by masking unimportant ones. Moreover, in order to reduce the error propagation caused by word segmentation, we present an ensemble approach to combine segmentation results given by different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · ERNIE · Linear Layer · Attention Dropout · Adam · Residual Connection · Layer Normalization · Linear Warmup With Linear Decay
