Exploiting Word Semantics to Enrich Character Representations of Chinese   Pre-trained Models

Wenbiao Li; Rui Sun; Yunfang Wu

arXiv:2207.05928·cs.CL·July 14, 2022

Exploiting Word Semantics to Enrich Character Representations of Chinese Pre-trained Models

Wenbiao Li, Rui Sun, Yunfang Wu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to incorporate lexical semantics and word structure into Chinese character-based pre-trained models, improving their performance on various NLP tasks by leveraging word-level information.

Contribution

It proposes a novel approach to integrate word semantics into character representations, including a word-to-character alignment and ensemble segmentation to enhance Chinese NLP models.

Findings

01

Achieves superior performance on multiple Chinese NLP tasks.

02

Effectively incorporates word semantics into character-based models.

03

Improves robustness against segmentation errors.

Abstract

Most of the Chinese pre-trained models adopt characters as basic units for downstream tasks. However, these models ignore the information carried by words and thus lead to the loss of some important semantics. In this paper, we propose a new method to exploit word structure and integrate lexical semantics into character representations of pre-trained models. Specifically, we project a word's embedding into its internal characters' embeddings according to the similarity weight. To strengthen the word boundary information, we mix the representations of the internal characters within a word. After that, we apply a word-to-character alignment attention mechanism to emphasize important characters by masking unimportant ones. Moreover, in order to reduce the error propagation caused by word segmentation, we present an ensemble approach to combine segmentation results given by different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liwb1219/hrmf
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · ERNIE · Linear Layer · Attention Dropout · Adam · Residual Connection · Layer Normalization · Linear Warmup With Linear Decay