Applications of BERT Based Sequence Tagging Models on Chinese Medical   Text Attributes Extraction

Gang Zhao; Teng Zhang; Chenxiao Wang; Ping Lv; Ji Wu

arXiv:2008.09740·cs.CL·August 25, 2020·1 cites

Applications of BERT Based Sequence Tagging Models on Chinese Medical Text Attributes Extraction

Gang Zhao, Teng Zhang, Chenxiao Wang, Ping Lv, Ji Wu

PDF

Open Access

TL;DR

This paper explores various BERT-based sequence tagging models, including LSTM-CRF, CNN, UCNN, WaveNet, and SelfAttention, for extracting attributes from Chinese medical texts, achieving competitive results through model ensembling.

Contribution

It introduces the application of multiple BERT-based sequence models to Chinese medical text attribute extraction and demonstrates the effectiveness of ensembling these models.

Findings

01

Models reach similar performance levels.

02

Ensembling improves overall system accuracy.

03

Achieved strong results on CCKS 2019 task 1.

Abstract

We convert the Chinese medical text attributes extraction task into a sequence tagging or machine reading comprehension task. Based on BERT pre-trained models, we have not only tried the widely used LSTM-CRF sequence tagging model, but also other sequence models, such as CNN, UCNN, WaveNet, SelfAttention, etc, which reaches similar performance as LSTM+CRF. This sheds a light on the traditional sequence tagging models. Since the aspect of emphasis for different sequence tagging models varies substantially, ensembling these models adds diversity to the final system. By doing so, our system achieves good performance on the task of Chinese medical text attributes extraction (subtask 2 of CCKS 2019 task 1).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Advanced Text Analysis Techniques

MethodsLinear Layer · Dilated Causal Convolution · Attention Dropout · Weight Decay · Adam · Dropout · WordPiece · Mixture of Logistic Distributions · Multi-Head Attention · Residual Connection