Deep Stacking Networks for Low-Resource Chinese Word Segmentation with   Transfer Learning

Jingjing Xu; Xu Sun; Sujian Li; Xiaoyan Cai; Bingzhen Wei

arXiv:1711.01427·cs.CL·November 7, 2017

Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning

Jingjing Xu, Xu Sun, Sujian Li, Xiaoyan Cai, Bingzhen Wei

PDF

Open Access

TL;DR

This paper introduces a deep stacking framework that leverages transfer learning and multiple domain-based models to enhance Chinese word segmentation performance in low-resource scenarios.

Contribution

It proposes a novel deep stacking approach with communication paths among models, effectively integrating diverse domain data for low-resource Chinese word segmentation.

Findings

01

Significant performance improvements over baselines on six low-resource datasets.

02

Effective integration of multiple domain datasets through deep stacking networks.

03

Various structures of stacking networks demonstrate versatility and robustness.

Abstract

In recent years, neural networks have proven to be effective in Chinese word segmentation. However, this promising performance relies on large-scale training data. Neural networks with conventional architectures cannot achieve the desired results in low-resource datasets due to the lack of labelled training data. In this paper, we propose a deep stacking framework to improve the performance on word segmentation tasks with insufficient data by integrating datasets from diverse domains. Our framework consists of two parts, domain-based models and deep stacking networks. The domain-based models are used to learn knowledge from different datasets. The deep stacking networks are designed to integrate domain-based models. To reduce model conflicts, we innovatively add communication paths among models and design various structures of deep stacking networks, including Gaussian-based Stacking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis