Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning
Jingjing Xu, Xu Sun, Sujian Li, Xiaoyan Cai, Bingzhen Wei

TL;DR
This paper introduces a deep stacking framework that leverages transfer learning and multiple domain-based models to enhance Chinese word segmentation performance in low-resource scenarios.
Contribution
It proposes a novel deep stacking approach with communication paths among models, effectively integrating diverse domain data for low-resource Chinese word segmentation.
Findings
Significant performance improvements over baselines on six low-resource datasets.
Effective integration of multiple domain datasets through deep stacking networks.
Various structures of stacking networks demonstrate versatility and robustness.
Abstract
In recent years, neural networks have proven to be effective in Chinese word segmentation. However, this promising performance relies on large-scale training data. Neural networks with conventional architectures cannot achieve the desired results in low-resource datasets due to the lack of labelled training data. In this paper, we propose a deep stacking framework to improve the performance on word segmentation tasks with insufficient data by integrating datasets from diverse domains. Our framework consists of two parts, domain-based models and deep stacking networks. The domain-based models are used to learn knowledge from different datasets. The deep stacking networks are designed to integrate domain-based models. To reduce model conflicts, we innovatively add communication paths among models and design various structures of deep stacking networks, including Gaussian-based Stacking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
