Semi-Supervised Lifelong Language Learning

Yingxiu Zhao; Yinhe Zheng; Bowen Yu; Zhiliang Tian; Dongkyu Lee; Jian; Sun; Haiyang Yu; Yongbin Li; Nevin L. Zhang

arXiv:2211.13050·cs.CL·November 24, 2022·1 cites

Semi-Supervised Lifelong Language Learning

Yingxiu Zhao, Yinhe Zheng, Bowen Yu, Zhiliang Tian, Dongkyu Lee, Jian, Sun, Haiyang Yu, Yongbin Li, Nevin L. Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces semi-supervised lifelong language learning (SSLL), enabling models to learn sequential language tasks using both labeled and unlabeled data, addressing catastrophic forgetting and improving knowledge transfer.

Contribution

It proposes a novel SSLL setting and a model with task-specific modules, virtual supervision, and backward augmentation to leverage unlabeled data effectively.

Findings

01

Model outperforms baselines on various language tasks.

02

Unlabeled data enhances knowledge transfer and reduces forgetting.

03

The approach demonstrates the effectiveness of semi-supervised learning in lifelong language tasks.

Abstract

Lifelong learning aims to accumulate knowledge and alleviate catastrophic forgetting when learning tasks sequentially. However, existing lifelong language learning methods only focus on the supervised learning setting. Unlabeled data, which can be easily accessed in real-world scenarios, are underexplored. In this paper, we explore a novel setting, semi-supervised lifelong language learning (SSLL), where a model learns sequentially arriving language tasks with both labeled and unlabeled data. We propose an unlabeled data enhanced lifelong learner to explore SSLL. Specially, we dedicate task-specific modules to alleviate catastrophic forgetting and design two modules to exploit unlabeled data: (1) a virtual supervision enhanced task solver is constructed on a teacher-student framework to mine the underlying knowledge from unlabeled data; and (2) a backward augmented learner is built to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alibabaresearch/damo-convai
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning