Total Recall: a Customized Continual Learning Method for Neural Semantic Parsers
Zhuang Li, Lizhen Qu, Gholamreza Haffari

TL;DR
This paper introduces TotalRecall, a novel continual learning method for neural semantic parsers that improves performance and efficiency by specialized memory replay and a two-stage training process, outperforming existing algorithms.
Contribution
The paper proposes TotalRecall, a new continual learning approach tailored for neural semantic parsing, addressing the unique challenges of structured outputs and improving generalization and speed.
Findings
TotalRecall outperforms SOTA continual learning algorithms in semantic parsing tasks.
It achieves 3-6 times speedup compared to re-training from scratch.
Extensive experiments validate the effectiveness of the proposed method.
Abstract
This paper investigates continual learning for semantic parsing. In this setting, a neural semantic parser learns tasks sequentially without accessing full training data from previous tasks. Direct application of the SOTA continual learning algorithms to this problem fails to achieve comparable performance with re-training models with all seen tasks because they have not considered the special properties of structured outputs yielded by semantic parsers. Therefore, we propose TotalRecall, a continual learning method designed for neural semantic parsers from two aspects: i) a sampling method for memory replay that diversifies logical form templates and balances distributions of parse actions in a memory; ii) a two-stage training method that significantly improves generalization capability of the parsers across tasks. We conduct extensive experiments to study the research problems…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
