TL;DR
This paper presents a self-supervised data augmentation method using multi-task learning on annotations generated by multiple model copies, significantly enhancing neural language models' performance in complexity and acceptability prediction tasks.
Contribution
It introduces a novel self-supervised augmentation technique that leverages multi-task learning on pseudo-annotated data to improve model accuracy with limited labeled data.
Findings
Significant improvement in prediction quality for complexity and acceptability tasks.
Effective use of unlabeled data through self-supervised annotation.
Enhanced model robustness via multi-task training on pseudo-labels.
Abstract
This work describes a self-supervised data augmentation approach used to improve learning models' performances when only a moderate amount of labeled data is available. Multiple copies of the original model are initially trained on the downstream task. Their predictions are then used to annotate a large set of unlabeled examples. Finally, multi-task training is performed on the parallel annotations of the resulting training set, and final scores are obtained by averaging annotator-specific head predictions. Neural language models are fine-tuned using this procedure in the context of the AcCompl-it shared task at EVALITA 2020, obtaining considerable improvements in prediction quality.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Dense Connections · WordPiece · Layer Normalization · Adam · Linear Warmup With Linear Decay · Attention Is All You Need · Weight Decay · Dropout · Attention Dropout
