UmBERTo-MTSA @ AcCompl-It: Improving Complexity and Acceptability   Prediction with Multi-task Learning on Self-Supervised Annotations

Gabriele Sarti

arXiv:2011.05197·cs.CL·December 18, 2020

UmBERTo-MTSA @ AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations

Gabriele Sarti

PDF

1 Repo

TL;DR

This paper presents a self-supervised data augmentation method using multi-task learning on annotations generated by multiple model copies, significantly enhancing neural language models' performance in complexity and acceptability prediction tasks.

Contribution

It introduces a novel self-supervised augmentation technique that leverages multi-task learning on pseudo-annotated data to improve model accuracy with limited labeled data.

Findings

01

Significant improvement in prediction quality for complexity and acceptability tasks.

02

Effective use of unlabeled data through self-supervised annotation.

03

Enhanced model robustness via multi-task training on pseudo-labels.

Abstract

This work describes a self-supervised data augmentation approach used to improve learning models' performances when only a moderate amount of labeled data is available. Multiple copies of the original model are initially trained on the downstream task. Their predictions are then used to annotate a large set of unlabeled examples. Finally, multi-task training is performed on the parallel annotations of the resulting training set, and final scores are obtained by averaging annotator-specific head predictions. Neural language models are fine-tuned using this procedure in the context of the AcCompl-it shared task at EVALITA 2020, obtaining considerable improvements in prediction quality.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gsarti/interpreting-complexity
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Dense Connections · WordPiece · Layer Normalization · Adam · Linear Warmup With Linear Decay · Attention Is All You Need · Weight Decay · Dropout · Attention Dropout