Teach me how to Label: Labeling Functions from Natural Language with Text-to-text Transformers
Yannis Papanikolaou

TL;DR
This paper introduces a novel method using text-to-text Transformers to convert natural language descriptions into Python labeling functions, significantly improving semantic parsing performance and advancing natural language-based data labeling.
Contribution
The paper presents a new approach to semantic parsing with pre-trained text-to-text Transformers, achieving state-of-the-art results on CoNaLa and a new dataset for natural language to labeling function conversion.
Findings
Achieved 3.7 BLEU point improvement on CoNaLa benchmark.
Attained a BLEU score of 0.39 on a custom dataset.
Demonstrated potential for natural language-based data labeling methods.
Abstract
Annotated data has become the most important bottleneck in training accurate machine learning models, especially for areas that require domain expertise. A recent approach to deal with the above issue proposes using natural language explanations instead of labeling individual data points, thereby increasing human annotators' efficiency as well as decreasing costs substantially. This paper focuses on the task of turning these natural language descriptions into Python labeling functions by following a novel approach to semantic parsing with pre-trained text-to-text Transformers. In a series of experiments our approach achieves a new state of the art on the semantic parsing benchmark CoNaLa, surpassing the previous best approach by 3.7 BLEU points. Furthermore, on a manually constructed dataset of natural language descriptions-labeling functions pairs we achieve a BLEU of 0.39. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
