On the Limits of Learning to Actively Learn Semantic Representations
Omri Koshorek, Gabriel Stanovsky, Yichu Zhou, Vivek Srikumar, Jonathan, Berant

TL;DR
This paper investigates the effectiveness of learning to actively learn semantic representations, revealing that current methods offer limited improvements due to the interaction between optimization and sample selection.
Contribution
The study critically evaluates LTAL for semantic representations, identifying key factors limiting its effectiveness and highlighting the importance of optimization-policy interaction.
Findings
Oracle policy does not significantly outperform random sampling.
Successful LTAL applications show minimal dependence on optimization.
Optimization stochasticity impacts sample selection in LTAL.
Abstract
One of the goals of natural language understanding is to develop models that map sentences into meaning representations. However, training such models requires expensive annotation of complex structures, which hinders their adoption. Learning to actively-learn (LTAL) is a recent paradigm for reducing the amount of labeled data by learning a policy that selects which samples should be labeled. In this work, we examine LTAL for learning semantic representations, such as QA-SRL. We show that even an oracle policy that is allowed to pick examples that maximize performance on the test set (and constitutes an upper bound on the potential of LTAL), does not substantially improve performance compared to a random policy. We investigate factors that could explain this finding and show that a distinguishing characteristic of successful applications of LTAL is the interaction between optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
