Where's the Learning in Representation Learning for Compositional Semantics and the Case of Thematic Fit
Mughilan Muthupari, Samrat Halder, Asad Sayeed, Yuval Marton

TL;DR
This paper investigates where learning occurs in representation learning for compositional semantics, revealing that for some NLP tasks, random embeddings perform as well as pretrained ones, and analyzing the influence of training data size and task type.
Contribution
It provides a nuanced analysis of the sources of learned information in embeddings and models, especially in the context of semantic role prediction and thematic fit estimation.
Findings
Random embeddings can perform as well as pretrained embeddings in certain tasks.
The relation between task quality and training data size is non-monotonous.
Easier, per-verb task variants help analyze learning dynamics.
Abstract
Observing that for certain NLP tasks, such as semantic role prediction or thematic fit estimation, random embeddings perform as well as pretrained embeddings, we explore what settings allow for this and examine where most of the learning is encoded: the word embeddings, the semantic role embeddings, or ``the network''. We find nuanced answers, depending on the task and its relation to the training objective. We examine these representation learning aspects in multi-task learning, where role prediction and role-filling are supervised tasks, while several thematic fit tasks are outside the models' direct supervision. We observe a non-monotonous relation between some tasks' quality score and the training data size. In order to better understand this observation, we analyze these results using easier, per-verb versions of these tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
