Modelling Commonsense Properties using Pre-Trained Bi-Encoders
Amit Gajbhiye, Luis Espinosa-Anke, Steven Schockaert

TL;DR
This paper investigates the limitations of current language models in capturing commonsense properties, proposing a fine-tuning approach with concept and property encoders that significantly improves prediction accuracy.
Contribution
It introduces a novel method of fine-tuning separate concept and property encoders to better model commonsense properties, outperforming standard language models.
Findings
Fine-tuned encoders outperform direct language model fine-tuning in predicting properties.
Performance drops significantly when training and testing concepts do not overlap.
The approach improves unsupervised hypernym discovery results.
Abstract
Grasping the commonsense properties of everyday concepts is an important prerequisite to language understanding. While contextualised language models are reportedly capable of predicting such commonsense properties with human-level accuracy, we argue that such results have been inflated because of the high similarity between training and test concepts. This means that models which capture concept similarity can perform well, even if they do not capture any knowledge of the commonsense properties themselves. In settings where there is no overlap between the properties that are considered during training and testing, we find that the empirical performance of standard language models drops dramatically. To address this, we study the possibility of fine-tuning language models to explicitly model concepts and their properties. In particular, we train separate concept and property encoders on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsTest
