SememeLM: A Sememe Knowledge Enhanced Method for Long-tail Relation Representation
Shuyi Li, Shaojuan Wu, Xiaowang Zhang, Zhiyong Feng

TL;DR
SememeLM enhances long-tail relation representations by integrating sememe knowledge with language models, addressing the challenge of limited semantic features for rare relations, and outperforms existing methods in relation distinction tasks.
Contribution
Proposes SememeLM, a novel method that incorporates sememe knowledge and a consistency alignment module to improve long-tail relation representation in language models.
Findings
Outperforms state-of-the-art methods on word analogy datasets.
Effectively distinguishes subtle differences in relation representations.
Reduces noise in external knowledge integration.
Abstract
Recognizing relations between two words is a fundamental task with the broad applications. Different from extracting relations from text, it is difficult to identify relations among words without their contexts. Especially for long-tail relations, it becomes more difficult due to inadequate semantic features. Existing approaches based on language models (LMs) utilize rich knowledge of LMs to enhance the semantic features of relations. However, they capture uncommon relations while overlooking less frequent but meaningful ones since knowledge of LMs seriously relies on trained data where often represents common relations. On the other hand, long-tail relations are often uncommon in training data. It is interesting but not trivial to use external knowledge to enrich LMs due to collecting corpus containing long-tail relationships is hardly feasible. In this paper, we propose a sememe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Semantic Web and Ontologies
