Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding
Ruyao Xu, Taolin Zhang, Chengyu Wang, Zhongjie Duan, Cen Chen, Minghui, Qiu, Dawei Cheng, Xiaofeng He, Weining Qian

TL;DR
This paper introduces KANGAROO, a novel framework for learning knowledge-enhanced language representations tailored for closed-domain NLP tasks, effectively capturing implicit graph structures and addressing knowledge sparsity.
Contribution
The paper proposes a new method combining shallow relational triples and hyperbolic embeddings, along with contrastive learning on subgraphs, to improve KEPLMs in closed domains.
Findings
KANGAROO outperforms existing KEPLM methods in closed-domain NLP tasks.
The approach effectively handles knowledge sparsity and hierarchical structures.
Significant improvements in both full and few-shot learning settings.
Abstract
Knowledge-Enhanced Pre-trained Language Models (KEPLMs) improve the performance of various downstream NLP tasks by injecting knowledge facts from large-scale Knowledge Graphs (KGs). However, existing methods for pre-training KEPLMs with relational triples are difficult to be adapted to close domains due to the lack of sufficient domain graph semantics. In this paper, we propose a Knowledge-enhanced lANGuAge Representation learning framework for various clOsed dOmains (KANGAROO) via capturing the implicit graph structure among the entities. Specifically, since the entity coverage rates of closed-domain KGs can be relatively low and may exhibit the global sparsity phenomenon for knowledge injection, we consider not only the shallow relational representations of triples but also the hyperbolic embeddings of deep hierarchical entity-class structures for effective knowledge fusion.Moreover,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsContrastive Learning
