KEHRL: Learning Knowledge-Enhanced Language Representations with Hierarchical Reinforcement Learning
Dongyang Li, Taolin Zhang, Longtao Huang, Chengyu Wang, Xiaofeng He,, Hui Xue

TL;DR
KEHRL introduces a hierarchical reinforcement learning framework that jointly detects injection points and refines knowledge triples, improving factual knowledge probing and natural language understanding performance.
Contribution
This work presents a novel hierarchical reinforcement learning approach that simultaneously detects knowledge injection points and refines knowledge triples, unlike prior methods treating these steps separately.
Findings
KEHRL improves factual knowledge probing accuracy.
Enhances performance on natural language understanding tasks.
Effectively filters irrelevant knowledge triples.
Abstract
Knowledge-enhanced pre-trained language models (KEPLMs) leverage relation triples from knowledge graphs (KGs) and integrate these external data sources into language models via self-supervised learning. Previous works treat knowledge enhancement as two independent operations, i.e., knowledge injection and knowledge integration. In this paper, we propose to learn Knowledge-Enhanced language representations with Hierarchical Reinforcement Learning (KEHRL), which jointly addresses the problems of detecting positions for knowledge injection and integrating external knowledge into the model in order to avoid injecting inaccurate or irrelevant knowledge. Specifically, a high-level reinforcement learning (RL) agent utilizes both internal and prior knowledge to iteratively detect essential positions in texts for knowledge injection, which filters out less meaningful entities to avoid diverting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
