PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Population
Tianqing Fang, Quyet V. Do, Hongming Zhang, Yangqiu Song, Ginny Y., Wong, Simon See

TL;DR
PseudoReasoner is a semi-supervised framework that leverages pseudo labels generated by a teacher model to improve commonsense knowledge base population, especially enhancing out-of-domain generalization.
Contribution
It introduces a novel semi-supervised learning approach using pseudo labels and a filtering procedure, achieving state-of-the-art results in CSKB population.
Findings
Improves KG-BERT performance by 3.3 points overall.
Enhances out-of-domain performance by 5.3 points.
Achieves state-of-the-art results on CSKB population.
Abstract
Commonsense Knowledge Base (CSKB) Population aims at reasoning over unseen entities and assertions on CSKBs, and is an important yet hard commonsense reasoning task. One challenge is that it requires out-of-domain generalization ability as the source CSKB for training is of a relatively smaller scale (1M) while the whole candidate space for population is way larger (200M). We propose PseudoReasoner, a semi-supervised learning framework for CSKB population that uses a teacher model pre-trained on CSKBs to provide pseudo labels on the unlabeled candidate dataset for a student model to learn from. The teacher can be a generative model rather than restricted to discriminative models as previous works. In addition, we design a new filtering procedure for pseudo labels based on influence function and the student model's prediction to further improve the performance. The framework can improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
MethodsBalanced Selection
