Mapping and Cleaning Open Commonsense Knowledge Bases with Generative   Translation

Julien Romero; Simon Razniewski

arXiv:2306.12766·cs.CL·June 23, 2023

Mapping and Cleaning Open Commonsense Knowledge Bases with Generative Translation

Julien Romero, Simon Razniewski

PDF

Open Access 1 Repo 6 Models

TL;DR

This paper introduces a generative translation method using language models to map and clean open commonsense knowledge bases into fixed schemas, improving accuracy and reducing noise compared to traditional methods.

Contribution

It presents a novel generative translation approach for mapping open KBs into fixed schemas, specifically for commonsense knowledge, balancing accuracy and noise reduction.

Findings

01

Higher mapping accuracy than rule-based methods

02

Reduces noise compared to purely generative approaches

03

Balances traditional and modern KB construction techniques

Abstract

Structured knowledge bases (KBs) are the backbone of many know\-ledge-intensive applications, and their automated construction has received considerable attention. In particular, open information extraction (OpenIE) is often used to induce structure from a text. However, although it allows high recall, the extracted knowledge tends to inherit noise from the sources and the OpenIE algorithm. Besides, OpenIE tuples contain an open-ended, non-canonicalized set of relations, making the extracted knowledge's downstream exploitation harder. In this paper, we study the problem of mapping an open KB into the fixed schema of an existing KB, specifically for the case of commonsense knowledge. We propose approaching the problem by generative translation, i.e., by training a language model to generate fixed-schema assertions from open ones. Experiments show that this approach occupies a sweet spot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Aunsiels/GenT
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling