Mining Commonsense Facts from the Physical World

Yanyan Zou; Wei Lu; Xu Sun

arXiv:2002.03149·cs.CL·April 15, 2020·1 cites

Mining Commonsense Facts from the Physical World

Yanyan Zou, Wei Lu, Xu Sun

PDF

Open Access

TL;DR

This paper introduces a new method for automatically extracting commonsense facts about the physical world from raw text, enhancing knowledge base coverage by combining textual and existing knowledge sources.

Contribution

The paper proposes a novel model that fuses sequence text and knowledge base information, along with creating large datasets for commonsense knowledge base completion.

Findings

01

Model significantly outperforms baselines

02

Creates two large annotated datasets

03

Improves coverage of commonsense knowledge bases

Abstract

Textual descriptions of the physical world implicitly mention commonsense facts, while the commonsense knowledge bases explicitly represent such facts as triples. Compared to dramatically increased text data, the coverage of existing knowledge bases is far away from completion. Most of the prior studies on populating knowledge bases mainly focus on Freebase. To automatically complete commonsense knowledge bases to improve their coverage is under-explored. In this paper, we propose a new task of mining commonsense facts from the raw text that describes the physical world. We build an effective new model that fuses information from both sequence text and existing knowledge base resource. Then we create two large annotated datasets each with approximate 200k instances for commonsense knowledge base completion. Empirical results demonstrate that our model significantly outperforms baselines.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies