Mapping Process for the Task: Wikidata Statements to Text as Wikipedia Sentences

Hoang Thang Ta; Alexander Gelbukha; Grigori Sidorov

arXiv:2210.12659·cs.CL·October 25, 2022·1 cites

Open Access

TL;DR

This paper presents a mapping process to convert Wikidata statements into natural language sentences for Wikipedia, aiming to automate content generation and reduce human effort in multilingual projects.

Contribution

It introduces a novel sentence-level mapping process from Wikidata statements to English Wikipedia sentences, enhancing data-to-text generation methods.

Findings

01. Effective organization of statements as quadruples and triples

02. Improved sentence structure analysis and noise filtering

03. Insights into relationships between sentence components

Abstract

Acknowledged as one of the most successful online cooperative projects in human society, Wikipedia has grown rapidly in recent years and continually seeks to expand its content and disseminate knowledge to everyone globally. A shortage of volunteers causes many problems for Wikipedia, including the difficulty of developing content for its more than 300 languages. Machines that automatically generate content could therefore considerably reduce human effort on Wikipedia language projects. In this paper, we propose a mapping process for the task of converting Wikidata statements to natural language text (WS2T) for Wikipedia projects at the sentence level. The main step is to organize statements, represented as a group of quadruples and triples, and then to map them to corresponding sentences in English Wikipedia. We evaluate the output corpus in various aspects:…
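
To make the idea of organizing statements as triples and quadruples more concrete, here is a minimal Python sketch. It is an illustration only, not the authors' pipeline: the `Statement` class, the `matches` and `map_statements` helpers, and the substring-matching rule are all assumptions introduced here. A triple is taken to be (subject, property, value) and a quadruple adds a qualifier; in Wikidata a qualifier is itself a property-value pair, but the sketch keeps only the qualifier's value label for brevity.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Statement:
    """Hypothetical container for one Wikidata statement (labels, not IDs)."""
    subject: str
    prop: str
    value: str
    qualifier: Optional[str] = None  # qualifier value label, if any

    def as_tuple(self) -> Tuple[str, ...]:
        # Triple when there is no qualifier, quadruple otherwise.
        base = (self.subject, self.prop, self.value)
        return base if self.qualifier is None else base + (self.qualifier,)


def matches(statement: Statement, sentence: str) -> bool:
    """Assumed rule: subject, value, and qualifier labels all occur in the sentence."""
    text = sentence.lower()
    labels = [statement.subject, statement.value]
    if statement.qualifier is not None:
        labels.append(statement.qualifier)
    return all(label.lower() in text for label in labels)


def map_statements(
    statements: List[Statement], sentences: List[str]
) -> Dict[str, List[Statement]]:
    """Group, per sentence, the statements whose labels it mentions."""
    return {s: [st for st in statements if matches(st, s)] for s in sentences}


# Example data: one triple and one quadruple about the same item.
birth = Statement("Albert Einstein", "place of birth", "Ulm")
award = Statement(
    "Albert Einstein", "award received", "Nobel Prize in Physics", qualifier="1921"
)
sentences = [
    "Albert Einstein was born in Ulm.",
    "Albert Einstein received the Nobel Prize in Physics in 1921.",
]
mapping = map_statements([birth, award], sentences)

for sentence, matched in mapping.items():
    print(sentence, [st.as_tuple() for st in matched])
```

Plain substring matching is the simplest possible stand-in for the mapping step; a realistic system would also need alias handling, entity linking, and the noise filtering that the findings mention.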

Taxonomy

Topics: Wikis in Education and Collaboration · Natural Language Processing Techniques · Topic Modeling