Studying Taxonomy Enrichment on Diachronic WordNet Versions

Irina Nikishina; Alexander Panchenko; Varvara Logacheva; Natalia Loukachevitch

arXiv:2011.11536·cs.CL·November 24, 2020

TL;DR

This paper investigates methods for enriching and extending taxonomies like WordNet across multiple languages, focusing on resource-poor settings and providing new datasets for English and Russian.

Contribution

It introduces scalable taxonomy enrichment techniques applicable to many languages and presents novel datasets for English and Russian to evaluate these methods.

Findings

01. Developed new datasets for English and Russian taxonomy enrichment
02. Proposed methods suitable for resource-scarce language settings
03. Facilitated taxonomy maintenance and extension in NLP applications

Abstract

Ontologies, taxonomies, and thesauri are used in many NLP tasks. However, most studies are focused on the creation of these lexical resources rather than the maintenance of the existing ones. Thus, we address the problem of taxonomy enrichment. We explore the possibilities of taxonomy extension in a resource-poor setting and present methods which are applicable to a large number of languages. We create novel English and Russian datasets for training and evaluating taxonomy enrichment models and describe a technique of creating such datasets for other languages.
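
The abstract does not spell out how the datasets are built. One plausible reading of the title is that two versions of the same WordNet are compared, and synsets present only in the later version become the items to attach, with their hypernyms as gold answers. The sketch below illustrates that idea under this assumption only. The `Taxonomy` type, the function `build_enrichment_dataset`, and the toy synset names are hypothetical and are not taken from the authors' repository.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical representation: synset id -> set of direct hypernym ids.
Taxonomy = Dict[str, Set[str]]


def build_enrichment_dataset(
    older: Taxonomy, newer: Taxonomy
) -> List[Tuple[str, List[str]]]:
    """Pair each synset added in `newer` with its gold hypernyms.

    Only hypernyms that already exist in `older` are kept, since a
    model enriching the older taxonomy can only attach to known nodes.
    New synsets with no such hypernym are skipped.
    """
    dataset = []
    for synset, hypernyms in sorted(newer.items()):
        if synset in older:
            continue  # not a new entry, so nothing to predict
        attachable = sorted(h for h in hypernyms if h in older)
        if attachable:
            dataset.append((synset, attachable))
    return dataset


# Toy example with made-up synsets (not real WordNet data).
older = {"entity": set(), "animal": {"entity"}, "dog": {"animal"}}
newer = {
    "entity": set(),
    "animal": {"entity"},
    "dog": {"animal"},
    "puppy": {"dog"},          # new, attaches to an existing node
    "gadget": {"entity"},      # new, attaches to an existing node
    "robot_dog": {"gadget"},   # new, but its parent is also new
}
dataset = build_enrichment_dataset(older, newer)
```

Here `dataset` holds the pairs for `gadget` and `puppy`, while `robot_dog` is dropped because its only hypernym does not exist in the older version. Whether the paper handles such cases this way is not stated in the abstract.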

Code & Models

Repositories

skoltech-nlp/diachronic-wordnets (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling