Generating Concept Lexicalizations via Dictionary-Based Cross-Lingual Sense Projection

David Basil; Chirooth Girigowda; Bradley Hauer; Sahir Momin; Ning Shi; Grzegorz Kondrak

arXiv:2604.14397·cs.CL·April 17, 2026

Generating Concept Lexicalizations via Dictionary-Based Cross-Lingual Sense Projection

David Basil, Chirooth Girigowda, Bradley Hauer, Sahir Momin, Ning Shi, Grzegorz Kondrak

PDF

TL;DR

This paper presents a method for expanding lexical resources into new languages by projecting English senses onto target languages using a bilingual dictionary-enhanced aligner, improving precision and interpretability.

Contribution

The authors introduce a dictionary-augmented sense projection method that enhances cross-lingual lexical resource expansion with minimal external resources.

Findings

01

The method improves projection precision over prior approaches.

02

It remains interpretable and resource-efficient.

03

Evaluation across multiple languages demonstrates effectiveness.

Abstract

We study the task of automatically expanding WordNet-style lexical resources to new languages through sense generation. We generate senses by associating target-language lemmas with existing lexical concepts via semantic projection. Given a sense-tagged English corpus and its translation, our method projects English synsets onto aligned target-language tokens and assigns the corresponding lemmas to those synsets. To generate these alignments and ensure their quality, we augment a pre-trained base aligner with a bilingual dictionary, which is also used to filter out incorrect sense projections. We evaluate the method on multiple languages, comparing it to prior methods, as well as dictionary-based and large language model baselines. Results show that the proposed project-and-filter strategy improves precision while remaining interpretable and requiring few external resources. We plan to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.