Cross-Lingual Fine-Grained Entity Typing
Nila Selvaraj, Yasumasa Onoe, and Greg Durrett

TL;DR
This paper introduces a cross-lingual fine-grained entity typing model that generalizes to over 100 languages, including unseen ones, and demonstrates superior performance over baselines in predicting entity types across languages and entities.
Contribution
The paper presents a unified model capable of handling over 100 languages for entity typing, with a focus on generalization to unseen languages and entities, outperforming simple baselines.
Findings
Outperforms baselines on unseen languages like Japanese, Tamil, Arabic, Serbian, and Persian.
Significantly improves entity type prediction for unseen entities, even in unseen languages.
Human evaluation confirms strong relevance of predicted types in challenging settings.
Abstract
The growth of cross-lingual pre-trained models has enabled NLP tools to rapidly generalize to new languages. While these models have been applied to tasks involving entities, their ability to explicitly predict typological features of these entities across languages has not been established. In this paper, we present a unified cross-lingual fine-grained entity typing model capable of handling over 100 languages and analyze this model's ability to generalize to languages and entities unseen during training. We train this model on cross-lingual training data collected from Wikipedia hyperlinks in multiple languages (training languages). During inference, our model takes an entity mention and context in a particular language (test language, possibly not in the training languages) and predicts fine-grained types for that entity. Generalizing to new languages and unseen entities are the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
