Survey on English Entity Linking on Wikidata
Cedric M\"oller, Jens Lehmann, Ricardo Usbeck

TL;DR
This survey reviews Wikidata-based Entity Linking, highlighting current datasets, methods, and the underutilization of Wikidata's unique features, suggesting avenues for future research to improve linking quality.
Contribution
It systematically analyzes existing Wikidata Entity Linking datasets and approaches, identifying gaps and opportunities to leverage Wikidata's unique hyper-relational structure.
Findings
Current datasets do not exploit Wikidata's unique features.
Most approaches use Wikidata like other knowledge graphs.
Significant potential exists to improve methods by utilizing hyper-relational data.
Abstract
Wikidata is a frequently updated, community-driven, and multilingual knowledge graph. Hence, Wikidata is an attractive basis for Entity Linking, which is evident by the recent increase in published papers. This survey focuses on four subjects: (1) Which Wikidata Entity Linking datasets exist, how widely used are they and how are they constructed? (2) Do the characteristics of Wikidata matter for the design of Entity Linking datasets and if so, how? (3) How do current Entity Linking approaches exploit the specific characteristics of Wikidata? (4) Which Wikidata characteristics are unexploited by existing Entity Linking approaches? This survey reveals that current Wikidata-specific Entity Linking datasets do not differ in their annotation scheme from schemes for other knowledge graphs like DBpedia. Thus, the potential for multilingual and time-dependent datasets, naturally suited for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
