Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties [Extended Version]
Simon Razniewski, Vevake Balaraman, Werner Nutt

TL;DR
This paper introduces a human-annotated dataset for ranking knowledge base properties, analyzes factors influencing property importance, and develops a combined approach that improves prediction accuracy over baseline methods.
Contribution
It presents a new dataset of human preferences for property ranking, identifies key factors influencing importance, and proposes a combined method that outperforms existing techniques.
Findings
Humans show high agreement (87.5%) on property preferences.
Baseline models achieve only 61.3% precision.
Combining factors increases precision to 74%.
Abstract
In knowledge bases such as Wikidata, it is possible to assert a large set of properties for entities, ranging from generic ones such as name and place of birth to highly profession-specific or background-specific ones such as doctoral advisor or medical condition. Determining a preference or ranking in this large set is a challenge in tasks such as prioritisation of edits or natural-language generation. Most previous approaches to ranking knowledge base properties are purely data-driven, that is, as we show, mistake frequency for interestingness. In this work, we have developed a human-annotated dataset of 350 preference judgments among pairs of knowledge base properties for fixed entities. From this set, we isolate a subset of pairs for which humans show a high level of agreement (87.5% on average). We show, however, that baseline and state-of-the-art techniques achieve only 61.3%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
