Ranking Entity Based on Both of Word Frequency and Word Sematic Features
Xiao-Bo Jin, Guang-Gang Geng, Kaizhu Huang, Zhi-Wei Yan

TL;DR
This paper presents a ranking method for entity search that combines word frequency and semantic features, demonstrating high performance in Baidu Cup 2016 Challenge across multiple entity types.
Contribution
The paper introduces a novel set of similarity features based on word frequency and semantics, along with a ranking architecture that achieved top results in a competitive challenge.
Findings
Achieved first place in Baidu Cup 2016 Challenge for entity search
Developed effective similarity features combining frequency and semantics
Demonstrated superior ranking performance across multiple entity types
Abstract
Entity search is a new application meeting either precise or vague requirements from the search engines users. Baidu Cup 2016 Challenge just provided such a chance to tackle the problem of the entity search. We achieved the first place with the average MAP scores on 4 tasks including movie, tvShow, celebrity and restaurant. In this paper, we propose a series of similarity features based on both of the word frequency features and the word semantic features and describe our ranking architecture and experiment details.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Web Data Mining and Analysis · Data Quality and Management
