Ranking Entity Based on Both of Word Frequency and Word Sematic Features

Xiao-Bo Jin; Guang-Gang Geng; Kaizhu Huang; Zhi-Wei Yan

arXiv:1608.01068·cs.IR·August 4, 2016

Ranking Entity Based on Both of Word Frequency and Word Sematic Features

Xiao-Bo Jin, Guang-Gang Geng, Kaizhu Huang, Zhi-Wei Yan

PDF

Open Access

TL;DR

This paper presents a ranking method for entity search that combines word frequency and semantic features, demonstrating high performance in Baidu Cup 2016 Challenge across multiple entity types.

Contribution

The paper introduces a novel set of similarity features based on word frequency and semantics, along with a ranking architecture that achieved top results in a competitive challenge.

Findings

01

Achieved first place in Baidu Cup 2016 Challenge for entity search

02

Developed effective similarity features combining frequency and semantics

03

Demonstrated superior ranking performance across multiple entity types

Abstract

Entity search is a new application meeting either precise or vague requirements from the search engines users. Baidu Cup 2016 Challenge just provided such a chance to tackle the problem of the entity search. We achieved the first place with the average MAP scores on 4 tasks including movie, tvShow, celebrity and restaurant. In this paper, we propose a series of similarity features based on both of the word frequency features and the word semantic features and describe our ranking architecture and experiment details.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Web Data Mining and Analysis · Data Quality and Management