Entity Type Recognition using an Ensemble of Distributional Semantic Models to Enhance Query Understanding
Walid Shalaby, Khalifeh Al Jadda, Mohammed Korayem, Trey Grainger

TL;DR
This paper introduces an ensemble method combining multiple distributional semantic models to accurately identify entity types in short search queries within the recruitment domain, significantly improving query understanding.
Contribution
It presents a novel ensemble approach that integrates encyclopedic, job posting, linguistic, and ontological data for entity recognition in search queries, outperforming existing models.
Findings
Achieves 97% F1 score on real-world recruitment data.
Outperforms state-of-the-art Wikipedia-based word2vec models.
Effectively combines diverse semantic sources for improved entity classification.
Abstract
We present an ensemble approach for categorizing search query entities in the recruitment domain. Understanding the types of entities expressed in a search query (Company, Skill, Job Title, etc.) enables more intelligent information retrieval based upon those entities compared to a traditional keyword-based search. Because search queries are typically very short, leveraging a traditional bag-of-words model to identify entity types would be inappropriate due to the lack of contextual information. Our approach instead combines clues from different sources of varying complexity in order to collect real-world knowledge about query entities. We employ distributional semantic representations of query entities through two models: 1) contextual vectors generated from encyclopedic corpora like Wikipedia, and 2) high dimensional word embedding vectors generated from millions of job postings using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
