It Runs in the Family: Searching for Synonyms Using Digitized Family Trees
Aviad Elyashar, Rami Puzis, Michael Fire

TL;DR
This paper introduces GRAFT, a novel graph-based algorithm utilizing genealogical data to improve synonym suggestion for names, outperforming existing methods in accuracy and scalability.
Contribution
GRAFT is a new algorithm that leverages digitized family trees and network algorithms to enhance name synonym suggestion beyond traditional pattern matching methods.
Findings
GRAFT outperforms 10 existing algorithms in synonym suggestion accuracy.
GRAFT effectively handles large-scale genealogical datasets with over 16 million profiles.
The approach improves synonym suggestion for both forenames and surnames.
Abstract
Searching for a person's name is a common online activity. However, Web search engines provide few accurate results to queries containing names. In contrast to a general word which has only one correct spelling, there are several legitimate spellings of a given name. Today, most techniques used to suggest synonyms in online search are based on pattern matching and phonetic encoding, however they often perform poorly. As a result, there is a need for an effective tool for improved synonym suggestion. In this paper, we propose a revolutionary approach for tackling the problem of synonym suggestion. Our novel algorithm, GRAFT, utilizes historical data collected from genealogy websites, along with network algorithms. GRAFT is a general algorithm that suggests synonyms using a graph based on names derived from digitized ancestral family trees. Synonyms are extracted from this graph, which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Natural Language Processing Techniques · Topic Modeling
