Disambiguation of Patent Inventors and Assignees Using High-Resolution Geolocation Data
Greg Morrison, Massimo Riccaboni, and Fabio Pammolli

TL;DR
This paper introduces a high-resolution geolocation-based algorithm for disambiguating patent inventors and assignees, significantly improving accuracy and coverage across major patent databases.
Contribution
The paper presents a novel algorithm that uses high-resolution geolocation data to disambiguate inventors and assignees, achieving high precision and broad coverage.
Findings
High accuracy in disambiguating assignees globally.
Competitive inventor disambiguation performance.
Broadest simultaneous disambiguation across major patent collections.
Abstract
Patent data represent a significant source of information on innovation and the evolution of technology through networks of citations, co-invention and co-assignment of new patents. A major obstacle to extracting useful information from this data is the problem of name disambiguation: linking alternate spellings of individuals or institutions to a single identifier to uniquely determine the parties involved in the creation of a technology. In this paper, we describe a new algorithm that uses high-resolution geolocation to disambiguate both inventor and assignees on more than 3.6 million patents found in the European Patent Office (EPO), under the Patent Cooperation treaty (PCT), and in the US Patent and Trademark Office (USPTO). We show that our algorithm has both high precision and recall in comparison to a manual disambiguation of EPO assignee names in Boston and Paris, and show it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
