Gendered Words and Grant Rates: A Textual Analysis of Disparate Outcomes in the Patent System
Deborah Gerhardt, Miriam Marcowitz-Bitton, W. Michael Schuster,, Avshalom Elmalech, Omri Suissa, Moshe Mash

TL;DR
This study uses machine learning and NLP to analyze patent texts, revealing gendered linguistic patterns, predicting patent grants with over 60% accuracy, and showing women often invent in higher rejection fields, raising concerns about gender bias.
Contribution
It demonstrates that gender can be inferred from patent text, predicts patent outcomes based on textual features, and uncovers gender disparities across technological areas.
Findings
Gender can be identified from patent text without names.
Textual features influence patent grant predictions more than gender.
Women tend to invent in fields with higher rejection rates.
Abstract
Text is a vehicle to convey information that reflects the writer's linguistic style and communicative patterns. By studying these attributes, we can discover latent insights about the author and their underlying message. This article uses such an approach to better understand patent applications and their inventors. While prior research focuses on patent metadata, we employ machine learning and natural language processing to extract hidden information from the words in patent applications. Through these methods, we find that inventor gender can often be identified from textual attributes - even without knowing the inventor's name. This ability to discern gender through text suggests that anonymized patent examination - often proposed as a solution to mitigate disparities in patent grant rates - may not fully address gendered outcomes in securing a patent. Our study also investigates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsResearch, Science, and Academia · scientometrics and bibliometrics research
