INCLUSIFY: A benchmark and a model for gender-inclusive German

David Pomerenke

arXiv:2212.02564·cs.CL·December 7, 2022

INCLUSIFY: A benchmark and a model for gender-inclusive German

David Pomerenke

PDF

Open Access

TL;DR

This paper introduces INCLUSIFY, a benchmark and model for detecting and suggesting gender-inclusive language in German, aiming to promote gender equality through NLP tools.

Contribution

It defines NLP tasks for gender-inclusive language, provides a dataset and benchmark, and presents a model combining a database with pre-trained models for improved identification and reformulation.

Findings

01

Recall of 0.89 and precision of 0.82 in identifying exclusive language

02

44% of real-world texts had a top suggestion chosen

03

Proposes future directions with end-to-end models and large language models

Abstract

Gender-inclusive language is important for achieving gender equality in languages with gender inflections, such as German. While stirring some controversy, it is increasingly adopted by companies and political institutions. A handful of tools have been developed to help people use gender-inclusive language by identifying instances of the generic masculine and providing suggestions for more inclusive reformulations. In this report, we define the underlying tasks in terms of natural language processing, and present a dataset and measures for benchmarking them. We also present a model that implements these tasks, by combining an inclusive language database with an elaborate sequence of processing steps via standard pre-trained models. Our model achieves a recall of 0.89 and a precision of 0.82 in our benchmark for identifying exclusive language; and one of its top five suggestions is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGender Studies in Language