Summarizing User-generated Textual Content: Motivation and Methods for   Fairness in Algorithmic Summaries

Abhisek Dash; Anurag Shandilya; Arindam Biswas; Kripabandhu Ghosh,; Saptarshi Ghosh; Abhijnan Chakraborty

arXiv:1810.09147·cs.IR·September 4, 2019

Summarizing User-generated Textual Content: Motivation and Methods for Fairness in Algorithmic Summaries

Abhisek Dash, Anurag Shandilya, Arindam Biswas, Kripabandhu Ghosh,, Saptarshi Ghosh, Abhijnan Chakraborty

PDF

1 Repo

TL;DR

This paper highlights the importance of fairness in user-generated text summarization, revealing biases in existing algorithms and proposing new methods to ensure equitable representation of diverse social groups.

Contribution

It introduces the first framework for fair summarization of user-generated content, addressing social group biases and proposing fairness-preserving algorithms.

Findings

01

Existing algorithms often under-represent certain social groups.

02

Fairness issues lead to unequal exposure of groups in summaries.

03

Proposed algorithms improve fairness without sacrificing summary quality.

Abstract

As the amount of user-generated textual content grows rapidly, text summarization algorithms are increasingly being used to provide users a quick overview of the information content. Traditionally, summarization algorithms have been evaluated only based on how well they match human-written summaries (e.g. as measured by ROUGE scores). In this work, we propose to evaluate summarization algorithms from a completely new perspective that is important when the user-generated data to be summarized comes from different socially salient user groups, e.g. men or women, Caucasians or African-Americans, or different political groups (Republicans or Democrats). In such cases, we check whether the generated summaries fairly represent these different social groups. Specifically, considering that an extractive summarization algorithm selects a subset of the textual units (e.g. microblogs) in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ad93/FairSumm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.