LaMSUM: Amplifying Voices Against Harassment through LLM Guided Extractive Summarization of User Incident Reports

Garima Chhikara; Anurag Sharma; V. Gurucharan; Kripabandhu Ghosh; Abhijnan Chakraborty

arXiv:2406.15809·cs.CL·April 20, 2026·3 cites

LaMSUM: Amplifying Voices Against Harassment through LLM Guided Extractive Summarization of User Incident Reports

Garima Chhikara, Anurag Sharma, V. Gurucharan, Kripabandhu Ghosh, Abhijnan Chakraborty

PDF

TL;DR

LaMSUM is a novel multi-level framework that leverages Large Language Models to generate extractive summaries of user incident reports, aiding harassment prevention efforts.

Contribution

Introduces LaMSUM, the first approach to extractive summarization using LLMs, combining summarization and voting methods for large incident report collections.

Findings

01

LaMSUM outperforms existing extractive summarization methods.

02

Evaluated on four popular LLMs including GPT-4o.

03

Effectively processes large, code-mixed language incident reports.

Abstract

Citizen reporting platforms help the public and authorities stay informed about sexual harassment incidents. However, the high volume of data shared on these platforms makes reviewing each individual case challenging. Therefore, a summarization algorithm capable of processing and understanding various code-mixed languages is essential. In recent years, Large Language Models (LLMs) have shown exceptional performance in NLP tasks, including summarization. LLMs inherently produce abstractive summaries by paraphrasing the original text, while the generation of extractive summaries - selecting specific subsets from the original text - through LLMs remains largely unexplored. Moreover, LLMs have a limited context window size, restricting the amount of data that can be processed at once. We tackle these challenges by introducing LaMSUM, a novel multi-level framework combining summarization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.