TWeddit : A Dataset of Triggering Stories Predominantly Shared by Women on Reddit

Shirlene Rose Bandela; Sanjeev Parthasarathy; Vaibhav Garg

arXiv:2601.11819·cs.CL·January 21, 2026

TWeddit : A Dataset of Triggering Stories Predominantly Shared by Women on Reddit

Shirlene Rose Bandela, Sanjeev Parthasarathy, Vaibhav Garg

PDF

Open Access

TL;DR

This paper introduces TWeddit, a curated dataset of Reddit stories predominantly shared by women that are related to triggering experiences like miscarriage and sexual violence, aiming to facilitate research on sensitive social media content.

Contribution

The paper presents a novel, annotated dataset of triggering stories from Reddit, addressing the lack of such datasets and enabling future research on sensitive online narratives.

Findings

01

Stories express distinct topics and moral foundations.

02

Annotated stories reveal linguistic patterns related to triggering experiences.

03

Dataset supports diverse research on social media and mental health.

Abstract

Warning: This paper may contain examples and topics that may be disturbing to some readers, especially survivors of miscarriage and sexual violence. People affected by abortion, miscarriage, or sexual violence often share their experiences on social media to express emotions and seek support. On public platforms like Reddit, where users can post long, detailed narratives (up to 40,000 characters), readers may be exposed to distressing content. Although Reddit allows manual trigger warnings, many users omit them due to limited awareness or uncertainty about which categories apply. There is scarcity of datasets on Reddit stories labeled for triggering experiences. We propose a curated Reddit dataset, TWeddit, covering triggering experiences related to issues majorly faced by women. Our linguistic analyses show that annotated stories in TWeddit express distinct topics and moral…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Digital Communication and Language · Hate Speech and Cyberbullying Detection