TWeddit : A Dataset of Triggering Stories Predominantly Shared by Women on Reddit
Shirlene Rose Bandela, Sanjeev Parthasarathy, Vaibhav Garg

TL;DR
This paper introduces TWeddit, a curated dataset of Reddit stories predominantly shared by women that are related to triggering experiences like miscarriage and sexual violence, aiming to facilitate research on sensitive social media content.
Contribution
The paper presents a novel, annotated dataset of triggering stories from Reddit, addressing the lack of such datasets and enabling future research on sensitive online narratives.
Findings
Stories express distinct topics and moral foundations.
Annotated stories reveal linguistic patterns related to triggering experiences.
Dataset supports diverse research on social media and mental health.
Abstract
Warning: This paper may contain examples and topics that may be disturbing to some readers, especially survivors of miscarriage and sexual violence. People affected by abortion, miscarriage, or sexual violence often share their experiences on social media to express emotions and seek support. On public platforms like Reddit, where users can post long, detailed narratives (up to 40,000 characters), readers may be exposed to distressing content. Although Reddit allows manual trigger warnings, many users omit them due to limited awareness or uncertainty about which categories apply. There is scarcity of datasets on Reddit stories labeled for triggering experiences. We propose a curated Reddit dataset, TWeddit, covering triggering experiences related to issues majorly faced by women. Our linguistic analyses show that annotated stories in TWeddit express distinct topics and moral…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Digital Communication and Language · Hate Speech and Cyberbullying Detection
