Extracting Incidents, Effects, and Requested Advice from MeToo Posts
Vaibhav Garg, Jiaqing Yuan, Rujie Xi, and Munindar P. Singh

TL;DR
This paper presents a natural language processing model to extract incident, effect, and advice-seeking information from Reddit MeToo posts, aiding helpers in understanding survivors' needs efficiently.
Contribution
It introduces a novel sentence classification model for MeToo posts and provides a large labeled dataset, MeThree, for further research.
Findings
Model achieves macro F1 score of 0.82 on cross-validation.
MeThree dataset contains 8,947 labeled sentences.
LIWC-22 analysis reveals language differences across categories.
Abstract
Survivors of sexual harassment frequently share their experiences on social media, revealing their feelings and emotions and seeking advice. We observed that on Reddit, survivors regularly share long posts that describe a combination of (i) a sexual harassment incident, (ii) its effect on the survivor, including their feelings and emotions, and (iii) the advice being sought. We term such posts MeToo posts, even though they may not be so tagged and may appear in diverse subreddits. A prospective helper (such as a counselor or even a casual reader) must understand a survivor's needs from such posts. But long posts can be time-consuming to read and respond to. Accordingly, we address the problem of extracting key information from a long MeToo post. We develop a natural language-based model to identify sentences from a post that describe any of the above three categories. On ten-fold…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Cancer-related gene regulation · Topic Modeling
