Detecting sub-populations in online health communities: A mixed-methods exploration of breastfeeding messages in BabyCenter Birth Clubs
Calla Beauregard, Parisa Suchdev, Ashley M. A. Fehr, Isabelle T. Smith, Tabia Tanzin Prama, Julia Witte Zimmerman, Carter Ward, Juniper Lovato, Christopher M. Danforth, and Peter Sheridan Dodds

TL;DR
This study analyzes over 5 million posts from BabyCenter birth clubs to identify sub-populations and themes, revealing increased anxiety-related discussions and dominant topics like sleep and work among breastfeeding groups.
Contribution
It introduces a mixed-methods approach combining BERTopic and LDA to detect and compare sub-populations and themes in large online health community datasets.
Findings
Anxiety-related posts increased steadily from 2017 to 2024.
Sleep is the dominant topic in breastfeeding-related content.
Breastfeeding groups focus more on sleep, anxiety, and work topics.
Abstract
Parental stress is a nationwide health crisis according to the U.S. Surgeon General's 2024 advisory. To allay stress, expecting parents seek advice and share experiences in a variety of venues, from in-person birth education classes and parenting groups to virtual communities, for example, BabyCenter, a moderated online forum community with over 4 million members in the United States alone. In this study, we aim to understand how parents talk about pregnancy, birth, and parenting by analyzing 5.43M posts and comments from the April 2017--January 2024 cohort of 331,843 BabyCenter "birth club" users (that is, users who participate in due date forums or "birth clubs" based on their babies' due dates). Using BERTopic to locate breastfeeding threads and LDA to summarize themes, we compare documents in breastfeeding threads to all other birth-club content. Analyzing time series of word rank,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
