Editorial: Machine learning approaches for monitoring mental health and substance abuse using social media data
Danila Azzolina, Erica Secchettin, Paola Berchialla, Tingshao Zhu, Aslihan Şentürk Acar, Dario Gregori

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Digital Mental Health Interventions · Social Media in Health Education
The rapid growth of social media platforms has changed how individuals express and share personal habits and experiences, including those related to mental health and substance use (1). In parallel, advances in machine learning (ML) have provided powerful tools for analyzing this rich and abundant data source, creating new possibilities for public health research (2).
This Research Topic brings together several studies that explore the intersection of these two domains, employing ML techniques to monitor, analyze, and predict mental health conditions and substance abuse patterns using social media data, illustrating how innovative approaches could impact public health monitoring through the lens of social media data.
Mental health perspective
The mental health-focused articles in this collection emphasize how machine learning and social media data can be harnessed to monitor public attitudes, predict negative mental health outcomes, and explore the potential of AI-driven tools in improving access to mental health care, offering new perspectives on early intervention and support strategies. In this framework, Li tackles the prediction of negative attitudes toward suicide on the Chinese social media platform Sina Weibo. Combining psycholinguistic features with machine learning models, the study provides a framework for identifying public attitudes toward suicide. Unlike previous research that focused primarily on stigmatizing language, this study covers a broader range of negative attitudes, such as dismissive and indifferent tones, offering a more in-depth understanding of public discourse. The model's high predictive accuracy, combined with its rigorous external validation, underlines the importance of incorporating diverse forms of negative attitudes into predictive frameworks for mental health monitoring (Li). This approach is important for reducing stigma and promoting informed, compassionate conversations around the mental health.
Integration of artificial intelligence into mental health care is further illustrated by Grosshans et al., who present a case study on the use of conversation-based AI (CAI) for managing anxiety disorders. The patient in this study initially uses a chatbot for self-management of social anxiety, gradually transitioning to human-based psychiatric care. This case highlights how technology can lower the threshold for seeking traditional medical assistance, particularly for individuals who might otherwise avoid professional help due to the stigma or discomfort. The study also emphasizes the importance of blending AI-driven support with human expertise to ensure safety and effectiveness in care (Grosshans et al.). While the chatbot provides valuable psychoeducation and therapeutic suggestions, the human intervention helps to validate these strategies and offers ongoing guidance, creating a complementary care model.
Substance use perspective
The articles on substance use highlight the power of machine learning to analyze social media discussions, providing real-time insights into public perceptions, regional patterns, and emerging trends related to tobacco consumption and drug safety, ultimately supporting more targeted public health interventions.
An important theme emerging from these studies is the use of natural language processing (NLP) to analyze unstructured text data from social media. For example, Dong et al. developed a BERT-based language model to extract drug-related adverse events from social media, addressing the limitations of traditional pharmacovigilance systems, such as the FDA's Adverse Event Reporting System (FAERS). Their findings highlight the potential of social media to complement existing surveillance systems by offering immediate insights into adverse drug reactions. The model achieves high F1 scores in detecting adverse event mentions across multiple datasets, demonstrating its robustness and practical utility in pharmacovigilance practices (Dong et al.). This approach not only facilitates real-time monitoring capabilities but also bridges the temporal gap inherent in conventional reporting systems, offering a more proactive response to emerging safety issues.
Similarly, Castillo-Toledo et al. explore the content of tobacco-related tweets, providing regional insights into public perceptions and user opinions. Their study analyzes more than 56,000 tweets in English and Spanish, revealing that healthcare professionals are the most frequent contributors to the discussion, often promoting accurate and evidence-based health information. The geolocation data revealed regional disparities, with Africa showing the lowest engagement in discussions about the health risks of tobacco. This finding aligns with the continent's relatively underdeveloped anti-tobacco policies, emphasizing the importance of geolocational analyses for identifying public health gaps and tailoring interventions accordingly (Castillo-Toledo et al.). This work demonstrates how social media can serve as a mirror of public health awareness and policy impact across different regions.
Together, these studies reflect the potential of machine learning applications to social media mining to facilitate public health monitoring and intervention strategies. Using social media data, researchers can gain real-time insights into public health trends, track evolving behaviors, and detect emerging risks.
Challenges and future directions
The use of social media data comes with its own set of challenges. Data quality and representativeness remain significant issues, as social media content is often noisy and culturally biased. This can affect the accuracy and generalizability of machine learning models (3). Furthermore, ethical considerations around privacy, informed consent, and the potential misuse of predictive models must be carefully addressed to ensure responsible and fair applications (4).
Future research should prioritize the development of adaptive models capable of handling the dynamic nature of social media content while maintaining transparency and fairness in predictions. Collaborative efforts between data scientists, public health professionals, and policymakers are important for translating these technological advances into actionable strategies that can improve health outcomes. Moreover, integrating machine learning insights into existing health infrastructures will require the development of clear ethical guidelines and standardized protocols.
In conclusion, the contributions to this Research Topic offer a snapshot of how machine learning reshapes public health surveillance through social media. From monitoring adverse drug reactions to analyze public attitudes toward suicide and integrating AI into mental health care, these studies highlight the transformative potential of combining computational methods with digital data sources. We hope that this collection inspires further interdisciplinary collaboration and innovation, ultimately contributing to more effective, data-driven public health solutions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Naslund JA Bondre A Torous J Aschbrenner KA. Social media and mental health: benefits, risks, and opportunities for research and practice. J Technol Behav Sci. (2020) 5:245–57. 10.1007/s 41347-020-00134-x 33415185 PMC 7785056 · doi ↗ · pubmed ↗
- 2Olawade DB Wada OJ David-Olawade AC Kunonga E Abaire O Ling J. Using artificial intelligence to improve public health: a narrative review. Front Public Health. (2023) 11:1196397. 10.3389/fpubh.2023.119639737954052 PMC 10637620 · doi ↗ · pubmed ↗
- 3Srivastava AK Mishra R. Analyzing Social Media Research: A Data quality and research reproducibility perspective. IIM Kozhikode Soc Manag Rev. (2023) 12:39–49. 10.1177/22779752211011810 · doi ↗
- 4Harishbhai Tilala M Kumar Chenchala P Choppadandi A Kaur J Naguri S Saoji R. Ethical considerations in the use of artificial intelligence and machine learning in health care: a comprehensive review. Cureus. (2024) 16:e 62443. 10.7759/cureus.6244339011215 PMC 11249277 · doi ↗ · pubmed ↗
