Deep Learning Mental Health Dialogue System
Lennart Brocki, George C. Dyer, Anna G{\l}adka, Neo Christopher Chung

TL;DR
This paper introduces Serena, a deep learning-based mental health dialogue system that aims to improve access to counseling by providing empathetic, engaging responses, while addressing challenges like coherence and hallucination.
Contribution
The paper presents a novel large-scale transformer-based dialogue system for mental health support, including post-processing algorithms to enhance response quality.
Findings
Serena can generate empathetic and engaging responses.
The system demonstrates potential as a low-cost supplement to human counseling.
Challenges like hallucination and incoherence are identified and partially addressed.
Abstract
Mental health counseling remains a major challenge in modern society due to cost, stigma, fear, and unavailability. We posit that generative artificial intelligence (AI) models designed for mental health counseling could help improve outcomes by lowering barriers to access. To this end, we have developed a deep learning (DL) dialogue system called Serena. The system consists of a core generative model and post-processing algorithms. The core generative model is a 2.7 billion parameter Seq2Seq Transformer fine-tuned on thousands of transcripts of person-centered-therapy (PCT) sessions. The series of post-processing algorithms detects contradictions, improves coherency, and removes repetitive answers. Serena is implemented and deployed on \url{https://serena.chat}, which currently offers limited free services. While the dialogue system is capable of responding in a qualitatively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Digital Mental Health Interventions · Machine Learning in Healthcare
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Sigmoid Activation · Linear Layer · Absolute Position Encodings · Tanh Activation
