WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying Wellness Dimensions
Seyedali Mohammadi, Edward Raff, Jinendra Malekar, Vedant Palit,, Francis Ferraro, and Manas Gaur

TL;DR
This paper evaluates the robustness and explainability of language models in identifying wellness dimensions, revealing significant gaps in attention fidelity and domain-specific knowledge, which impact their clinical trustworthiness.
Contribution
It introduces an evaluation framework for assessing LMs' robustness and explainability in mental health, highlighting critical shortcomings in attention alignment and domain knowledge.
Findings
GPT-3.5/4 lag behind RoBERTa and MedAlpaca in performance and explanations.
Confidence-based re-evaluation causes performance drops.
Attention-explanation alignment is very low across models.
Abstract
Language Models (LMs) are being proposed for mental health applications where the heightened risk of adverse outcomes means predictive performance may not be a sufficient litmus test of a model's utility in clinical practice. A model that can be trusted for practice should have a correspondence between explanation and clinical determination, yet no prior research has examined the attention fidelity of these models and their effect on ground truth explanations. We introduce an evaluation design that focuses on the robustness and explainability of LMs in identifying Wellness Dimensions (WDs). We focus on two existing mental health and well-being datasets: (a) Multi-label Classification-based MultiWD, and (b) WellXplain for evaluating attention mechanism veracity against expert-labeled explanations. The labels are based on Halbert Dunn's theory of wellness, which gives grounding to our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · WordPiece · Linear Warmup With Linear Decay · Cosine Annealing · BERT · Residual Connection · Softmax · RoBERTa · Layer Normalization
