WellDunn: On the Robustness and Explainability of Language Models and   Large Language Models in Identifying Wellness Dimensions

Seyedali Mohammadi; Edward Raff; Jinendra Malekar; Vedant Palit,; Francis Ferraro; and Manas Gaur

arXiv:2406.12058·cs.AI·December 5, 2024

WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying Wellness Dimensions

Seyedali Mohammadi, Edward Raff, Jinendra Malekar, Vedant Palit,, Francis Ferraro, and Manas Gaur

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the robustness and explainability of language models in identifying wellness dimensions, revealing significant gaps in attention fidelity and domain-specific knowledge, which impact their clinical trustworthiness.

Contribution

It introduces an evaluation framework for assessing LMs' robustness and explainability in mental health, highlighting critical shortcomings in attention alignment and domain knowledge.

Findings

01

GPT-3.5/4 lag behind RoBERTa and MedAlpaca in performance and explanations.

02

Confidence-based re-evaluation causes performance drops.

03

Attention-explanation alignment is very low across models.

Abstract

Language Models (LMs) are being proposed for mental health applications where the heightened risk of adverse outcomes means predictive performance may not be a sufficient litmus test of a model's utility in clinical practice. A model that can be trusted for practice should have a correspondence between explanation and clinical determination, yet no prior research has examined the attention fidelity of these models and their effect on ground truth explanations. We introduce an evaluation design that focuses on the robustness and explainability of LMs in identifying Wellness Dimensions (WDs). We focus on two existing mental health and well-being datasets: (a) Multi-label Classification-based MultiWD, and (b) WellXplain for evaluating attention mechanism veracity against expert-labeled explanations. The labels are based on Halbert Dunn's theory of wellness, which gives grounding to our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vedantpalit/WellDunn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · WordPiece · Linear Warmup With Linear Decay · Cosine Annealing · BERT · Residual Connection · Softmax · RoBERTa · Layer Normalization