Assessing LLM Response Quality in the Context of Technology-Facilitated Abuse
Vijay Prakash, Majed Almansoori, Donghan Hu, Rahul Chatterjee, Danny Yuxing Huang

TL;DR
This study evaluates the effectiveness of four large language models in responding to technology-facilitated abuse questions from survivors, using expert assessments and user feedback to identify strengths and limitations.
Contribution
First comprehensive expert and user evaluation of LLM responses to TFA-related questions, providing insights for future model development and domain-specific fine-tuning.
Findings
LLMs show potential but have notable limitations in TFA contexts.
Expert assessments highlight areas for response improvement.
User feedback emphasizes the importance of perceived actionability.
Abstract
Technology-facilitated abuse (TFA) is a pervasive form of intimate partner violence (IPV) that leverages digital tools to control, surveil, or harm survivors. While tech clinics are one of the reliable sources of support for TFA survivors, they face limitations due to staffing constraints and logistical barriers. As a result, many survivors turn to online resources for assistance. With the growing accessibility and popularity of large language models (LLMs), and increasing interest from IPV organizations, survivors may begin to consult LLM-based chatbots before seeking help from tech clinics. In this work, we present the first expert-led manual evaluation of four LLMs - two widely used general-purpose non-reasoning models and two domain-specific models designed for IPV contexts - focused on their effectiveness in responding to TFA-related questions. Using real-world questions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntimate Partner and Family Violence · Digital Mental Health Interventions · AI in Service Interactions
