Is On-Device AI Broken and Exploitable? Assessing the Trust and Ethics in Small Language Models
Kalyan Nakka, Jimmy Dani, Nitesh Saxena

TL;DR
This study evaluates the trustworthiness and ethical safety of small language models on personal devices, revealing significant vulnerabilities, biases, and potential for misuse compared to cloud-based models.
Contribution
First comprehensive assessment of trust and ethics in on-device small language models, highlighting their vulnerabilities and risks for the first time.
Findings
On-device SLMs are less trustworthy than server-based models.
On-device SLMs lack ethical safeguards, generating harmful content.
On-device SLMs can be exploited with simple prompts to produce unethical responses.
Abstract
In this paper, we present a very first study to investigate trust and ethical implications of on-device artificial intelligence (AI), focusing on small language models (SLMs) amenable for personal devices like smartphones. While on-device SLMs promise enhanced privacy, reduced latency, and improved user experience compared to cloud-based services, we posit that they might also introduce significant risks and vulnerabilities compared to their on-server counterparts. As part of our trust assessment study, we conduct a systematic evaluation of the state-of-the-art on-devices SLMs, contrasted to their on-server counterparts, based on a well-established trustworthiness measurement framework. Our results show on-device SLMs to be significantly less trustworthy, specifically demonstrating more stereotypical, unfair and privacy-breaching behavior. Informed by these findings, we then perform our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Artificial Intelligence in Healthcare and Education
