Can Large Language Models Provide Security & Privacy Advice? Measuring   the Ability of LLMs to Refute Misconceptions

Yufan Chen; Arjun Arunasalam; Z. Berkay Celik

arXiv:2310.02431·cs.HC·October 5, 2023·1 cites

Can Large Language Models Provide Security & Privacy Advice? Measuring the Ability of LLMs to Refute Misconceptions

Yufan Chen, Arjun Arunasalam, Z. Berkay Celik

PDF

Open Access 1 Repo

TL;DR

This study evaluates the ability of large language models like Bard and ChatGPT to accurately refute common security and privacy misconceptions, revealing significant error rates and issues with source reliability.

Contribution

The paper systematically measures LLMs' effectiveness in correcting security misconceptions, highlighting their limitations and potential risks in providing trustworthy advice.

Findings

01

LLMs have an average 21.3% error rate in refuting misconceptions.

02

Error rate increases to 32.6% with paraphrased or repeated queries.

03

Models often provide invalid or unrelated source URLs.

Abstract

Users seek security & privacy (S&P) advice from online resources, including trusted websites and content-sharing platforms. These resources help users understand S&P technologies and tools and suggest actionable strategies. Large Language Models (LLMs) have recently emerged as trusted information sources. However, their accuracy and correctness have been called into question. Prior research has outlined the shortcomings of LLMs in answering multiple-choice questions and user ability to inadvertently circumvent model restrictions (e.g., to produce toxic content). Yet, the ability of LLMs to provide reliable S&P advice is not well-explored. In this paper, we measure their ability to refute popular S&P misconceptions that the general public holds. We first study recent academic literature to curate a dataset of over a hundred S&P-related misconceptions across six different topics. We then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

purseclab/llm_security_privacy_advice
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Hate Speech and Cyberbullying Detection