Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets
Mohd. Farhan Israk Soumik, Syed Mhamudul Hasan, Abdur R. Shahid

TL;DR
This paper empirically evaluates how Apple Intelligence's writing tools can modify text to reduce emotion inference attacks by LLMs, highlighting their potential as privacy-preserving mechanisms on user devices.
Contribution
It introduces early datasets and provides the first empirical analysis of Apple Intelligence's text modifications for privacy against LLM-based inference attacks.
Findings
Text modifications can significantly reduce emotion inference by LLMs.
Apple's writing tools show promise as privacy-preserving mechanisms.
Foundations for adaptive rewriting systems to protect user privacy.
Abstract
The misuse of Large Language Models (LLMs) to infer emotions from text for malicious purposes, known as emotion inference attacks, poses a significant threat to user privacy. In this paper, we investigate the potential of Apple Intelligence's writing tools, integrated across iPhone, iPad, and MacBook, to mitigate these risks through text modifications such as rewriting and tone adjustment. By developing early novel datasets specifically for this purpose, we empirically assess how different text modifications influence LLM-based detection. This capability suggests strong potential for Apple Intelligence's writing tools as privacy-preserving mechanisms. Our findings lay the groundwork for future adaptive rewriting systems capable of dynamically neutralizing sensitive emotional content to enhance user privacy. To the best of our knowledge, this research provides the first empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Hate Speech and Cyberbullying Detection · Spam and Phishing Detection
