Permissive Information-Flow Analysis for Large Language Models
Shoaib Ahmed Siddiqui, Radhika Gaonkar, Boris K\"opf, David Krueger, Andrew Paverd, Ahmed Salem, Shruti Tople, Lukas Wutschitz, Menglin Xia, Santiago Zanella-B\'eguelin

TL;DR
This paper introduces a permissive information-flow analysis method for large language models that selectively propagates influential input labels, enhancing security and privacy without being overly conservative.
Contribution
It proposes a novel, less restrictive label propagation technique for LLMs, improving security analysis by focusing on influential inputs rather than all data.
Findings
Improves information flow analysis in LLMs in over 85% of cases.
Demonstrates effectiveness of prompt-based and k-NN based label propagation.
Outperforms baseline introspection method in experimental evaluations.
Abstract
Large Language Models (LLMs) are rapidly becoming commodity components of larger software systems. This poses natural security and privacy problems: poisoned data retrieved from one component can change the model's behavior and compromise the entire system, including coercing the model to spread confidential data to untrusted components. One promising approach is to tackle this problem at the system level via dynamic information flow (aka taint) tracking. Unfortunately, this approach of propagating the most restrictive input label to the output is too conservative for applications where LLMs operate on inputs retrieved from diverse sources. In this paper, we propose a novel, more permissive approach to propagate information flow labels through LLM queries. The key idea behind our approach is to propagate only the labels of the samples that were influential in generating the model output…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Data Quality and Management
