Image-guided topic modeling for interpretable privacy classification
Alina Elena Baia, Andrea Cavallaro

TL;DR
This paper introduces a novel image-guided topic modeling approach that generates natural language descriptors to predict and explain image privacy, enhancing interpretability and performance over existing methods.
Contribution
The paper presents a new multimodal image-guided topic modeling technique (ITM) that produces interpretable descriptors for privacy prediction, outperforming previous interpretable models.
Findings
Priv×ITM classifier outperforms existing interpretable methods by 5% in accuracy.
Priv×ITM performs comparably to non-interpretable state-of-the-art models.
ITM effectively leverages vision and language data for privacy classification.
Abstract
Predicting and explaining the private information contained in an image in human-understandable terms is a complex and contextual task. This task is challenging even for large language models. To facilitate the understanding of privacy decisions, we propose to predict image privacy based on a set of natural language content descriptors. These content descriptors are associated with privacy scores that reflect how people perceive image content. We generate descriptors with our novel Image-guided Topic Modeling (ITM) approach. ITM leverages, via multimodality alignment, both vision information and image textual descriptions from a vision language model. We use the ITM-generated descriptors to learn a privacy predictor, PrivITM, whose decisions are interpretable by design. Our PrivITM classifier outperforms the reference interpretable method by 5 percentage points in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital and Cyber Forensics
MethodsSparse Evolutionary Training
