# Dynamic Deep Multi-modal Fusion for Image Privacy Prediction

**Authors:** Ashwini Tonge, Cornelia Caragea

arXiv: 1902.10796 · 2019-03-07

## TL;DR

This paper introduces a dynamic multi-modal fusion method that adaptively combines object, scene, and tag information from CNNs to improve online image privacy prediction accuracy.

## Contribution

It presents a novel approach that identifies the most competent modalities on the fly for each image, enhancing privacy prediction over existing models.

## Key findings

- Outperforms models trained on individual modalities.
- Achieves higher accuracy than prior privacy prediction methods.
- Surpasses strong baseline methods using meta-classifiers.

## Abstract

With millions of images that are shared online on social networking sites, effective methods for image privacy prediction are highly needed. In this paper, we propose an approach for fusing object, scene context, and image tags modalities derived from convolutional neural networks for accurately predicting the privacy of images shared online. Specifically, our approach identifies the set of most competent modalities on the fly, according to each new target image whose privacy has to be predicted. The approach considers three stages to predict the privacy of a target image, wherein we first identify the neighborhood images that are visually similar and/or have similar sensitive content as the target image. Then, we estimate the competence of the modalities based on the neighborhood images. Finally, we fuse the decisions of the most competent modalities and predict the privacy label for the target image. Experimental results show that our approach predicts the sensitive (or private) content more accurately than the models trained on individual modalities (object, scene, and tags) and prior privacy prediction works. Also, our approach outperforms strong baselines, that train meta-classifiers to obtain an optimal combination of modalities.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.10796/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1902.10796/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/1902.10796/full.md

---
Source: https://tomesphere.com/paper/1902.10796