Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision
Sara Mandelli, Nicol\`o Bonettini, Paolo Bestagini, Stefano Tubaro

TL;DR
This paper investigates how JPEG compression impacts CNN training for both forensic and computer vision tasks, emphasizing the importance of considering compression effects for forensic applications to maintain generalization.
Contribution
It highlights the different effects of JPEG compression on CNN training for forensic versus computer vision tasks and provides guidelines for dataset generation.
Findings
JPEG compression significantly affects forensic CNN training.
For computer vision tasks, JPEG effects can often be ignored.
Proper dataset generation considering JPEG effects improves forensic detector performance.
Abstract
Convolutional Neural Networks (CNNs) have proved very accurate in multiple computer vision image classification tasks that required visual inspection in the past (e.g., object recognition, face detection, etc.). Motivated by these astonishing results, researchers have also started using CNNs to cope with image forensic problems (e.g., camera model identification, tampering detection, etc.). However, in computer vision, image classification methods typically rely on visual cues easily detectable by human eyes. Conversely, forensic solutions rely on almost invisible traces that are often very subtle and lie in the fine details of the image under analysis. For this reason, training a CNN to solve a forensic task requires some special care, as common processing operations (e.g., resampling, compression, etc.) can strongly hinder forensic traces. In this work, we focus on the effect that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
