# Multi-branch network for double JPEG detection and localization

**Authors:** Ahmed M. Fouad, Hala H. Zayed, Ahmed Taha

PMC · DOI: 10.1038/s41598-025-04203-0 · Scientific Reports · 2025-06-03

## TL;DR

This paper introduces a multi-branch neural network that improves the detection and localization of double JPEG compression in images, outperforming existing methods.

## Contribution

The novel multi-branch architecture captures both inter- and intra-band DCT correlations for enhanced double JPEG detection.

## Key findings

- The multi-branch model achieves 94.15% accuracy on the Park dataset, surpassing state-of-the-art methods.
- The model excels in localizing manipulated regions in real-world images.
- Adding intra-branches improves performance on complex datasets with diverse quantization tables.

## Abstract

Recently, the accessibility and user-friendly nature of image editing tools have increased, allowing even inexperienced users to create and share forged images. Therefore, developing forensic methods to detect forged images is crucial. JPEG image tampering often involves recompression with a different quantization table, known as double JPEG compression. This paper proposes a multi-branch convolutional neural network and compares it with single-branch models to demonstrate its effectiveness in detecting double JPEG compression. The network consists of inter-branches, capturing statistical correlations across all Discrete Cosine Transform (DCT) frequency bands, and intra-branches, focusing on within-band correlations. By increasing feature extraction through additional intra-branches, the system enhances detection performance, particularly in complex datasets with diverse quantization tables. Features are concatenated with the image quantization table to improve robustness across varying quantization table combinations. Evaluated on the Park dataset, which includes over a million JPEG patches and 1,120 randomly assigned quantization tables, the proposed multi-branch model outperforms single-branch architectures (VGG16, DenseNet121, ResNet50) and surpasses state-of-the-art methods with a 94.15% accuracy. Furthermore, it demonstrates superior performance in localizing manipulated regions in real-world images.

## Full-text entities

- **Diseases:** DJPEG (MESH:D005671)
- **Chemicals:** DCT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12134093/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12134093/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12134093/full.md

---
Source: https://tomesphere.com/paper/PMC12134093