Transductive Learning for Near-Duplicate Image Detection in Scanned   Photo Collections

Francesc Net; Marc Folia; Pep Casals; and Lluis Gomez

arXiv:2410.19437·cs.CV·October 28, 2024

Transductive Learning for Near-Duplicate Image Detection in Scanned Photo Collections

Francesc Net, Marc Folia, Pep Casals, and Lluis Gomez

PDF

Open Access

TL;DR

This paper introduces a transductive learning method utilizing deep neural networks and Vision Transformers to improve near-duplicate image detection in scanned photo collections, reducing manual annotation effort.

Contribution

It proposes a novel transductive learning approach that fine-tunes pre-trained models on unlabeled target data for near-duplicate detection, outperforming baseline methods.

Findings

01

Outperforms baseline methods on UKBench dataset

02

Effective in reducing manual annotation time

03

Leverages self-supervised fine-tuning on unlabeled data

Abstract

This paper presents a comparative study of near-duplicate image detection techniques in a real-world use case scenario, where a document management company is commissioned to manually annotate a collection of scanned photographs. Detecting duplicate and near-duplicate photographs can reduce the time spent on manual annotation by archivists. This real use case differs from laboratory settings as the deployment dataset is available in advance, allowing the use of transductive learning. We propose a transductive learning approach that leverages state-of-the-art deep learning architectures such as convolutional neural networks (CNNs) and Vision Transformers (ViTs). Our approach involves pre-training a deep neural network on a large dataset and then fine-tuning the network on the unlabeled target collection with self-supervised learning. The results show that the proposed approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Digital Media Forensic Detection · Image Processing Techniques and Applications