Blind Dates: Examining the Expression of Temporality in Historical Photographs
Alexandra Barancov\'a, Melvin Wevers, Nanne van Noord

TL;DR
This study evaluates the ability of OpenCLIP, a multi-modal vision-language model, to date historical photographs, finding that fine-tuning improves accuracy and reveals visual cues associated with specific time periods.
Contribution
It demonstrates the effectiveness of fine-tuning OpenCLIP for image dating and identifies visual markers that aid temporal classification in historical photographs.
Findings
Zero-shot classification is less effective and biased towards past dates.
Fine-tuning improves accuracy and reduces bias.
Images with vehicles, animals, and people are more accurately dated.
Abstract
This paper explores the capacity of computer vision models to discern temporal information in visual content, focusing specifically on historical photographs. We investigate the dating of images using OpenCLIP, an open-source implementation of CLIP, a multi-modal language and vision model. Our experiment consists of three steps: zero-shot classification, fine-tuning, and analysis of visual content. We use the \textit{De Boer Scene Detection} dataset, containing 39,866 gray-scale historical press photographs from 1950 to 1999. The results show that zero-shot classification is relatively ineffective for image dating, with a bias towards predicting dates in the past. Fine-tuning OpenCLIP with a logistic classifier improves performance and eliminates the bias. Additionally, our analysis reveals that images featuring buses, cars, cats, dogs, and people are more accurately dated, suggesting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
MethodsContrastive Language-Image Pre-training
