Image Collation: Matching illustrations in manuscripts
Ryad Kaoua, Xi Shen, Alexandra Durr, Stavros Lazaris, David Picard,, Mathieu Aubry

TL;DR
This paper introduces the task of illustration collation in manuscripts, providing a large annotated dataset, analyzing current similarity measures, and demonstrating that cycle-consistent correspondences significantly improve matching accuracy.
Contribution
It presents the first large annotated dataset for illustration collation, evaluates existing similarity measures, and shows the effectiveness of cycle-consistent matching methods.
Findings
State-of-the-art similarity measures struggle with heavily modified illustrations.
Cycle-consistent correspondences significantly improve matching performance.
The dataset enables benchmarking and further research in manuscript illustration analysis.
Abstract
Illustrations are an essential transmission instrument. For an historian, the first step in studying their evolution in a corpus of similar manuscripts is to identify which ones correspond to each other. This image collation task is daunting for manuscripts separated by many lost copies, spreading over centuries, which might have been completely re-organized and greatly modified to adapt to novel knowledge or belief and include hundreds of illustrations. Our contributions in this paper are threefold. First, we introduce the task of illustration collation and a large annotated public dataset to evaluate solutions, including 6 manuscripts of 2 different texts with more than 2 000 illustrations and 1 200 annotated correspondences. Second, we analyze state of the art similarity measures for this task and show that they succeed in simple cases but struggle for large manuscripts when the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Digital Humanities and Scholarship · Generative Adversarial Networks and Image Synthesis
