Writer Identification and Writer Retrieval Based on NetVLAD with Re-ranking
Shervin Rasoulzadeh, Bagher Babaali

TL;DR
This paper introduces a novel neural network pipeline combining ResNet-20, NetVLAD, and re-ranking for improved writer identification and retrieval, achieving state-of-the-art results on multiple datasets.
Contribution
The work presents a unified neural network architecture with a new re-ranking strategy for writer identification and retrieval, enhancing performance over existing methods.
Findings
Superior performance on ICDAR 2013 and CVL datasets in terms of mAP.
Comparable results to state-of-the-art on KHATT dataset.
Effective use of re-ranking with $k$-reciprocal nearest neighbors.
Abstract
This paper addresses writer identification and writer retrieval which is considered as a challenging problem in the document analysis and recognition field. In this work, a novel pipeline is proposed for the problem at hand by employing a unified neural network architecture consisting of the ResNet-20 as a feature extractor and an integrated NetVLAD layer, inspired by the vector of locally aggregated descriptors (VLAD), in the head of the latter part. Having defined this architecture, the triplet semi-hard loss function is used to directly learn an embedding for individual input image patches. Subsequently, generalized max-pooling technique is employed for the aggregation of embedded descriptors of each handwritten image. Also, a novel re-ranking strategy is introduced for the task of identification and retrieval based on -reciprocal nearest neighbors, and it is shown that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
