Partial Scene Text Retrieval
Hao Wang, Minghui Liao, Zhouyi Xie, Wenyu Liu, Xiang Bai

TL;DR
This paper introduces a novel method for partial scene text retrieval that can locate and search for partial text patches within scene images efficiently, using a shared feature space, ranking MIL, and a dynamic matching algorithm.
Contribution
It proposes a new network that retrieves both text-line instances and partial patches, employing RankMIL and DPMA to improve accuracy and efficiency without extra annotations.
Findings
Effective retrieval of partial text patches demonstrated.
RankMIL reduces noisy sample influence during training.
DPMA significantly speeds up partial patch search.
Abstract
The task of partial scene text retrieval involves localizing and searching for text instances that are the same or similar to a given query text from an image gallery. However, existing methods can only handle text-line instances, leaving the problem of searching for partial patches within these text-line instances unsolved due to a lack of patch annotations in the training data. To address this issue, we propose a network that can simultaneously retrieve both text-line instances and their partial patches. Our method embeds the two types of data (query text and scene text instances) into a shared feature space and measures their cross-modal similarities. To handle partial patches, our proposed approach adopts a Multiple Instance Learning (MIL) approach to learn their similarities with query text, without requiring extra annotations. However, constructing bags, which is a standard step…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
