MARS: Paying more attention to visual attributes for text-based person search
Alex Ergasti, Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi,, Andrea Prati

TL;DR
This paper introduces MARS, a novel architecture for text-based person search that improves representation learning by focusing on visual attributes and balancing attribute contributions, leading to significant performance gains.
Contribution
The paper proposes MARS, a new TBPS model with a Masked AutoEncoder and Attribute Loss to enhance visual-attribute representation and address intra- and inter-identity challenges.
Findings
Significant improvements in mAP on CUHK-PEDES, ICFG-PEDES, and RSTPReid datasets.
Enhanced representation learning through Masked AutoEncoder training.
Better attribute balancing results in improved retrieval accuracy.
Abstract
Text-based person search (TBPS) is a problem that gained significant interest within the research community. The task is that of retrieving one or more images of a specific individual based on a textual description. The multi-modal nature of the task requires learning representations that bridge text and image data within a shared latent space. Existing TBPS systems face two major challenges. One is defined as inter-identity noise that is due to the inherent vagueness and imprecision of text descriptions and it indicates how descriptions of visual attributes can be generally associated to different people; the other is the intra-identity variations, which are all those nuisances e.g. pose, illumination, that can alter the visual appearance of the same textual attributes for a given subject. To address these issues, this paper presents a novel TBPS architecture named MARS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Data-Driven Disease Surveillance · Geographic Information Systems Studies
