FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis
Zhe Chen, Zijing Chen

TL;DR
FLORA introduces a training-free, logic-based framework leveraging large language models for robust zero-shot object referring analysis, significantly improving performance without additional training.
Contribution
The paper presents FLORA, a novel formal language model that enables training-free zero-shot object referring analysis by integrating logical reasoning with large language models.
Findings
Boosts zero-shot performance of pretrained detectors by up to 45%
Outperforms current state-of-the-art zero-shot methods in detection and segmentation
Provides robustness and interpretability through probabilistic reasoning
Abstract
Object Referring Analysis (ORA), commonly known as referring expression comprehension, requires the identification and localization of specific objects in an image based on natural descriptions. Unlike generic object detection, ORA requires both accurate language understanding and precise visual localization, making it inherently more complex. Although recent pre-trained large visual grounding detectors have achieved significant progress, they heavily rely on extensively labeled data and time-consuming learning. To address these, we introduce a novel, training-free framework for zero-shot ORA, termed FLORA (Formal Language for Object Referring and Analysis). FLORA harnesses the inherent reasoning capabilities of large language models (LLMs) and integrates a formal language model - a logical framework that regulates language within structured, rule-based descriptions - to provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
