Specificity-aware reinforcement learning for fine-grained open-world classification

Samuele Angheben; Davide Berasi; Alessandro Conti; Elisa Ricci; Yiming Wang

arXiv:2603.03197·cs.CV·April 14, 2026

Specificity-aware reinforcement learning for fine-grained open-world classification

Samuele Angheben, Davide Berasi, Alessandro Conti, Elisa Ricci, Yiming Wang

PDF

1 Repo 4 Models

TL;DR

This paper introduces SpeciaRL, a reinforcement learning framework that enhances large multimodal models to produce more specific and accurate fine-grained classifications in open-world scenarios.

Contribution

It proposes a novel reinforcement learning approach with a verifier-based reward to improve specificity without losing correctness in open-world fine-grained classification.

Findings

01

Outperforms existing methods on fine-grained benchmarks

02

Achieves a better trade-off between correctness and specificity

03

Demonstrates effectiveness in out-of-domain experiments

Abstract

Classifying fine-grained visual concepts under open-world settings, i.e., without a predefined label set, demands models to be both accurate and specific. Recent reasoning Large Multimodal Models (LMMs) exhibit strong visual understanding capability but tend to produce overly generic predictions when performing fine-grained image classification. Our preliminary analysis reveals that models do possess the intrinsic fine-grained domain knowledge. However, promoting more specific predictions (specificity) without compromising correct ones (correctness) remains a non-trivial and understudied challenge. In this work, we investigate how to steer reasoning LMMs toward predictions that are both correct and specific. We propose a novel specificity-aware reinforcement learning framework, SpeciaRL, to fine-tune reasoning LMMs on fine-grained image classification under the open-world setting.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

s-angheben/SpeciaRL
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.