T-QPM: Enabling Temporal Out-Of-Distribution Detection and Domain Generalization for Vision-Language Models in Open-World

Aditi Naiknaware; Salimeh Sekeh

arXiv:2603.18481·cs.CV·March 20, 2026

T-QPM: Enabling Temporal Out-Of-Distribution Detection and Domain Generalization for Vision-Language Models in Open-World

Aditi Naiknaware, Salimeh Sekeh

PDF

Open Access

TL;DR

This paper introduces T-QPM, a novel framework that enhances vision-language models' ability to detect out-of-distribution data and handle temporal distribution shifts in open-world scenarios through dynamic, cross-modal reasoning and regularization.

Contribution

The paper proposes T-QPM, extending dual-pattern matching with temporal quadruple-patterns and adaptive fusion weights to improve OOD detection and robustness in evolving environments.

Findings

01

Outperforms static baselines on temporally partitioned benchmarks.

02

Effectively detects OOD data in non-stationary environments.

03

Enhances robustness against covariate shifts in vision-language models.

Abstract

Out-of-distribution (OOD) detection remains a critical challenge in open-world learning, where models must adapt to evolving data distributions. While recent vision-language models (VLMS) like CLIP enable multimodal OOD detection through Dual-Pattern Matching (DPM), existing methods typically suffer from two major shortcomings: (1) They rely on fixed fusion rules and assume static environments, failing under temporal drift; and (2) they lack robustness against covariate shifted inputs. In this paper, we propose a novel two-step framework to enhance OOD detection and covariate distribution shift robustness in dynamic settings. We extend the dual-pattern regime into Temporal Quadruple-Pattern Matching (T-QPM). First, by pairing OOD images with text descriptions, we introduce cross-modal consistency patterns between ID and OOD signals, refining the decision boundary through joint…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications