Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs
Muhammad Abdullahi Said, Muhammad Sammani Sani

TL;DR
This paper systematically audits state-of-the-art LLMs revealing complex interactions between language and temporal framing that affect safety, challenging assumptions of multilingual safety transfer and exposing vulnerabilities in temporal reasoning.
Contribution
It introduces HausaSafety, a novel adversarial dataset, and uncovers the Temporal Asymmetry in LLM safety, proposing Invariant Alignment for improved robustness.
Findings
Claude 4.5 Opus safer in Hausa than English due to uncertainty-driven refusal
Past-tense scenarios bypass safety defenses, future-tense trigger hyper-conservative responses
Safety varies significantly with context, with a 9.2x disparity between configurations
Abstract
As Large Language Models (LLMs) integrate into critical global infrastructure, the assumption that safety alignment transfers zero-shot from English to other languages remains a dangerous blind spot. This study presents a systematic audit of three state of the art models (GPT-5.1, Gemini 3 Pro, and Claude 4.5 Opus) using HausaSafety, a novel adversarial dataset grounded in West African threat scenarios (e.g., Yahoo-Yahoo fraud, Dane gun manufacturing). Employing a 2 x 4 factorial design across 1,440 evaluations, we tested the non-linear interaction between language (English vs. Hausa) and temporal framing. Our results challenge the narrative of the multilingual safety gap. Instead of a simple degradation in low-resource settings, we identified a complex interference mechanism in which safety is determined by the intersection of variables. Although the models exhibited a reverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Computational and Text Analysis Methods
