Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs

Muhammad Abdullahi Said; Muhammad Sammani Sani

arXiv:2512.24556·cs.CL·January 9, 2026

Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs

Muhammad Abdullahi Said, Muhammad Sammani Sani

PDF

Open Access

TL;DR

This paper systematically audits state-of-the-art LLMs revealing complex interactions between language and temporal framing that affect safety, challenging assumptions of multilingual safety transfer and exposing vulnerabilities in temporal reasoning.

Contribution

It introduces HausaSafety, a novel adversarial dataset, and uncovers the Temporal Asymmetry in LLM safety, proposing Invariant Alignment for improved robustness.

Findings

01

Claude 4.5 Opus safer in Hausa than English due to uncertainty-driven refusal

02

Past-tense scenarios bypass safety defenses, future-tense trigger hyper-conservative responses

03

Safety varies significantly with context, with a 9.2x disparity between configurations

Abstract

As Large Language Models (LLMs) integrate into critical global infrastructure, the assumption that safety alignment transfers zero-shot from English to other languages remains a dangerous blind spot. This study presents a systematic audit of three state of the art models (GPT-5.1, Gemini 3 Pro, and Claude 4.5 Opus) using HausaSafety, a novel adversarial dataset grounded in West African threat scenarios (e.g., Yahoo-Yahoo fraud, Dane gun manufacturing). Employing a 2 x 4 factorial design across 1,440 evaluations, we tested the non-linear interaction between language (English vs. Hausa) and temporal framing. Our results challenge the narrative of the multilingual safety gap. Instead of a simple degradation in low-resource settings, we identified a complex interference mechanism in which safety is determined by the intersection of variables. Although the models exhibited a reverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Computational and Text Analysis Methods