STaR: Sensitive Trajectory Regulation for Unlearning in Large Reasoning Models

Jingjing Zhou; Gaoxiang Cong; Li Su; Liang Li

arXiv:2601.09281·cs.AI·January 15, 2026

STaR: Sensitive Trajectory Regulation for Unlearning in Large Reasoning Models

Jingjing Zhou, Gaoxiang Cong, Li Su, Liang Li

PDF

Open Access 1 Video

TL;DR

This paper introduces STaR, a novel inference-time framework for unlearning sensitive information in large reasoning models, ensuring privacy throughout the reasoning process without significant utility loss.

Contribution

STaR is a parameter-free, trajectory-aware unlearning method that effectively removes sensitive content from reasoning chains in LRMs, addressing limitations of prior answer-focused approaches.

Findings

01

Achieves robust privacy protection across reasoning trajectories.

02

Maintains high utility with minimal performance degradation.

03

Outperforms existing methods on the R-TOFU benchmark.

Abstract

Large Reasoning Models (LRMs) have advanced automated multi-step reasoning, but their ability to generate complex Chain-of-Thought (CoT) trajectories introduces severe privacy risks, as sensitive information may be deeply embedded throughout the reasoning process. Existing Large Language Models (LLMs) unlearning approaches that typically focus on modifying only final answers are insufficient for LRMs, as they fail to remove sensitive content from intermediate steps, leading to persistent privacy leakage and degraded security. To address these challenges, we propose Sensitive Trajectory Regulation (STaR), a parameter-free, inference-time unlearning framework that achieves robust privacy protection throughout the reasoning process. Specifically, we first identify sensitive content via semantic-aware detection. Then, we inject global safety constraints through secure prompt prefix. Next,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

STaR: Sensitive Trajectory Regulation for Unlearning in Large Reasoning Models· underline

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks