Fairness Begins with State: Purifying Latent Preferences for Hierarchical Reinforcement Learning in Interactive Recommendation

Yun Lu; Xiaoyu Shi; Hong Xie; Xiangyu Zhao; Mingsheng Shang

arXiv:2603.03820·cs.LG·March 5, 2026

Fairness Begins with State: Purifying Latent Preferences for Hierarchical Reinforcement Learning in Interactive Recommendation

Yun Lu, Xiaoyu Shi, Hong Xie, Xiangyu Zhao, Mingsheng Shang

PDF

Open Access

TL;DR

This paper introduces DSRM-HRL, a hierarchical reinforcement learning framework that purifies user state representations using diffusion models to improve fairness and utility in interactive recommender systems.

Contribution

It proposes a novel latent state purification method with diffusion models and decouples decision-making into hierarchical policies for fairness-aware recommendations.

Findings

01

Outperforms existing methods in fairness and utility trade-offs.

02

Effectively mitigates popularity bias and exposure inequality.

03

Demonstrates robustness in high-fidelity simulators.

Abstract

Interactive recommender systems (IRS) are increasingly optimized with Reinforcement Learning (RL) to capture the sequential nature of user-system dynamics. However, existing fairness-aware methods often suffer from a fundamental oversight: they assume the observed user state is a faithful representation of true preferences. In reality, implicit feedback is contaminated by popularity-driven noise and exposure bias, creating a distorted state that misleads the RL agent. We argue that the persistent conflict between accuracy and fairness is not merely a reward-shaping issue, but a state estimation failure. In this work, we propose \textbf{DSRM-HRL}, a framework that reformulates fairness-aware recommendation as a latent state purification problem followed by decoupled hierarchical decision-making. We introduce a Denoising State Representation Module (DSRM) based on diffusion models to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Mobile Crowdsensing and Crowdsourcing · Ethics and Social Impacts of AI