IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation

Zijie Lin; Yang Zhang; Xiaoyan Zhao; Fengbin Zhu; Fuli Feng; Tat-Seng Chua

arXiv:2506.13229·cs.CL·October 31, 2025

IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation

Zijie Lin, Yang Zhang, Xiaoyan Zhao, Fengbin Zhu, Fuli Feng, Tat-Seng Chua

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method called IGD that models token decisiveness in LLMs for recommendation, using information gain to improve token handling during training and decoding, leading to better recommendation accuracy.

Contribution

The paper proposes a new perspective on token decisiveness using information gain, and develops IGD, a strategy that enhances LLM-based recommendation by emphasizing high-decisiveness tokens.

Findings

01

IGD improves recommendation accuracy across four benchmark datasets.

02

Tokens with low information gain often dominate training but contribute little to discrimination.

03

Rebalancing token importance based on IG enhances model performance.

Abstract

Large Language Models (LLMs) have shown strong potential for recommendation by framing item prediction as a token-by-token language generation task. However, existing methods treat all item tokens equally, simply pursuing likelihood maximization during both optimization and decoding. This overlooks crucial token-level differences in decisiveness-many tokens contribute little to item discrimination yet can dominate optimization or decoding. To quantify token decisiveness, we propose a novel perspective that models item generation as a decision process, measuring token decisiveness by the Information Gain (IG) each token provides in reducing uncertainty about the generated item. Our empirical analysis reveals that most tokens have low IG but often correspond to high logits, disproportionately influencing training loss and decoding, which may impair model performance. Building on these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zjlin2oo1/igd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Access Control and Trust · Data Quality and Management