Goal-Conditioned Supervised Learning for Multi-Objective Recommendation
Shijun Li, Hilaf Hasson, Jing Hu, Joydeep Ghosh

TL;DR
This paper presents MOGCSL, a framework for multi-objective goal-conditioned supervised learning that improves multi-objective recommendation by filtering noise and modeling high-achievable goals, demonstrated on real-world datasets.
Contribution
MOGCSL extends GCSL to multi-objective scenarios with a novel goal-selection algorithm, eliminating complex architectures and effectively handling noisy data in recommender systems.
Findings
MOGCSL outperforms baseline methods on real-world recommendation datasets.
It effectively filters out uninformative or noisy training instances.
The framework demonstrates robustness and scalability in multi-objective recommendation tasks.
Abstract
Multi-objective learning endeavors to concurrently optimize multiple objectives using a single model, aiming to achieve high and balanced performance across diverse objectives. However, this often entails a more complex optimization problem, particularly when navigating potential conflicts between objectives, leading to solutions with higher memory requirements and computational complexity. This paper introduces a Multi-Objective Goal-Conditioned Supervised Learning (MOGCSL) framework for automatically learning to achieve multiple objectives from offline sequential data. MOGCSL extends the conventional GCSL method to multi-objective scenarios by redefining goals from one-dimensional scalars to multi-dimensional vectors. It benefits from naturally eliminating the need for complex architectures and optimization constraints. Moreover, MOGCSL effectively filters out uninformative or noisy…
Peer Reviews
Decision·Submitted to ICLR 2026
1. The manuscript reframes multi-objective recommendation as a goal-conditioned supervised learning problem. This approach avoids the need for complex multi-task architectures or explicit conflict-handling during optimization, which is an insightful contribution. 2. MOGCSL is shown to be more efficient than baselines in both model size and training speed. This is a crucial advantage for large-scale industrial applications. 3. The work commendably tackles the difficult problem of goal selection d
1. The manuscript's main technical contribution, a CVAE-based goal selection algorithm (MOGCSL-C), fails to consistently outperform a simple statistical heuristic (MOGCSL-S) and performs worse on one dataset. This undermines the algorithm's practical value and questions the significance of its theoretical backing. 2. The explanation for the CVAE method's failure—data sparsity—feels like a post-hoc justification. The supporting experiment is relegated to an appendix and uses a different dataset,
1. The writing is clear and easy to follow. 2. The investigated multi-task learning problem is essential in recommendation system field.
1. In experiments, it would be beneficial to include more recent baselines with high reputations. On the MTL aspect, it seems that only DWA, PE and FAMO are involved. However, there are massive important MTL optimizers, including but not limited to MGDA, PCGrad, UPGrad, MoDo, Nash-MTL, GradNorm, etc. Moreover, since authors mainly consider the sequential recommendation problem, it is not very clear why sequential recommendation models (e.g., FMLP-Rec, DuoRec, Longer) are not involved as baselin
1. The experiment of this paper is comprehensive. They also show some comparisons about MOPRL and MOGCSL in the Appendix. I like the comparison between MOPRL and MOGCSL with different weights (MOGCSL remains unchanged). This paper also contains many baselines and shows that MOGCSL achieves the best performance among many baselines. 2. The algorithms are proposed in a clear way. The paper is also well-structured and easy to follow. 3. The idea of using multi-objective GCSL to provide a sol
1. In Line 146, the author claims that the dataset contains the $s_t$, which is the representation of the user's preferences. It does not practical in the real world. What does the real representation look like in your experiment? 2. The model structure introduced in Section 3.2 appears very similar to PRL, with the only difference being the addition of an extra MLP layer. Therefore, the improved performance over previous Shared-Bottom and MMoE models may largely stem from this architectural ch
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques
MethodsFocus
