Goal-Conditioned Supervised Learning for Multi-Objective Recommendation

Shijun Li; Hilaf Hasson; Jing Hu; Joydeep Ghosh

arXiv:2412.08911·cs.LG·May 16, 2025

Goal-Conditioned Supervised Learning for Multi-Objective Recommendation

Shijun Li, Hilaf Hasson, Jing Hu, Joydeep Ghosh

PDF

Open Access 3 Reviews

TL;DR

This paper presents MOGCSL, a framework for multi-objective goal-conditioned supervised learning that improves multi-objective recommendation by filtering noise and modeling high-achievable goals, demonstrated on real-world datasets.

Contribution

MOGCSL extends GCSL to multi-objective scenarios with a novel goal-selection algorithm, eliminating complex architectures and effectively handling noisy data in recommender systems.

Findings

01

MOGCSL outperforms baseline methods on real-world recommendation datasets.

02

It effectively filters out uninformative or noisy training instances.

03

The framework demonstrates robustness and scalability in multi-objective recommendation tasks.

Abstract

Multi-objective learning endeavors to concurrently optimize multiple objectives using a single model, aiming to achieve high and balanced performance across diverse objectives. However, this often entails a more complex optimization problem, particularly when navigating potential conflicts between objectives, leading to solutions with higher memory requirements and computational complexity. This paper introduces a Multi-Objective Goal-Conditioned Supervised Learning (MOGCSL) framework for automatically learning to achieve multiple objectives from offline sequential data. MOGCSL extends the conventional GCSL method to multi-objective scenarios by redefining goals from one-dimensional scalars to multi-dimensional vectors. It benefits from naturally eliminating the need for complex architectures and optimization constraints. Moreover, MOGCSL effectively filters out uninformative or noisy…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 4

Strengths

1. The manuscript reframes multi-objective recommendation as a goal-conditioned supervised learning problem. This approach avoids the need for complex multi-task architectures or explicit conflict-handling during optimization, which is an insightful contribution. 2. MOGCSL is shown to be more efficient than baselines in both model size and training speed. This is a crucial advantage for large-scale industrial applications. 3. The work commendably tackles the difficult problem of goal selection d

Weaknesses

1. The manuscript's main technical contribution, a CVAE-based goal selection algorithm (MOGCSL-C), fails to consistently outperform a simple statistical heuristic (MOGCSL-S) and performs worse on one dataset. This undermines the algorithm's practical value and questions the significance of its theoretical backing. 2. The explanation for the CVAE method's failure—data sparsity—feels like a post-hoc justification. The supporting experiment is relegated to an appendix and uses a different dataset,

Reviewer 02Rating 2Confidence 3

Strengths

1. The writing is clear and easy to follow. 2. The investigated multi-task learning problem is essential in recommendation system field.

Weaknesses

1. In experiments, it would be beneficial to include more recent baselines with high reputations. On the MTL aspect, it seems that only DWA, PE and FAMO are involved. However, there are massive important MTL optimizers, including but not limited to MGDA, PCGrad, UPGrad, MoDo, Nash-MTL, GradNorm, etc. Moreover, since authors mainly consider the sequential recommendation problem, it is not very clear why sequential recommendation models (e.g., FMLP-Rec, DuoRec, Longer) are not involved as baselin

Reviewer 03Rating 6Confidence 4

Strengths

1. The experiment of this paper is comprehensive. They also show some comparisons about MOPRL and MOGCSL in the Appendix. I like the comparison between MOPRL and MOGCSL with different weights (MOGCSL remains unchanged). This paper also contains many baselines and shows that MOGCSL achieves the best performance among many baselines. 2. The algorithms are proposed in a clear way. The paper is also well-structured and easy to follow. 3. The idea of using multi-objective GCSL to provide a sol

Weaknesses

1. In Line 146, the author claims that the dataset contains the $s_t$, which is the representation of the user's preferences. It does not practical in the real world. What does the real representation look like in your experiment? 2. The model structure introduced in Section 3.2 appears very similar to PRL, with the only difference being the addition of an extra MLP layer. Therefore, the improved performance over previous Shared-Bottom and MMoE models may largely stem from this architectural ch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques

MethodsFocus