Offline Model-Based Optimization by Learning to Rank
Rong-Xi Tan, Ke Xue, Shen-Huan Lyu, Haopu Shang, Yao Wang, Yaoyuan, Wang, Sheng Fu, Chao Qian

TL;DR
This paper introduces a ranking-based approach for offline model-based optimization, focusing on maintaining the order of candidate designs rather than precise score prediction, leading to improved design selection.
Contribution
It proposes learning to rank techniques for offline MBO, addressing the misalignment of regression models with the goal of selecting top designs, and demonstrates superior empirical performance.
Findings
Ranking-based models better correlate with final design quality.
Order-maintenance metrics outperform MSE in predicting design success.
Proposed method outperforms twenty existing approaches.
Abstract
Offline model-based optimization (MBO) aims to identify a design that maximizes a black-box function using only a fixed, pre-collected dataset of designs and their corresponding scores. A common approach in offline MBO is to train a regression-based surrogate model by minimizing mean squared error (MSE) and then find the best design within this surrogate model by different optimizers (e.g., gradient ascent). However, a critical challenge is the risk of out-of-distribution errors, i.e., the surrogate model may typically overestimate the scores and mislead the optimizers into suboptimal regions. Prior works have attempted to address this issue in various ways, such as using regularization techniques and ensemble learning to enhance the robustness of the model, but it still remains. In this paper, we argue that regression models trained with MSE are not well-aligned with the primary goal…
Peer Reviews
Decision·ICLR 2025 Poster
**Originality**: This paper utilizes the existing LTR loss functions for offline MBO problems. This is a novel way to approach the offline MBO problem. **Quality**: I found some concerns which I will raise in the weakness section. **Clarity**: Overall, the paper has clarity. **Significance**: This new approach to offline MBO could be of significance, if the following questions are answered.
Although the approach is novel, there are concerns regarding how it can generalize performance and alleviate the out-of-distribution issue. It is not convincing that just plugging an LTR loss would address would solve the problem. (See Questions)
Paper was clearly written and original. To the best of my knowledge, the LTR framework was not previously used for offline optimization. Empirical results show positive improvement over previous baselines.
I think the idea that MSE is not always optimal for offline opt was recently pointed out in the MATCH-OPT paper (Hoang et al., 2024) (in which they use the term "value-matching surrogates" to describe model trained with MSE loss). Personally, I think that MSE is not worse than any other metric for optimization, but it could be harder to be estimated accurately for OOD samples. This seems to have been acknowledged in the MATCH-OPT paper. Section 3.1 and 3.2 of this paper seems to focus on justify
- Utilizing ranking scores in place of MSE to train the offline optimization surrogate model is an innovative approach. It aligns more intuitively with the objectives of offline optimization, making it a fitting metric for this context. - The algorithm is very clear and easy to understand. At the same time, this paper conducts a theoretical analysis of generalization error bound. - The authors conducte a wide range of experiments, providing comprehensive comparisons with the latest methods. Th
- Some concepts need to be clarified, such as OOD in the ranking model and Recall@k in Definition 1. These are raised in the "questions" below. - Typos. For example, "e.g" in line 48 should be "e.g.", "caculated" in line 241 should be "calculated". A thorough proofreading is recommended.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-time simulation and control systems · Advanced Control Systems Optimization · Simulation Techniques and Applications
