Distributionally Robust Multi-Output Regression Ranking
Shahabeddin Sotudian, Ruidi Chen, Ioannis Paschalidis

TL;DR
This paper introduces DRMRR, a robust listwise learning-to-rank model that effectively handles data errors, shifts, and adversarial perturbations, outperforming existing models in real-world applications.
Contribution
The paper proposes a novel distributionally robust multi-output ranking model using Wasserstein-based DRO, capturing local and cross-document interactions for improved robustness.
Findings
DRMRR outperforms state-of-the-art LTR models in medical retrieval and drug response tasks.
DRMRR maintains stable performance under various noise conditions.
The model effectively resists adversarial and label noise perturbations.
Abstract
Despite their empirical success, most existing listwiselearning-to-rank (LTR) models are not built to be robust to errors in labeling or annotation, distributional data shift, or adversarial data perturbations. To fill this gap, we introduce a new listwise LTR model called Distributionally Robust Multi-output Regression Ranking (DRMRR). Different from existing methods, the scoring function of DRMRR was designed as a multivariate mapping from a feature vector to a vector of deviation scores, which captures local context information and cross-document interactions. DRMRR uses a Distributionally Robust Optimization (DRO) framework to minimize a multi-output loss function under the most adverse distributions in the neighborhood of the empirical data distribution defined by a Wasserstein ball. We show that this is equivalent to a regularized regression problem with a matrix norm regularizer.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Imbalanced Data Classification Techniques
