MLPlatt: Simple Calibration Framework for Ranking Models
Piotr Bajger, Roman Dusek, Krzysztof Galias, Pawe{\l} M{\l}yniec, Aleksander Wawer, Pawe{\l} Zawistowski

TL;DR
MLPlatt is a straightforward post-hoc calibration method for ranking models in e-commerce, improving interpretability and calibration accuracy without sacrificing ranking performance.
Contribution
Introduces MLPlatt, a simple, context-aware calibration framework that preserves ranking order and enhances calibration metrics in ranking models.
Findings
Achieves over 10% improvement in F-ECE compared to existing methods.
Maintains ranking quality while improving calibration.
Effective across different contexts and categorical strata.
Abstract
Ranking models are extensively used in e-commerce for relevance estimation. These models often suffer from poor interpretability and no scale calibration, particularly when trained with typical ranking loss functions. This paper addresses the problem of post-hoc calibration of ranking models. We introduce MLPlatt: a simple yet effective ranking model calibration method that preserves the item ordering and converts ranker outputs to interpretable click-through rate (CTR) probabilities usable in downstream tasks. The method is context-aware by design and achieves good calibration metrics globally, and within strata corresponding to different values of a selected categorical field (such as user country or device), which is often important from a business perspective of an E-commerce platform. We demonstrate the superiority of MLPlatt over existing approaches on two datasets, achieving an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Recommender Systems and Techniques · Multimodal Machine Learning Applications
