Synthesizing Scoring Functions for Rankings Using Symbolic Gradient Descent
Zixuan Chen, Panagiotis Manolios, Mirek Riedewald

TL;DR
This paper introduces RankHow, a system for synthesizing linear scoring functions to replicate given rankings, utilizing a novel MILP formulation and a symbolic gradient descent method to improve scalability and accuracy.
Contribution
The paper presents a new MILP-based approach for ranking function synthesis and introduces Sym-GD, a novel approximation technique for faster optimization.
Findings
RankHow outperforms existing methods in accuracy.
Sym-GD significantly improves scalability.
The MILP formulation is more efficient than tree-based algorithms.
Abstract
Given a relation and a ranking of its tuples, but no information about the ranking function, we are interested in synthesizing simple scoring functions that reproduce the ranking. Our system RankHow identifies linear scoring functions that minimize position-based error, while supporting flexible constraints on their weights. It is based on a new formulation as a mixed-integer linear program (MILP). While MILP is NP-hard in general, we show that RankHow is orders of magnitude faster than a tree-based algorithm that guarantees polynomial time complexity (PTIME) in the number of input tuples by reducing the MILP problem to many linear programs (LPs). We hypothesize that this is caused by 2 properties: First, the PTIME algorithm is equivalent to a naive evaluation strategy for the MILP program. Second, MILP solvers rely on advanced heuristics to reason holistically about the entire program,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
