Synthesizing Scoring Functions for Rankings Using Symbolic Gradient   Descent

Zixuan Chen; Panagiotis Manolios; Mirek Riedewald

arXiv:2406.11797·cs.DB·April 30, 2025

Synthesizing Scoring Functions for Rankings Using Symbolic Gradient Descent

Zixuan Chen, Panagiotis Manolios, Mirek Riedewald

PDF

Open Access

TL;DR

This paper introduces RankHow, a system for synthesizing linear scoring functions to replicate given rankings, utilizing a novel MILP formulation and a symbolic gradient descent method to improve scalability and accuracy.

Contribution

The paper presents a new MILP-based approach for ranking function synthesis and introduces Sym-GD, a novel approximation technique for faster optimization.

Findings

01

RankHow outperforms existing methods in accuracy.

02

Sym-GD significantly improves scalability.

03

The MILP formulation is more efficient than tree-based algorithms.

Abstract

Given a relation and a ranking of its tuples, but no information about the ranking function, we are interested in synthesizing simple scoring functions that reproduce the ranking. Our system RankHow identifies linear scoring functions that minimize position-based error, while supporting flexible constraints on their weights. It is based on a new formulation as a mixed-integer linear program (MILP). While MILP is NP-hard in general, we show that RankHow is orders of magnitude faster than a tree-based algorithm that guarantees polynomial time complexity (PTIME) in the number of input tuples by reducing the MILP problem to many linear programs (LPs). We hypothesize that this is caused by 2 properties: First, the PTIME algorithm is equivalent to a naive evaluation strategy for the MILP program. Second, MILP solvers rely on advanced heuristics to reason holistically about the entire program,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference