Multiclass MinMax Rank Aggregation

Pan Li; Olgica Milenkovic

arXiv:1701.08305·cs.LG·February 6, 2017

Multiclass MinMax Rank Aggregation

Pan Li, Olgica Milenkovic

PDF

Open Access

TL;DR

This paper introduces new minmax rank aggregation problems using Kendall tau and Spearman footrule distances, providing approximation algorithms and demonstrating their applications on Mallows model and genomic data.

Contribution

It presents the first constant-approximation algorithms for NP-hard minmax rank aggregation problems under two distance measures.

Findings

01

Algorithms achieve constant approximation ratios.

02

Applications demonstrate effectiveness on real data.

03

Framework applicable to various ranking scenarios.

Abstract

We introduce a new family of minmax rank aggregation problems under two distance measures, the Kendall {\tau} and the Spearman footrule. As the problems are NP-hard, we proceed to describe a number of constant-approximation algorithms for solving them. We conclude with illustrative applications of the aggregation methods on the Mallows model and genomic data.

Tables4

mmKT-Conv

(V, 𝐮)

1: Choose the pivot

v \in V

according to

v = {argmin}_{a} \max_{k} \frac{A_{a}^{k} ​ (𝐮)}{B_{a}^{k} ​ (𝐮)} .

2: Set

V_{L} = \emptyset, V_{R} = \emptyset

.

3: For all

x \in V_{v}

:

If

h_{x ​ v} = 1

,

V_{L} \leftarrow V_{L} \cup {x}

. Otherwise,

V_{R} \leftarrow V_{R} \cup {x}

.

4: Return [mmKT-Conv

(V_{L}, 𝐮)

,

v

, mmKT-Conv

(V_{R}, 𝐮)

].

min-Pick-Perm $(Σ^{1}, Σ^{2}, \dots, Σ^{C})$ , $(λ_{1}, λ_{2}, \dots, λ_{C})$ .
1: For each $k \in C$ and each ranking $σ_{i}^{k} \in Σ^{k}$
2: Compute Score ${}_{i}^{k}= \max_{j \in C / {k}} λ_{j} \min_{σ_{s}^{j} \in Σ^{j}} d_{⋆} (σ_{i}^{k}, σ_{s}^{j}) .$
3: Let $(i^{}, k^{}) = \arg_{(i, k)} \min$ Score ${}^{k}_{i}$ . Output $π = σ_{i^{}}^{k^{}}$ .

Table 3. TABLE I: Comparison of rank aggregation methods: Objective value (standard deviation)

A. $d_{m e d - τ}$
$ϕ_{1}$	0.5	0.7	0.9	1.0
mmKT-Conv	14.5 (1.1)	16.3 (1.4)	17.8 (1.3)	17.9 (1.5)
Pick-Rnd-Perm	17.8 (1.4)	19.9 (2.1)	21.5 (1.8)	21.6 (2.1)
Pick-Opt-Perm	15.9 (1.8)	18.1 (1.8)	20.0 (1.7)	20.0 (1.6)
FASLP-Pivot	15.3 (1.4)	17.7 (2.1)	19.4 (2.2)	19.7 (2.3)

Table 4. TABLE II: Mitochondrial DNA (mtDNA) aggregation

	$d_{m e d - τ}$	Aggregated Sequences
mmKT-Conv	210	1 10 7 2 17 12 30 9 11 23 19 20 21
		13 35 3 15 14 25 26 6 16 32 28 34
		4 24 27 18 36 29 31 8 33 22 5
Pick-Opt-Perm	267	1 27 2 17 36 20 3 29 10 11 35 12 30
		21 9 19 18 28 33 7 8 16 26 14 34 13
		24 15 32 25 4 22 23 6 31 5
FASLP-Pivot	269	1 2 17 7 23 12 3 20 30 21 6 9 10
		11 15 19 28 25 27 18 32 8 33 24 13 34
		14 4 35 29 26 16 36 31 22 5

Equations66

MinMax: π min k max λ_{k} d (π, Σ^{k}),

MinMax: π min k max λ_{k} d (π, Σ^{k}),

σ (x)

σ (x)

+ \frac{1}{2} (∣ {y \in [n] : y is tied with x} ∣ + 1) .

d_{τ} (σ, π) = ∣ {(x, y) : π (x) > π (y), σ (x) < σ (y)} ∣.

d_{τ} (σ, π) = ∣ {(x, y) : π (x) > π (y), σ (x) < σ (y)} ∣.

d_{S} (σ, π) = x \in [n] \sum ∣ σ (x) - π (x) ∣.

d_{S} (σ, π) = x \in [n] \sum ∣ σ (x) - π (x) ∣.

d_{K} (π, σ) =

d_{K} (π, σ) =

+

π (x) > π (y), σ (x) = σ (y)} ∣.

d_{p r S} (σ, π) = x \in [n] \sum ∣ σ (x) - π (x) ∣,

d_{p r S} (σ, π) = x \in [n] \sum ∣ σ (x) - π (x) ∣,

d_{m e d -⋆} (π, Σ) = \frac{1}{∣Σ∣} σ \in Σ \sum d_{⋆} (π, σ) .

d_{m e d -⋆} (π, Σ) = \frac{1}{∣Σ∣} σ \in Σ \sum d_{⋆} (π, σ) .

d_{min -⋆} (π, Σ) = σ \in Σ min d_{⋆} (π, σ) .

d_{min -⋆} (π, Σ) = σ \in Σ min d_{⋆} (π, σ) .

MinMax: π min k max λ_{k} d (π, Σ^{k}),

MinMax: π min k max λ_{k} d (π, Σ^{k}),

λ_{k} d_{m e d -⋆} (π, Σ^{k}) = \frac{λ _{k}}{m _{k}} i = 1 \sum m_{k} d_{⋆} (π, Σ^{k})

λ_{k} d_{m e d -⋆} (π, Σ^{k}) = \frac{λ _{k}}{m _{k}} i = 1 \sum m_{k} d_{⋆} (π, Σ^{k})

⩽

E [k max λ_{k} d_{m e d -⋆} (π, Σ^{k})] ⩽ λ^{*} E [d_{⋆} (π, π^{*})] + W ⩽ 2 W .

E [k max λ_{k} d_{m e d -⋆} (π, Σ^{k})] ⩽ λ^{*} E [d_{⋆} (π, π^{*})] + W ⩽ 2 W .

u, q min

u, q min

x, y \in [n] \sum w_{x y}^{k} u_{y x} ⩽ q for all k \in [C]

u_{x y} \in {0, 1},

u_{x y} + u_{y x} = 1 for all i, j \in [n], i \neq = j

u_{x y} + u_{y z} + u_{z x} ⩾ 1 for all distincts x, y, z \in [n]

P_{v} (u)

P_{v} (u)

A_{v}^{k} (u)

B_{v}^{k} (u)

h_{x y} w_{y x} + h_{y x} w_{x y} ⩽ 2 (u_{x y} w_{y x} + u_{y x} w_{x y}),

h_{x y} w_{y x} + h_{y x} w_{x y} ⩽ 2 (u_{x y} w_{y x} + u_{y x} w_{x y}),

\sum h_{x z} h_{z y} w_{y x} ⩽ 2 \sum h_{x z} h_{z y} (u_{x y} w_{y x} + u_{y x} w_{x y}),

\sum h_{x z} h_{z y} w_{y x} ⩽ 2 \sum h_{x z} h_{z y} (u_{x y} w_{y x} + u_{y x} w_{x y}),

(1 - 2 u_{x y}) w_{y x} + (1 - 2 u_{y z}) w_{z y} + (1 - 2 u_{z x}) w_{x z} -

(1 - 2 u_{x y}) w_{y x} + (1 - 2 u_{y z}) w_{z y} + (1 - 2 u_{z x}) w_{x z} -

2 u_{x z} w_{z x} - 2 u_{y x} w_{x y} - 2 u_{z y} w_{y z} .

2 u_{x z} w_{z x} - 2 u_{y x} w_{x y} - 2 u_{z y} w_{y z} .

u^{*} = u \in R^{n} min k max \frac{λ _{k}}{m _{k}} g = 1 \sum m_{k} ∣∣ u - σ_{g}^{k} ∣ ∣_{1},

u^{*} = u \in R^{n} min k max \frac{λ _{k}}{m _{k}} g = 1 \sum m_{k} ∣∣ u - σ_{g}^{k} ∣ ∣_{1},

∣∣ π_{S} - σ ∣ ∣_{1} ⩽ ∣∣ π_{S} - u^{*} ∣ ∣_{1} + ∣∣ σ - u^{*} ∣ ∣_{1} ⩽ 2∣∣ σ - u^{*} ∣ ∣_{1} .

∣∣ π_{S} - σ ∣ ∣_{1} ⩽ ∣∣ π_{S} - u^{*} ∣ ∣_{1} + ∣∣ σ - u^{*} ∣ ∣_{1} ⩽ 2∣∣ σ - u^{*} ∣ ∣_{1} .

\frac{λ _{k} λ _{j}}{λ _{k} + λ _{j}} d_{⋆} (σ_{1}^{k}, σ_{1}^{j}) ⩽ W .

\frac{λ _{k} λ _{j}}{λ _{k} + λ _{j}} d_{⋆} (σ_{1}^{k}, σ_{1}^{j}) ⩽ W .

k \in [C] min j \in [C] / {k} max λ_{j} d_{⋆} (σ_{1}^{k}, σ_{1}^{j}) ⩽ λ_{k^{'}} d_{⋆} (σ_{1}^{\tilde{k}}, σ_{1}^{k^{'}})

k \in [C] min j \in [C] / {k} max λ_{j} d_{⋆} (σ_{1}^{k}, σ_{1}^{j}) ⩽ λ_{k^{'}} d_{⋆} (σ_{1}^{\tilde{k}}, σ_{1}^{k^{'}})

⩽ \frac{2 λ _{\tilde{k}} λ _{k^{'}}}{λ _{\tilde{k}} + λ _{k^{'}}} d_{⋆} (σ_{1}^{\tilde{k}}, σ_{1}^{k^{'}}) ⩽ 2 k, j max \frac{λ _{k} λ _{j}}{λ _{k} + λ _{j}} d_{⋆} (σ_{1}^{k}, σ_{1}^{j}) .

j \in [C] max d_{m i n -⋆} (π, Σ^{j}) = k \in [C] min σ_{i}^{k} \in Σ^{k} min j \in [C] / k max σ_{g}^{j} \in Σ^{j} min d_{⋆} (σ_{i}^{k}, σ_{g}^{j})

j \in [C] max d_{m i n -⋆} (π, Σ^{j}) = k \in [C] min σ_{i}^{k} \in Σ^{k} min j \in [C] / k max σ_{g}^{j} \in Σ^{j} min d_{⋆} (σ_{i}^{k}, σ_{g}^{j})

⩽ k \in [C] min j \in [C] / {k} max λ_{j} d_{⋆} (σ_{1}^{k}, σ_{1}^{j}) .

\tilde{Σ}^{j} = {σ \in Σ^{j} : d_{⋆} (σ_{i^{*}}^{k^{*}}, σ) = d_{min -⋆} (σ_{i^{*}}^{k^{*}}, Σ^{j})}

\tilde{Σ}^{j} = {σ \in Σ^{j} : d_{⋆} (σ_{i^{*}}^{k^{*}}, σ) = d_{min -⋆} (σ_{i^{*}}^{k^{*}}, Σ^{j})}

j \in [C] max λ_{j} d_{min -⋆} (π^{'}, Σ^{j}) ⩽ j \in [C] max λ_{j} d_{m e d -⋆} (π^{'}, \tilde{Σ}^{j})

j \in [C] max λ_{j} d_{min -⋆} (π^{'}, Σ^{j}) ⩽ j \in [C] max λ_{j} d_{m e d -⋆} (π^{'}, \tilde{Σ}^{j})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Game Theory and Voting Systems · Data Management and Algorithms

Full text

Multiclass MinMax Rank Aggregation

Pan Li and Olgica Milenkovic

ECE Department, University of Illinois at Urbana-Champaign

Email: [email protected], [email protected]

Abstract

We introduce a new family of minmax rank aggregation problems under two distance measures, the Kendall $\tau$ and the Spearman footrule. As the problems are NP-hard, we proceed to describe a number of constant-approximation algorithms for solving them. We conclude with illustrative applications of the aggregation methods on the Mallows model and genomic data.

I Introduction

Rankings, a special form of ordinal data, have received significant attention in the machine learning community as they arise in a number of important application domains, such as recommender systems, social voting and product placement platforms. Of particular importance are rankings of the form of linear orders (permutations) and partial rankings (weak orders), which are frequently obtained through conversion from ratings. One of the main processing tasks for rankings is rank aggregation, which often involves evaluating the median of a set of permutations or partial rankings under a suitably chosen distance function [2, 4, 7, 9, 11, 12, 16]. The median rank aggregation problem under the Kendall $\tau$ distance was introduced by Kemeny [11], and was proved to be NP-hard by Bartholdi et al. [4]. A number of approximation algorithms for the problem have been described in [2], mostly pertaining to permutations; a corresponding PTAS (polynomial time approximation scheme) was proposed in [12]. In the context of partial ranking aggregation, known solutions include the results of [1, 10]. Median aggregation under other distance functions has received less attention, one notable exception being the Spearman rank aggregation problem [7], which is known to provide a constant approximation for Kendall $\tau$ aggregation using a polynomial time algorithm based on weighted bipartite matching [9].

We propose to investigate a broad new family of rank aggregation problems in which the median is replaced by a minmax type of function and where the rankings are grouped in classes. More precisely, assume that there are $C\geqslant 1$ different classes of rankings and let $\Sigma^{k}=\{\sigma_{1}^{k},\sigma_{2}^{k},...,\sigma_{m_{k}}^{k}\}$ be the set of $m_{k}=|\Sigma^{k}|$ rankings belonging to the class labeled by $k\in[C]$ . Our minmax rank aggregation problem may be succinctly described as follows: Output a ranking $\pi$ that agrees in the minmax sense with the rankings belonging to the different classes. Rigorously, we seek to solve the following optimization problem:

[TABLE]

where $\lambda_{k}>0$ represent the costs of violating the agreement with rankings in class $k$ . In the above formulation, $d(\pi,\Sigma)$ stands for a distance between a ranking or partial ranking $\pi$ and a set of rankings $\Sigma^{k}$ , and it may be chosen to be of the form of a median distance (which equals the total sum of distances between $\pi$ and the elements of $\Sigma^{k}$ ) or a minimum distance (which equals the smallest distance between $\pi$ and an element in $\Sigma^{k}$ ). The above described MinMax problem is motivated by a number of applications in which classes of rankings arise due to different ranking criteria or properties of the ranking entities (social platforms) or due to prior knowledge of different similarity degrees in groups of rankings (genome evolution). The minmax criteria is typically used when trying to ensure that the aggregate violates each vote (class of votes) to roughly the same extent.

We start our analysis with the MinMax problem with $C=1$ and under the median and minimum distance, and then proceed to study the problem for the case of arbitrary values of $C$ and $m_{k}$ , $k=1,\ldots,C$ . For both the case of the Kendall $\tau$ as well as the Spearman footrule in the median and minimum distance setting, the MinMax problems may be shown to be NP-hard by using the corresponding results of [3]. In particular, the work in [3] outlines a general framework for proving NP-hardness results for the median, single class min-max-aggregation problem under different ranking distances. Nevertheless, only a handful of approximation algorithms were proposed even for this basic min-max-aggregation form: To the best of our knowledge, the only provable algorithm for the single class MinMax under the minimum distance measure was provided in [3]. The algorithm takes the form of the well studied ”pick-a-permutation” method, and tends to perform poorly in practice.

The main results of our work include families of constant approximation algorithm for the new, general family of multiclass MinMax problems, both under the median and minimum class distance, evaluated using the Kendall $\tau$ and Spearman footrule. Furthermore, we illustrate the use of the new aggregation paradigm on the problem of finding an ancestral genome arrangement for mitochondrial DNA under the tandem duplication model for genomes [6].

II Mathematical Preliminaries

Let $S$ denote a set of $n$ elements, which without loss of generality we set to $[n]\equiv\{{1,2,\ldots,n\}}$ . A ranking is an ordering of a subset of elements $Q$ of $[n]$ according to a predefined rule. When $Q=[n]$ , the resulting order is referred to as a permutation. When the rankings include ties, they are referred to as partial rankings [10].

More precisely, a permutation is a bijection $\sigma\,:\,[n]\rightarrow[n]$ , and the set of permutations over $[n]$ forms the symmetric group of order $n!$ , denoted by $\mathbb{S}_{n}$ . For any $\sigma\in\mathbb{S}_{n}$ and $x\in[n]$ , $\sigma(x)$ denotes the rank (position) of the element $x$ in $\sigma$ . We say that $x$ is ranked higher than $y$ (ranked lower than $y$ ) iff $\sigma(x)<\sigma(y)$ ( $\sigma(x)>\sigma(y)$ ). The inverse of a permutation $\sigma$ is denoted by $\sigma^{-1}:[n]\rightarrow[n]$ . Clearly, $\sigma^{-1}(t)$ represents the element ranked at position $t$ in $\sigma$ . Similarly, partial rankings [10] represent a mapping over $[n]$ in which there may exist two elements $x\neq y$ such that $\sigma(x)=\sigma(y)$ . It is common to use $\sigma(x)$ to denote the position of the element $x$ in the partial ranking $\sigma$ , and to define it as

[TABLE]

A number of distance functions between rankings were proposed in the literature [7, 10, 14]. One distance function counts the number of adjacent transpositions needed to convert a permutation into another. Adjacent transpositions generate $\mathbb{S}_{n}$ , i.e., any permutation $\pi\in\mathbb{S}_{n}$ can be converted into another permutation $\sigma\in\mathbb{S}_{n}$ through a sequence of adjacent transpositions [14]. The smallest number of adjacent transpositions needed to convert a permutation $\pi$ into another permutation $\sigma$ is termed the Kendall $\tau$ distance, denoted by $d_{\tau}(\pi,\sigma)$ . The Kendall $\tau$ distance between two permutations $\pi$ and $\sigma$ over $[n]$ also equals the number of pairwise inversions of elements of the two permutations:

[TABLE]

Another positional distance measure is the Spearman footrule,

[TABLE]

It can be shown that $d_{\tau}(\pi,\sigma)\leqslant d_{S}(\pi,\sigma)\leqslant 2d_{\tau}(\pi,\sigma)$ [7].

One may similarly define a generalization of the Kendall $\tau$ distance for partial rankings $\pi$ and $\sigma$ over the set $[n]$ . This distance is known as the Kemeny distance, and equals

[TABLE]

The Spearman footrule analogue for partial rankings [10] equals the sum of the absolute differences between “positions” of elements in the partial rankings,

[TABLE]

where positions are as defined in (II). The Spearman footrule distance for partial rankings is a $2$ -approximation for the Kemeny distance [10].

The notion of a distance between two rankings has an important extension in terms of a distance between a ranking and a set of rankings, which we refer to as rank-set distances. We focus our attention on two types of rank-set distances, defined below. For compactness, we use $\star$ to denote an arbitrary distance on pairs of rankings, but focus our attention throughout the paper on $\star\in\{\tau,S,K,prS\}$ .

Definition II.1.

Suppose that $\pi$ is a ranking and that $\Sigma$ is a set of rankings. Given a distance between two rankings $d_{\star}(\cdot,\cdot)$ , the median- $\star$ distance ( $med-\star$ ) between $\pi$ and $\Sigma$ equals

[TABLE]

Definition II.2.

Suppose that $\pi$ is a ranking and that $\Sigma$ is a set of rankings. Given a distance between two rankings $d_{\star}(\cdot,\cdot)$ , the min- $\star$ distance ( $min-\star$ ) between $\pi$ and $\Sigma$ is defined as

[TABLE]

We recall that the focal problem of this work is to find constant approximation algorithms for the MinMax rank aggregation problem, which reads as

[TABLE]

where $d(\pi,\Sigma^{k})$ is a $med-\star$ or $min-\star$ distance, with $\star\in\{\tau,S,K,prS\}$ . In our future analysis we use $\lambda^{*}\triangleq\max_{k}\lambda_{k}$ and $\mathcal{M}\triangleq\{k:\lambda_{k}=\lambda^{*}\}$ . Furthermore, we let $\pi^{*}$ denote the argument of the optimal solution of the MinMax problem and let $W=\max_{k}\lambda_{k}\,d(\pi^{*},\Sigma^{k})$ .

III Approximate MinMax Aggregation

As previously pointed out, the MinMax problem under both the $med-\star$ and $min-\star$ can be shown to be NP-hard using the results of [3], which established hardness for the special case $m_{k}=1$ and $d(\cdot,\cdot)$ a pseudometric. We hence focus on devising approximation algorithms for the MinMax problem.

III-A Permutations

We first consider ordinal data of the form of permutations. We show that a simple algorithm, which we term Pick-Rnd-Perm, can achieve a $2$ -approximation in expectation for the case of the $med-\star$ problem whenever $d_{\star}(\cdot,\cdot)$ is a pseudometric. Then, for $\star\in\{\tau,S\}$ , we describe two $2$ -approximation algorithms that use a combination of convex optimization and rounding procedures and offer significantly better empirical performance than random selection. Finally, we describe a $2$ -approximation algorithm for the $min-\star$ problems when $d_{\star}(\cdot,\cdot)$ is a pseudometric. The selection algorithm essentially transforms the $min-\star$ problem into a $med-\star$ problem: Thus, the algorithms developed for approximating multiclass $med-\star$ problems may be used to approximate corresponding instances of the $min-\star$ problem.

The Pick-Rnd-Perm Algorithm. Pick a permutation $\pi$ from $\cup_{k\in\mathcal{M}}\Sigma_{k}$ uniformly at random.

Theorem III.1.

For the $d_{med-\star}(\cdot,\cdot)$ distance, where $d_{\star}$ is a pseudometric, the Pick-Rnd-Perm algorithm produces a $2$ -approximation of the $med-\star$ problem.

Proof.

For a given $k$ ,

[TABLE]

By calculating the expectation, we obtain

[TABLE]

$\blacksquare$

Clearly, random selection may be improved by picking the optimal permutation from $\cup_{k\in\mathcal{M}}\Sigma_{k}$ instead. We term this approach Pick-Opt-Perm. Although the Pick-Rnd (Opt) -Perm algorithms are exceptionally simple and offer a $2$ -approximation to the optimal solution, they have a number of drawbacks, including the fact that the aggregate is a given ranking from the clusters, which violates fairness rules of aggregates, and that its empirical performance is typically very poor. To mitigate these problems, we propose more sophisticated aggregation algorithms for both the $med-\tau$ and $med-S$ problems.

Case I: $d=d_{med-\tau}$ . For $C=1$ , a well known method termed random pivoting proposed by Ailon et al. [1, 2] offers a $2$ -approximation in expectation for both the permutation and partial rank aggregation problem. In random pivoting, at each step, one element in the ranking is chosen uniformly at random and the remaining elements are partitioned based on the pairwise comparison with the pivot element. However, for the case of the MinMax problem with $C>1$ , random pivoting may be inadequate: The difficulty lies in the fact that rankings in different classes may lead to widely disparate pairwise pivot comparisons. Another problem in this context is that while one may achieve a constant approximation in expectation for each class individually, the largest cost among classes may not be bounded due to the exchange of the expectation and maximization operators. Therefore, instead of pivoting, one must resort to a different approach to the problem. Our approach is to deterministically round the fractional solution of a specific convex optimization problem. The deterministic rounding procedure is motivated by ideas in [16].

Let $w_{xy}^{k}\triangleq\frac{\lambda_{k}}{m_{k}}\sum_{i=1}^{m_{k}}\mathbf{1}\{\sigma_{i}^{k}(x)<\sigma_{i}^{k}(y)\}$ , where $\mathbf{1}$ stands for the indicator function, and let $w_{xx}^{k}=0$ for all $x,k$ . For a given ranking $\pi$ , also define the variables $u_{xy}\triangleq\mathbf{1}\{\pi(x)<\pi(y)\}$ . The MinMax problem may be stated as

[TABLE]

Note that if the rankings are permutations, then $w_{xy}^{k}+w_{yx}^{k}=\lambda_{k},$ which is a value that only depends on $k$ .

The above integer program may be relaxed to a linear program by allowing $u_{xy}$ to take fractional values. Upon solving the linear program, one needs to round the values of $u_{xy}$ . The next rounding procedure guarantees a $2$ -approximation.

Let $h_{xy}=\mathbf{1}_{u_{xy}\geqslant 1/2},$ if $x>y,$ and $h_{xy}=1-h_{yx},$ if $x<y$ . Let $v$ be a pivoting element for the rounding procedure and use $P_{v}(\mathbf{u})$ to denote the set of pairs of elements (excluding $v$ ) whose positions are determined by pivoting on $v$ . Define

[TABLE]

The rounding procedure makes iterative calls to the the following routine.

Theorem III.2.

The iterative application of the mmKT-Conv algorithm outputs a permutation with at most twice the cost of the optimal solution of the linear program (4).

At each iteration of rounding, $A_{v}^{k}(\mathbf{u})$ denotes the cost of rounding incurred by the class $k$ of rankings, while $B_{v}^{k}(\mathbf{u})$ denotes the associated cost of the linear program for class $k$ . Hence, the goal is to prove that for the given choice of the pivot $v$ , we have $A_{v}^{k}(\mathbf{u})\leqslant 2B_{v}^{k}(\mathbf{u})$ for all $k\in[C]$ . Suppose that $k^{\prime}$ is the index of the class that maximizes $\frac{A_{v}^{k}(\mathbf{u})}{B_{v}^{k}(\mathbf{u})}$ at the first step of mmKT-Conv. Then, it suffices to show that $A_{v}^{k^{\prime}}(\mathbf{u})\leqslant 2B_{v}^{k^{\prime}}(\mathbf{u})$ . This result is a corollary of the following lemma.

Lemma III.3.

$\sum_{v\in V}A_{v}^{k}(\mathbf{u})\leqslant 2\sum_{v\in V}B_{v}^{k}(\mathbf{u})$ , $\forall\,k\in[C]$ .

Proof.

To prove the claimed result, it suffices to prove that for any two distinct elements $x,y$ , one has

[TABLE]

and for any triple of distinct elements $x,y,z$ , one has

[TABLE]

where the summation is circular over all permutations of $x,y,z$ . Both summations are taken over all possible permutations of the two (three) elements in the argument.

The inequality (5) is easy to prove: Suppose that $h_{xy}=1$ . Then the sum on the left hand side equals $w_{yx}\leqslant 2u_{xy}w_{yx}$ which is bounded by the right hand side expression. To prove the inequality (6), consider the six variables associated with $x,y,z$ , namely $h_{xy},h_{yx},h_{xz},h_{zx},h_{yz},h_{zy}$ . These variables may be partitioned into two classes, $\{h_{xy},h_{zx},h_{yz}\}$ and $\{h_{yz},h_{xz},h_{zx}\}$ . There are at least three variables that are 0’s. Without loss of generality, suppose that the class $\{h_{xy},h_{zx},h_{yz}\}$ contains at least two 0’s.

Case 1: Assume that $h_{xy},h_{zx},h_{yz}=0$ . Then, the difference of the left and right hand side of the inequality under consideration equals

[TABLE]

The claimed result then follows from observing that $(1-2u_{xy})w_{yx}\leqslant u_{yx}(w_{yz}+w_{zx})$ .

Case 2: Assume that $h_{xy}=1,h_{zx},h_{yz}=0$ . The left hand side equals $w_{yx}\leqslant 2u_{xy}w_{yx}$ which is clearly bounded from above by the right hand side expression as $h_{xy}=1$ . $\blacksquare$

Case II: $d=d_{med-S}$ . When $C=1$ , the MinMax aggregation problem may be solved in polynomial time via weighted bipartite matching [9]. However, when $C>1$ , the problem is hard even if $m_{k}=1$ for all $k$ [3].

Step 1: If we remove the integral constraint on the position of elements in $\pi$ , the optimization problem of interest is convex and may be solved efficiently:

[TABLE]

where $||\mathbf{u}-\sigma_{g}^{k}||_{1}=\sum_{h\in[n]}|u(h)-\sigma_{g}^{k}(h)|$ .

Step 2 (mmSP-Conv): We assign positions to elements according to the fractional solution $\mathbf{u}^{*}$ as follows. If $u^{*}(x)<u^{*}(y)$ , we let $\pi(x)<\pi(y)$ for any two distinct elements $x,\,y,$ with ties broken randomly.

Theorem III.4.

mmSP-Conv rounding increases the cost of the convex optimization problem (7) at most twice.

Proof.

First, we claim that the output of mmSP-Conv, denoted by $\pi_{S}$ , is in $\Pi^{\prime}\triangleq\{\pi^{\prime}\in\mathbb{S}^{n}:||\mathbf{u}^{*}-\pi^{\prime}||_{1}=\min||\mathbf{u}^{*}-\pi||_{1}\}$ . This follows since for any ranking $\pi$ , if two elements $x,\,y\in[n]$ satisfy $\pi(x)>\pi(y)$ and $u^{*}(x)<u^{*}(y)$ , we may transpose $x$ and $y$ in $\pi$ to obtain a smaller $||\mathbf{u}^{*}-\pi||_{1}$ . Second, for an arbitrary permutation $\sigma$ , we have

[TABLE]

The claim follows by setting $\sigma=\sigma_{i}^{k},$ $i\in[m_{k}],$ $k\in[C]$ . $\blacksquare$

Note that the integrality gap of the problems (4) (7) is $2$ , as one may consider two equally weighted classes, each of which contains one single ranking, $(1,2,3,4,...)$ and $(2,1,3,4,...)$ , respectively. Hence, the best approximation constant via the use of $\mathbf{u}$ cannot be less than $2$ , which implies that the proposed rounding is optimal. One may expect to achieve a smaller approximation constant by outputting the better of the two results produced by Pick-Rnd-Perm and mmKT(SP)-Conv. This approach will be discussed in the full version of the paper.

We introduce next the min-Pick-Perm algorithm for solving the $d_{min-\star}$ problem.

Theorem III.5.

If $d_{\star}$ is pseudometric, then min-Pick-Perm is a $2$ -approximation algorithm for the $min-\star$ problems.

Proof.

By the definition of the $min-\star$ problem, each class contains at least one permutation, which without loss of generality we denote by $\sigma_{1}^{k}\in\Sigma^{k},$ $k\in[C]$ , that satisfies $\lambda_{k}d_{\star}(\pi^{*},\sigma_{1}^{k})\leqslant W$ . As $d_{\star}$ is pseudometric, we have

[TABLE]

Next, choose an arbitrary $\tilde{k}\in\mathcal{M}$ and let $k^{\prime}=\arg\max_{j\in[C]/\{\tilde{k}\}}\lambda_{j}d_{\star}(\sigma_{1}^{\tilde{k}},\sigma_{1}^{j})$ . Then,

[TABLE]

Moreover, the output $\pi$ of min-Pick-Perm satisfies

[TABLE]

The result follows by combining the above inequalities. $\blacksquare$

*Remark III.1**.*

Let $(i^{*},k^{*})$ be the optimal indices generated by min-Pick-Perm. Define $\tilde{\Sigma}^{k^{*}}=\{\sigma_{i^{*}}^{k^{*}}\}$ and let

[TABLE]

for $j\in[C]/\{k^{*}\}$ . A $c-$ approximate solution for the $med-\star$ problem with input $\{\tilde{\Sigma}_{k}\}_{k\in C}$ , denoted by $\pi^{\prime}$ , satisfies

[TABLE]

Hence, $\pi^{\prime}$ is a $2c-$ approximation for the original $min-\star$ problem. Therefore, convex optimization and rounding can be used on the $med-\star$ problem. We refer to these adapted algorithms as min-mmKT-Conv and min-mmSP-Conv.

III-B Partial rankings

All the algorithms proposed for permutation aggregation generalize to partial ranking aggregation. One may easily show that as long as the distance $d_{\star}$ defined for partial rankings is a pseudometric (e.g., $\star\in\{K,prS\}$ ), the $2$ -approximation guarantees for all previous methods hold. To get a fractional solution in the program of mmKT-Conv, we have to change the constraint (4) to

[TABLE]

which does not depend on the type of output ranking. Also, note that $w_{xy}^{k}$ for partial rankings does not satisfy the equality $w_{xy}^{k}+w_{yx}^{k}=\lambda_{k}$ , although the triangle inequality $w_{xy}+w_{yz}\geqslant w_{xz}$ still holds. As the proof of Theorem III.2 only requires the later inequality, the same rounding procedure offers a $2$ -approximation. Also, in the optimization problem (7) one has to use the definition $\sigma(x)$ for partial rankings.

IV Simulations

We compare the performance of three families of algorithms: Convex optimization procedures with rounding (mmKT-Conv, mmSP-Conv, min-mmKT-Conv, min-mmSP-Conv), permutation selection (Pick-Rnd-Perm, Pick-Opt-Perm, min-Pick-Perm) and algorithms used for traditional min-median rank aggregation (FASLP-Pivot [2] and SP-Matching [9]). The comparison shows that algorithms based on convex optimization yield significantly better results than naive selection methods, and that traditional aggregation algorithms are poor candidates for solving MinMax problems.

First, we evaluate the proposed algorithms on synthetic data. The synthetic data is generated based on what we call a two-level Mallows model: First, we generate the permutations $\{\sigma^{1},...,\sigma^{C}\}$ independently based on the Mallows distribution $\mathbb{P}(\sigma^{k})\propto\phi_{1}^{d_{\tau}(\sigma^{k},e)}$ [13]. Then, for each class $k\in[C]$ , we generate $m_{k}$ permutations $\sigma_{1}^{k},...,\sigma_{m_{k}}^{k}$ independently according to the Mallows distribution $\mathbb{P}(\sigma_{i}^{k})\propto\phi_{2}^{d_{\tau}(\sigma_{i}^{k},\sigma^{k})}$ . We set the number of classes to $C=3$ , fix $\phi_{2}=0.7$ and let each class contain $m_{k}=10$ permutations. To control the distance between different classes, we choose $\phi_{1}$ from $\{0.5,0.7,0.9,1.0\}$ . The objective function values for $100$ independent samples, obtained by different algorithms, are shown Table I.

Our next test example comes from evolutionary biology, and is concerned with Mitochondrial DNA (mtDNA) genome aggregation. The aggregate in this case corresponds to an ancestral genome. The most common used rearrangement distance between two nuclear genomes is based on reversals [15], but mitochondrial DNA rearrangement studies have also involved the Kendall $\tau$ distance [6]. In the latter case, the authors only considered the median problem $C=1$ , although the min-max problem is equally relevant [8, 3]. In our experiment, we used the mtDNA dataset from [5]. The dataset contains $11$ metazoan genomes with $36$ gene-blocks in some arrangement. We removed the “signs” of gene orders and let each genome represent one class, so that $C=11$ and $m_{k}=1$ for all $k$ ; we fixed $\lambda_{k}=1$ . Table II shows the results. Due to page limitations, we relegate the significantly more space consuming empirical study of weighted multiclass mtDNA aggregation to the extended version of the paper.

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Nir Ailon. Aggregation of partial rankings, p-ratings and top-m lists. Algorithmica , 57(2):284–300, 2010.
2[2] Nir Ailon, Moses Charikar, and Alantha Newman. Aggregating inconsistent information: ranking and clustering. Journal of the ACM (JACM) , 55(5):23, 2008.
3[3] Christian Bachmaier, Franz J Brandenburg, Andreas Gleißner, and Andreas Hofmeier. On the hardness of maximum rank aggregation problems. Journal of Discrete Algorithms , 31:2–13, 2015.
4[4] John Bartholdi III, Craig A Tovey, and Michael A Trick. Voting schemes for which it can be difficult to tell who won the election. Social Choice and welfare , 6(2):157–165, 1989.
5[5] Guillaume Bourque and Pavel A Pevzner. Genome-scale evolution: reconstructing gene orders in the ancestral species. Genome research , 12(1):26–36, 2002.
6[6] Kamalika Chaudhuri, Kevin Chen, Radu Mihaescu, and Satish Rao. On the tandem duplication-random loss model of genome rearrangement. In Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm , pages 564–570. Society for Industrial and Applied Mathematics, 2006.
7[7] Persi Diaconis and Ronald L Graham. Spearman’s footrule as a measure of disarray. Journal of the Royal Statistical Society. Series B (Methodological) , pages 262–268, 1977.
8[8] Liviu P Dinu and Radu Ionescu. An efficient rank based approach for closest string and closest substring. P Lo S One , 7(6):e 37576, 2012.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Multiclass MinMax Rank Aggregation

Abstract

I Introduction

II Mathematical Preliminaries

Definition II.1**.**

Definition II.2**.**

III Approximate MinMax Aggregation

III-A Permutations

Theorem III.1**.**

Proof.

Theorem III.2**.**

Lemma III.3**.**

Proof.

Theorem III.4**.**

Proof.

Theorem III.5**.**

Proof.

Remark III.1*.*

III-B Partial rankings

IV Simulations

Definition II.1.

Definition II.2.

Theorem III.1.

Theorem III.2.

Lemma III.3.

Theorem III.4.

Theorem III.5.

*Remark III.1**.*