Exact Recovery with Symmetries for the Doubly-Stochastic Relaxation
Nadav Dym

TL;DR
This paper analyzes the conditions under which convex relaxations for graph matching with symmetries are exact, focusing on the role of symmetry groups and providing algorithms for recovering graph isomorphisms.
Contribution
It characterizes when convex relaxations are exact for symmetric graph matching problems and introduces algorithms for retrieving isomorphisms in these cases.
Findings
Convex exactness depends on the symmetry group of the graphs.
For reflective symmetry groups with at least one full orbit, convex exactness holds almost everywhere.
The proposed algorithms effectively retrieve isomorphisms when convex exactness is satisfied.
Abstract
Graph matching or quadratic assignment, is the problem of labeling the vertices of two graphs so that they are as similar as possible. A common method for approximately solving the NP-hard graph matching problem is relaxing it to a convex optimization problem over the set of doubly stochastic (DS) matrices. Recent analysis has shown that for almost all pairs of isomorphic and asymmetric graphs, the DS relaxation succeeds in correctly retrieving the isomorphism between the graphs. Our goal in this paper is to analyze the case of symmetric isomorphic graphs. This goal is motivated by shape matching applications where the graphs of interest usually have reflective symmetry. For symmetric problems the graph matching problem has multiple isomorphisms and so convex relaxations admit all convex combinations of these isomorphisms as viable solutions. If the convex relaxation does not admit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Exact Recovery with Symmetries for the Doubly-Stochastic Relaxation
Nadav Dym
Weizmann Institute of Science
Abstract
Graph matching or quadratic assignment, is the problem of labeling the vertices of two graphs so that they are as similar as possible. A common method for approximately solving the NP-hard graph matching problem is relaxing it to a convex optimization problem over the set of doubly stochastic (DS) matrices. Recent analysis has shown that for almost all pairs of isomorphic and asymmetric graphs, the DS relaxation succeeds in correctly retrieving the isomorphism between the graphs. Our goal in this paper is to analyze the case of symmetric isomorphic graphs. This goal is motivated by shape matching applications where the graphs of interest usually have reflective symmetry.
For symmetric problems the graph matching problem has multiple isomorphisms and so convex relaxations admit all convex combinations of these isomorphisms as viable solutions. If the convex relaxation does not admit any additional superfluous solution we say that it is convex exact.
We show that convex exactness depends strongly on the symmetry group of the graphs; For a fixed symmetry group , either the DS relaxation will be convex exact for almost all pairs of isomorphic graphs with symmetry group , or the DS relaxation will fail for all such pairs. We show that for reflective groups with at least one full orbit convex exactness holds almost everywhere, and provide some simple examples of non-reflective symmetry groups for which convex exactness always fails.
When convex exactness holds, the isomorphisms of the graphs are the extreme points of the convex solution set. We suggest an efficient algorithm for retrieving an isomorphism in this case. We also show that the ”convex to concave” projection method will also retrieve an isomorphism in this case, and show experimentally that this projection method as well as the standard Euclidean projection will succeed in retrieving an isomorphism for near isomorphic graphs as well.
In certain cases it is sufficient to find the centroid of the set of isomorphisms, which gives a ”fuzzy encoding” of the symmetries of the shape. We show that for any symmetry group , the centroid solution can be recovered efficiently for almost all pairs of isomorphic graphs with symmetry group . Additionally we show that for such isomorphic graphs interior-point solvers will generally return the centroid solution.
1 Introduction
Graph matching and graph isomorphism are classical problems in computer science. In this paper we will use the term graph for a pair , where are the vertices of the graph, and is a symmetric matrix encoding the relationship between the vertices. We will also sometimes refer to alone as a graph. An isomorphism between graphs and is a relabeling of the vertices of so that and the relabeled are identical. The graphs and are isomorphic if there is an isomorphism between them. In matrix notation, an isomorphism is a permutation matrix such that or equivalently . The problem of deciding whether two graphs are isomorphic is known as the Graph isomorphism problem (GI). It is not known to be in P, but is also not known to be NP-hard. Recently [Babai, 2016] provided a quasi-polynomial time algorithm for GI. While no polynomial algorithm for the general GI problem is known, there are many families of graphs for which GI can be solved in polynomial time. One example which is relevant for this work is graphs with simple spectrum, or more generally bounded eigenvalue multiplicity [Babai et al., 1982].
The graph matching problem is the problem of determining how close two graphs are to being isomorphic by minimizing the graph matching energy over the set of permutation matrices which we denote by :
[TABLE]
This optimization problem is also often referred to as the Koopmans-Beckmann quadratic assignment problem, and is usually phrased as the equivalent problem of maximizing . In contrast to GI whose computational status is not fully known, global minimization of quadratic assignment, and even approximation to within a constant factor, is known to be NP-hard [Sahni and Gonzalez, 1976].
Graph matching problems have found many applications. See for example [Conte et al., 2004] for a survey on applications of graph matching for pattern recognition. Our work is motivated by shape matching applications: Shape matching is the problem of measuring how similar two given surfaces are. The notion of similarity between shapes is required to be invariant to shape preserving deformations such as rigid transformations for rigid objects (e.g., chairs), and deformations which preserve geodesic distances for non-rigid objects (e.g., humans). Accordingly shape matching problems are often modeled (e.g., [Mémoli, 2007, Mémoli, 2011, Solomon et al., 2016]) as the problem of finding a mapping between two surfaces so that they are as isometric as possible. The metric on the shapes is typically either the extrinsic Euclidean metric for rigid shapes, or the intrinsic geodesic metric for non-rigid shapes.
Finding near-isometries between shapes can be phrased as a graph matching problem by selecting a finite sampling of the shapes to obtain vertices on the two shapes, and taking to be the distance matrices defined by the distances on the shapes, that is
[TABLE]
In this setting an isomorphism between and corresponds to an isometry between the sampled metric spaces.
In this work we will focus on symmetric graphs, which are very relevant for shape matching applications since most natural shapes have intrinsic symmetries- that is, intrinsic isometries from the shape to itself other than the trivial identity mapping. Figure 1 shows some representative shapes from the [Giorgi et al., 2007] shape matching dataset. Typically natural shapes have a symmetry group with only two elements (bilateral symmetry) as in the left hand side of Figure 1, but there are interesting examples with larger symmetry groups as in the right hand side of Figure 1.
The doubly-stochastic relaxation
In this paper we focus on analyzing the doubly-stochastic (DS) relaxation for graph matching. For a survey on other convex relaxation and combinatorial methods which have been proposed to achieve good solutions for quadratic assignment see [Loiola et al., 2007].
The doubly stochastic (DS) relaxation replaces the NP hard graph matching problem with a tractable optimization problem by relaxing the combinatorial set of permutations to its convex hull of doubly stochastic matrices:
[TABLE]
which leads to a convex quadratic program known as the DS relaxation:
[TABLE]
We will refer to this optimization problem as . Since the DS relaxation minimizes over a larger domain, its minimum value is a lower bound for the minimal value of the graph matching problem. As can be expected due to the hardness of the problem, the DS relaxation does not generally return the global minimum or minimizer of (1) [Lyzinski et al., 2016]. In particular, [Scheinerman and Ullman, 2011] characterizes all cases in which the minimum of (2) is zero even when the graphs are not isomorphic.
We will be interested in the case where and are isomorphic. Note that in this case the global minimum of (2) is zero and thus coincides with the global minimum of (1). The interesting question is whether the DS relaxation succeeds in returning a minimizer which is a permutation. Clearly we do not expect this will be the case for all graphs since this would provide us with a polynomial time algorithm to solve GI. On the other hand, since there are many families of graphs for which GI is tractable, we can hope that for many instances the DS relaxation will be successful in returning a permutation solution. The recent works of [Aflalo et al., 2015, Fiori and Sapiro, 2015] show that indeed this is the case. To state their results we introduce some notation:
Let us denote the set of isomorphisms of by . We will say that is a convex isomorphism if it is a member of the set
[TABLE]
The inclusion
[TABLE]
is obvious. However it is possible that the DS relaxation will contain additional minimizers. We will say that is exact when this possibility does not occur and
[TABLE]
We note that the exactness property depends only on : An isomorphism defines a linear bijection
[TABLE]
from to and from to . Accordingly if are isomorphic, is exact if and only if is exact. We will refer to (convex) isomorphisms in the case as (convex) automorphisms. We also denote:
[TABLE]
We say that is an asymmetric graph if the identity matrix is its only automorphism. Otherwise we say that is a symmetric graph. A necessary condition for exactness of is that is asymmetric. This is because if has several automorphisms then due to the inclusion (3) and the convexity of
[TABLE]
Thus, while has a finite number of automorphisms, it has an infinite number of convex automorphisms. Even when is asymmetric, exactness does not always occur. A simple counter example will be discussed in Section 2. However, [Aflalo et al., 2015] showed that for asymmetric satisfying certain weak conditions exactness will hold. Their result was later shown to hold with even weaker conditions in [Fiori and Sapiro, 2015].
Convex exactness
Our goal in this paper is to show that for certain kinds of symmetry groups the DS relaxation can still be successfully applied, by defining a suitable notion of convex exactness. A similar goal has recently been achieved by [Dym and Lipman, 2016] for a semi-definite programming relaxation of the Procrustes matching problem.
We say that is convex exact if equality holds in (4), or equivalently if for any isomorphic to ,
[TABLE]
Note that for asymmetric graphs, convex exactness and exactness coincide. When convex exactness holds an isomorphism can be extracted in a tractable manner as we will discuss in Section 6.
For every permutation subgroup we define
[TABLE]
In the asymmetric case we know that there are such that is not (convex) exact, but also that (convex) exactness often does hold for asymmetric graphs. Our goal is to give a more precise notion of this claim by showing that for almost all asymmetric graphs (convex) exactness holds. More importantly, we would like to find non-trivial groups for which will be convex exact for almost every . To do so we must first define a natural measure on .
We will assume that is non-empty. Permutation groups , for which is empty do exist. A simple example is the cyclic group generated by the permutation
[TABLE]
Any satisfies
[TABLE]
Thus, all diagonal elements of are identical and all off-diagonal elements of are identical as well since . It follows that . If is non-empty we say that is a symmetry group.
For a symmetry group we consider the vector space
[TABLE]
Since is a vector space of some dimension it has a natural notion of measure- the dimensional Hausdorff measure on restricted to , or equivalently the push forward of the Lebesgue measure on to via a linear isometry between the two spaces. We denote this measure by . Note that
[TABLE]
Since by assumption is non-empty it follows that all the are strict subspaces of and therefore the complement of in has measure zero. Thus is a natural choice for a measure on . We will say that a property is generic, or that it holds for almost every , if it holds for almost every .
We can now state our main results:
1.1 Main results
Reflective groups
We show that convex exactness is a generic property for groups fulfilling the following two conditions:
Definition 1**.**
We say that is a reflection group if for all .
Any group defines an action on the set of vertices . We denote the orbit of by . In general we have that .
Definition 2**.**
We say that has a full orbit if it has an orbit of length .
In shape matching applications the full orbit assumption is typically fulfilled; an orbit will be full unless is on a symmetry axis of the shape. Under the full orbit and reflection group assumption, we prove:
Theorem 1**.**
Assume is a reflection group with a full orbit. Then the DS relaxation is convex exact with respect to almost all .
As a result we obtain that convex exactness is a generic property for the simplest but, in the context of shape matching applications, most important, symmetry groups:
Corollary 1**.**
If then the DS relaxation is convex exact with respect to almost all .
General groups
For general groups we provide a ”zero-one probability” result:
Theorem 2**.**
For any symmetry group one of the following holds:
The DS relaxation is convex exact with respect to almost every . 2. 2.
The DS relaxation is not convex exact for any .
The proof of Theorem 2 is constructive in the sense that it enables checking which of the two mutually exclusive alternatives described in the theorem hold for a given symmetry group . By using this strategy we can establish that there are quite simple non-reflective symmetry groups for which convex exactness fails. Figure 2 shows nine groups represented by nine shapes whose symmetry group is . For the first three groups (a)-(c) we found that convex exactness does not hold for any , while for the remaining groups convex exactness does hold for almost all . Note that all groups in the first column are isomorphic to , all groups in the second column are isomorphic to , and all groups in the last column are isomorphic to the dihedral group . Thus we see that while convex exactness is a generic property for any isomorphic to , in general different permutation groups can behave very differently with respect to the DS relaxation even if they are isomorphic in the sense of group theory.
Additional results: Permutation solutions and centroid solution
Since for symmetric problems the DS relaxation has an infinite number of convex isomorphisms, the question of achieving an ”interesting” convex isomorphism arises. Naturally we would like to achieve a convex isomorphism which is a permutation. In the case of convex exactness this reduces to the problem of finding an extreme point (a ”corner”) of the set of convex isomorphisms, which is known to be a tractable problem. In Section 6 we describe two known methods to obtain extreme points. Additionally we provide a much faster algorithm for achieving all isomorphisms between and . This algorithm is valid for almost all graphs whose symmetry group satisfy the conditions of Theorem 1.
A disadvantage of the methods mentioned above for finding permutation solutions is that they are constructed for perfectly isomorphic problems and are not suited for near isomorphic problems and are not used in practice. Instead, permutations are typically obtained using the projection or the more accurate, but more expensive, ”convex to concave” projection. We prove that when convex exactness holds, the convex to concave projection is able to return an isomorphism. For symmetric problems with a small amount of noise, we show experimentally that both projection methods are generally able to retrieve an isomorphism, and the convex to concave method is often able to retrieve an isomorphism for higher noise levels as well.
An alternative ”interesting” convex isomorphism which is easier to find than ”corners” is the ”centroid” of the set of isomorphisms:
[TABLE]
where is the set of isomorphisms between and . As advocated in [Solomon et al., 2012], finding in the case of symmetric problems can potentially be useful as it gives an ”encoding” of all isomorphisms of . In Section 5 we show that the centroid solution is easier to find than corner solutions: In fact, for any symmetry group , and almost every pair of isomorphic graphs , the centroid solution can be achieved (almost always) for any symmetry group. Additionally we show that for such penalty based optimization methods will converge to when solving .
An illustration of the centroid solution is shown in Figure 3, for the problem of mapping a cylinder to itself, using as the Euclidean distance matrix of the cylinder. The right part of the figure shows the cylinder, colored so that points in the same orbit of the symmetry group of the cylinder share the same color. The left part of the figure shows the matrix . Each yellow square in the left figure is a submatrix whose indices correspond to a circular section of the cylinder. It can be seen that the centroid solution assigns each point of the cylinder with equal probability to any other point in its orbit. We note that the centroid solution always has this property. Therefore different symmetry groups which have identical orbits will have the same centroid solution, and so the symmetry group cannot generally be reconstructed from the centroid solution.
The remainder of the paper is organized as follows: In Section 2 we define the notion of weak exactness which will be useful for the proofs presented later on. In Section 3 we prove convex exactness for reflective groups (Theorem 1). In Section 4 we prove our ”zero-one probability result” (Theorem 2) and explain how to check which of the two alternatives described in the theorem apply for a given group. In Section 5 we discuss the issue of retrieving the centroid solutions and finally in Section 6 we discuss the issue of retrieving isomorphisms in the case that convex exactness holds.
2 Weak exactness
An important tool for the proofs we present later on is the concept of weak exactness which we will now define: For any set of permutation matrices we define
[TABLE]
We say that is weakly exact if all convex automorphisms of are in . Less formally, this means that the coordinate of a convex automorphism can be positive only if there is an automorphism taking to . If is weakly exact and is isomorphic to then all convex isomorphisms of are in . Weak exactness is guaranteed with full probability for any symmetry group . We show this using the vector
[TABLE]
The vector is invariant under automorphisms, meaning that if then . We say that is discriminative if for any such that , we have . We prove
Theorem 3**.**
Let be any symmetry group. Then
If is discriminative then is weakly exact. 2. 2.
For almost every , the vector is discriminative.
We prove Theorem 3. We first prove that if is discriminative then is weakly exact. If is a convex isomorphism, then is fixed by because
[TABLE]
Write as a convex combination of permutations . Using the fact that the operator norm of a permutation is one and the Cauchy-Schwartz inequality we obtain
[TABLE]
so is an equality, implying that for all . Now if then for all and therefore . Thus we have proven that is weakly exact when is discriminative.
We now show that discriminativeness is a generic property. It is sufficient to show that for almost every the claim holds since is a subset of . Note that is discriminative unless there are some such that is not in but is in the vector space
[TABLE]
Thus it is sufficient to show that all these spaces are strict subspaces of , which we accomplish by finding a member for which is discriminative.
To construct let be the partition of the vertices induced by the action of . For all and we set
[TABLE]
If we set . The constructed graph is a member of , and the vector is discriminative since for all and we have . This concludes the proof of Theorem 3.
Counter example
While weak exactness is guaranteed almost everywhere, it can still fail in very simple examples. Such examples can be constructed using the fact that if 1 is an eigenvector of then is always a valid convex automorphism. For example the graph
[TABLE]
is asymmetric, but satisfies for and so is a convex isomorphism, and so weak exactness, and certainly exactness, does not hold.
One method for overcoming such counter examples is adding a linear term to the graph matching energy penalizing for correspondences which do not respect isomorphism-invariants. For example, For each vertex we can define to be the sorted values of the -th row of the graph. Clearly if then so is an isomorphism invariant. Since in our example are all distinct, the only zero-energy solution of the modified relaxation
[TABLE]
is the identity matrix.
3 Convex exactness for reflective groups
Our goal in this section is proving convex exactness holds generically for reflective groups with a full orbit (Theorem 1). We break up the proof of the theorem into two parts: The first part establishes sufficient conditions which guarantee exact recovery, and the second part proves these sufficient conditions hold generically if is reflective and has a full orbit.
Fix some and . As in the previous section let be the partition of the vertices induced by the action of , and denote . Let be a convex automorphism of . Denote by and the submatrices of corresponding to the indices . If is weakly exact then whenever and so the equation takes the form
[TABLE]
3.1 Sufficient conditions for convex exactness
Proposition 3.1**.**
Let be a graph. If such that
The vector is discriminative. 2. 2.
* for all .* 3. 3.
* has simple spectrum.*
then is convex exact.
Proof of Proposition 3.1.
The first condition guarantees weak exactness, and thus that all convex isomorphisms will satisfy (8). Setting in this equation we obtain
[TABLE]
and so by the second condition is determined uniquely by . It follows that the restriction of the linear map
[TABLE]
to is injective. Therefore it is sufficient to show that is a convex combination of the permutation matrices obtained by restricting the automorphisms to . By taking in (8) we see that is a convex automorphism of the subgraph , and the group
[TABLE]
is a subgroup of which acts transitively on . By the third assumption has simple spectrum. Thus to show is a convex combination of elements of it is sufficient to prove
Lemma 1**.**
If is a graph with simple spectrum, and acts transitively on the vertices , then and is convex exact.
We now conclude the proof of the proposition by proving the lemma. In this proof denotes the -th column of the matrix , and denotes the -th row of .
To prove the lemma it is sufficient to show that
[TABLE]
because this implies that
[TABLE]
which proves that is convex exact. Additionally all automorphisms are in , and since permutations are extreme points of DS this can only occur if , and so .
We prove (9) using an argument from [Dym and Lipman, 2016]; If is a convex automorphism of and is an eigenvector of with eigenvalue , Then
[TABLE]
so either or is an eigenvector of with eigenvalue . Since has simple spectrum it follows that for some . If is an automorphism of then .
Let be a convex automorphism of . We want to show that . Since acts transitively on , there are permutation matrices such that for any vector
[TABLE]
Denote by the matrix whose columns are the eigenvectors of . Then there is a diagonal matrix and diagonal matrices such that
[TABLE]
Note that is a convex combination of if and only if is a convex combination of . If is the -th eigenvector of then
[TABLE]
so in particular has no zero coordinates.
From (10) we obtain
[TABLE]
and therefore
[TABLE]
Since all entries of are non-zero the only diagonal matrix solving the equation above is the zero matrix. Thus we obtain as a convex combination of :
[TABLE]
∎
3.2 Genericity of the sufficient conditions
In this subsection we prove the sufficient conditions of Proposition 3.1 hold generically if is reflective and has a full orbit. The first condition was proved to hold generically for any symmetry group in Theorem 3. We choose the appearing in the last two conditions of of Proposition 3.1 such that is a full orbit, or equivalently . We begin with some preliminaries.
Preliminaries
Recall that for a symmetry group of dimension , the measure can be defined as the restriction to of the -dimensional Hausdorff measure on . We cite some basic properties of the Hausdorff measure and dimension from chapter 2 in [Falconer, 2004] which will be helpful for the proof of Lemma 3.
If has Hausdorff dimension and , then . 2. 2.
If is a submanifold of dimension , then its Hausdorff dimension is as well. 3. 3.
If and is countable, then . 4. 4.
If then . 5. 5.
If and is Lipschitz, then .
An immediate consequence is that the latter inequality holds if is a function defined on all of . To see this denote and note that the restriction of to is Lipschitz. Therefore
[TABLE]
For Lemma 4 we will need the following simple lemma. We include a proof for completeness:
Lemma 2**.**
If is a non-zero multivariate polynomial , then the set has Lebesgue measure zero.
Proof.
By induction. For the claim is obvious. We assume the claim holds for and show it holds for . Rewrite as
[TABLE]
By the induction hypothesis the set
[TABLE]
has measure zero in . For any fixed in the complement of , is a univariate non-zero polynomial and has zeros in a (finite) subset of of measure zero. Using Fubini’s theorem this implies that the set has Lebesgue measure zero. ∎
Proof of genericity
A graph is in if it is symmetric and (8) is satisfied when is replaced with all permutations . This means that where
[TABLE]
and for
[TABLE]
Thus to prove the second condition is generic it is sufficient to show that almost every has simple spectrum, and that to prove the third condition is generic we need to show that almost every has full rank. Thus the second condition follows by setting and in the following Lemma:
Lemma 3**.**
If is reflective then almost all has simple spectrum.
Proof.
Since is reflective all satisfy
[TABLE]
Members commute because
[TABLE]
Thus can be diagonalized simultaneously, and so we can partition into a direct sum of eigenspaces . We denote the dimension of each eigenspace by . Select for each subspace a matrix whose columns form an orthogonal eigenbasis of , and denote
[TABLE]
A graph is in if and only if it is symmetric and it commutes with the members of . This in turn occurs if and only if and all members of can be diagonalized simultaneously, and so there are symmetric matrices such that
[TABLE]
It follows that can be identified with , and is thus of dimension
[TABLE]
For we define to be the diagonal matrix whose diagonal entries are , and define
[TABLE]
Now consider the function defined by
[TABLE]
The image of is precisely . Moreover the dimension of the domain of is
[TABLE]
The complement of the set of graphs with simple spectrum is a union of sets of the form where
[TABLE]
Since the dimension of each such set is strictly smaller than the dimension of the domain we obtain:
[TABLE]
and so the complement of the set of graphs with simple spectrum is dimension deficient and thus has zero Hausdorff measure. ∎
We now prove the third condition holds generically.
Lemma 4**.**
If has a full orbit then almost every has full rank.
Proof.
In this proof we denote members of the vector space by . We identify this vector space with for some via a linear isomorphism , and define a multivariate polynomial by
[TABLE]
Note that if and only if has full rank. Thus due to Lemma 2 it is sufficient to show that isn’t identically zero, or equivalently, establish the existence of a full rank matrix in . We now construct such a matrix which we will denote by .
Note that if and only if
[TABLE]
This means that the values of are required to be constant along the orbits of the action of the group on defined by
[TABLE]
We choose some orbit of this action and define by the requirement that if are member of this orbit, and otherwise . By construction and it remains to verify that it has full rank. The orbit has elements where the are all distinct, and we can order the orbit so that the first elements of the sequence are distinct as well. Thus
[TABLE]
and so . ∎
4 Almost all or nothing
In this section we prove Theorem 2; we show that for any symmetry group , either the DS relaxation is never convex exact for any , or the DS relaxation is convex exact for almost every . We then explain how generic convex exactness can be established/refuted for a given group .
Our proof uses another notion of exactness which we will call affine exactness: The affine automorphisms of a graph are the members of the affine set
[TABLE]
We note that affine automorphisms and convex automorphisms differ in two aspects: On the one hand the entries of convex automorphisms are required to be non-negative while the entries of affine automorphisms are not. On the other hand, affine automorphisms must be members of , a requirement we do not impose on convex automorphisms (although by Theorem 3 convex automorphisms will ”usually” satisfy this property).
We say that affine exactness holds at if
[TABLE]
We begin by establishing a connection between affine exactness and convex exactness:
Proposition 4.1**.**
For any graph , convex exactness holds at if and only if affine exactness holds at and
[TABLE]
Note that the RHS of (13) is always contained in the LHS.
Proof.
If affine exactness and (13) hold then
[TABLE]
so convex exactness holds as well.
Now assume convex exactness holds. Then (13) holds since
[TABLE]
To show affine exactness holds we choose some and show that . The ”centroid” convex automorphism is non-zero in all coordinates except for coordinates which satisfy for all automorphisms . Since at such coordinates is also zero, there is some such that
[TABLE]
is doubly stochastic. is also an affine automorphism, and thus is a convex automorphism. By assumption is a convex combination of members of . In particular and therefore so is
[TABLE]
∎
Note that the condition (13) depends only on and not on a specific choice of . Thus if the condition does not hold then will not be convex exact for any . If is such that the condition does hold then convex exactness for specific is equivalent to affine exactness. Thus Theorem 2 follows from the following proposition:
Proposition 4.2**.**
Let be a symmetry group. Then either affine recovery holds for almost every , Or affine recovery fails for all .
Proof of Proposition 4.2.
As in the proof of Lemma 4 we show the set of for which affine recovery fails is a null set of a suitable multivariate polynomial.
For given , the affine automorphisms of are the matrices satisfying the affine equations defining . Denoting the map which identifies matrices with vectors by , these equations can be written in the form
[TABLE]
where depends linearly on . Since , all members of are solutions of (14). Thus the kernel of always includes
[TABLE]
and affine exactness holds iff . Let
[TABLE]
be a unitary matrix, such that forms an orthonormal basis of . Then affine exactness holds iff has full rank. Now pick some linear isometry from to the vector space . affine exactness holds at iff is not a zero of the multivariate polynomial
[TABLE]
This concludes the proof of the proposition, due to Lemma 2. ∎
Checking exactness for given groups
We now explain how we check whether convex exactness holds generically for the groups defined by the shapes in Figure 2. We first note that condition (13) holds for these groups. This is because all contain a full orbit. If has a full orbit and is an affine combination of members of , then the coefficients of the affine combination are just the values of the column . In particular if is doubly stochastic then the affine combination is in fact a convex combination since is a probability vector.
Since (13) holds we have generic convex exactness for if and only if the polynomial is a non-zero polynomial. This can be checked either by computing the polynomial symbolically or by evaluating it on random input. We used the latter method: For each of the groups we generated random graphs in and evaluated the polynomial on these graphs. For groups all evaluations of the polynomial were zero, and for the remaining graphs the polynomial was found to be non-zero at all evaluated points. These results are summarized in the first column of Table 1.
5 Centroid
Recall that the centroid solution is the matrix obtained by averaging over all members of as defined in 6. In this section we show that for any symmetry group and almost every the centroid solution can be recovered efficiently. We also show that is the solution which will be obtained by interior-point methods when solving .
We begin by giving an explicit construction of . Let us denote as before the equivalence classes of the action of on by . Assume that the vertices are arranged so that
[TABLE]
Set , and for any integer let be the constant matrix whose entries are all . Note that is in and is invariant under multiplication from the left or right by elements of . The only doubly stochastic matrix satisfying these properties is
[TABLE]
and therefore . Recall that for any symmetry group and almost every the vector is discriminative. In this case the centroid solution can be easily computed without even solving the DS relaxation: We first construct a matrix by setting if and otherwise. We can then obtain by normalizing the rows of .
Interior point algorithms
Interior point algorithms solve (2) by solving problems of the form
[TABLE]
and taking to obtain a solution for (2). The function is chosen so that it explodes at the boundary, and so the constraints will never be active in (15). A common choice [Wright and Nocedal, 1999] for is . Specialized solvers for (2) such as [Rangarajan et al., 1996, Solomon et al., 2016] often use . Note that while this does not explode at the boundary, its derivatives do.
To include both choices of , is well as other possible choices, we will deal with general which are of the form
[TABLE]
where is continuous and strictly convex and if .
Theorem 4**.**
Let be a symmetry group, and be a function satisfying the conditions described previously. Then for almost every the unique minimizers of (15) converge to as tends to zero.
Proof.
We assume that is weakly exact. This assumption holds for almost every . By passing to a subsequence we can assume that converges to some in the compact set DS. We need to show that .
We note that for any
[TABLE]
Since is the unique minimizer of this equality implies that is invariant under multiplication by elements of from the right and the left. Thus this is true for as well. Due to continuity is bounded from below and so it can be shown that
[TABLE]
It follows that is a convex automorphism and since is weakly exact . Since we also showed to be invariant under multiplication by from the left and right it follows that .
∎
6 Retrieving isomorphisms
In this section we discuss how convex exactness can be used to retrieve isomorphisms. We will discuss two classes of methods. The first class searches for extreme points of the convex set of convex isomorphisms. We will show that under the assumptions of Proposition 3.1 all isomorphisms of the graphs can be retrieved quite efficiently. However finding extreme points is not a stable methods for retrieving isomorphisms once noise is introduced. This leads to the second class of methods, which we call projection methods. Projection methods are the methods typically used in practice to achieve a permutation solution from the original solution of the DS relaxation. We show theoretically that the popular ”convex to concave” projection method is able to retrieve a correct isomorphism, and explore experimentally the behavior of this method as well as the projection method when noise is introduced.
6.1 Finding extreme points
When convex exactness holds, finding an isomorphism is reduced to the problem of finding an extreme point of the optimal set defined by the linear constraints
[TABLE]
An extreme point(=basic feasible solution) of this linear feasibility program can be found using the simplex algorithm. Extreme points can also be found using interior point algorithms by optimizing a random linear energy over . In [Dym and Lipman, 2016] a similar problem is discussed, and it is shown that if the linear energies are randomly drawn from the uniform distribution on , then with probability one the obtained linear program will have a unique solution, which will be an extreme point. Moreover all extreme points will be obtained with equal probability.
Table 1 shows the successfulness of the latter method in returning isomorphisms for symmetric problems in which convex exactness holds. For each of the nine symmetry groups defined by the shapes in Figure 2 we generated random graphs in according to the distribution . For each such graph we then found an extreme point by maximizing a random linear energy over the optimal set. As shown in Table 1 for the graphs for which convex exactness holds generically, this algorithm succeeded in returning a permutation in all experiments. For the groups for which convex exactness does not hold, this algorithm returned permutations in more than half of the experiments, but non-integer solutions were also obtained. This is due to the fact that the optimal set contains non-integer extreme points in this case.
Next we suggest a more efficient method for obtaining all extreme points of the set of convex isomorphisms, under the assumption that the assumptions of Proposition 3.1 hold and is not too large .
If is discriminative, then the centroid solution can be found directly as described in the previous section.
Once a convex isomorphism was found, we use the technique of [Pataki, 1996] to find an extreme point. We now describe this technique:
We begin with some preliminaries: For , we say that if whenever . We say that if but the converse inequality does not hold.
A face of a convex set is a subset such that for all and satisfying
[TABLE]
necessarily . An extreme point is a face which is a singleton. If is a convex compact set then it is the convex hull of its extreme points . Moreover, for each face ,
[TABLE]
Every defines a face
[TABLE]
and an affine space obtained from by removing the positivity constraints, i.e.,
[TABLE]
We note that is in the relative interior of . This means that for all there is a sufficiently small such that . The boundary of in is the set:
[TABLE]
We can now describe the algorithm of [Pataki, 1996]:
We are given as input some and set and . 2. 2.
We compute a spanning subset to the affine space . If then is an extreme point and we are done. 3. 3.
Otherwise we choose some in . We then find the unique such that
[TABLE]
and set to be the matrix on the left hand side. We then return to the previous step.
The iterative process can only terminate when . This will necessarily occur after a finite number of steps since always has more zeros than . In the convex exact case, a permutation will be attained within steps. This is because each face is strictly contained in the former face and therefore according to (16) the number of extreme points=permutations in is strictly smaller than the number of extreme points in .
Once a permutation is obtained, an additional permutation can be sought for by repeating the process above, but beginning with where is the smallest possible so that is doubly stochastic. This choice gives an initial convex isomorphism such that , guaranteeing that the algorithm will return a new permutation . In the next step we can set and continue in this manner until we obtain a collection of isomorphisms , and is a convex combination of these isomorphisms. In fact under the full orbit assumption will be all the isomorphisms. This is because can be written as a positive convex combination of all members of , and the members of are linearly independent, implying that this is the only possible convex combination giving , so that that all isomorphisms were obtained. The linear independence of follows from the fact that it has full orbit, and so each isomorphism has a non-zero coordinate on which all other isomorphisms vanish.
From a computational perspective, under the conditions of Proposition 3.1, The algorithm above will return an isomorphism within steps, and all isomorphisms within iterations. Computing the first affine space is basically the problem of finding a linear basis to the solution set of the linear equations defining . Since has at most non-zero entries, this is a linear equation in variables instead of variables. For finding the subsequent affine spaces additional computational saving can be obtained due to the fact that . Thus all elements in are affine combinations of spanning element of the affine space , so is obtained by solving a linear equation in only variables.
Figure 4 shows the results of applying the algorithm described above to find the symmetries of a grid. The grid has a reflective symmetry group with full orbit and thus fulfills the conditions of Theorem 1. We took to be the Euclidean distance matrix of the grid (here ) and used the algorithm described above to obtain all symmetries of the grid. In our implementation in Matlab this calculation took around ten seconds.
6.2 Projection methods
The classical approach [Aflalo et al., 2015] for projecting a permutation solution from the doubly stochastic relaxation is using the standard projection, which can be implemented as a linear program and solved efficiently using the Hungarian algorithm. See [Zaslavskiy et al., 2009] for more details. A more accurate and more computationally demanding method is the ”convex to concave” method. We will explain this method in the formulation used in the DS++ algorithm [Dym et al., 2017]. Similar suggestions appear in [Zaslavskiy et al., 2009, Ogier and Beyer, 1990]. We then prove DS++ obtains a permutation solution in the convex exact case (up to some technicalities which will be explained), and examine the behavior of both projection methods when noise is added.
Convex to concave projection
The convex to concave method sequentially solves optimization problems of the form
[TABLE]
The strictly concave function
[TABLE]
is non-negative on DS, and if and only if is a permutation. Additionally if is sufficiently large so that is strictly concave, then the (global and local) minima of (17) will necessarily be permutations since the minima of a strictly concave function on a convex compact set are always extreme points. Thus the global minimum of the relaxed and the original quadratic assignment problem are identical. Note however that since (17) is not convex computing the global minimum is no longer tractable.
Building on this observation, the convex to concave method minimizes (locally) a sequence of optimization problems of the form (17) on a sequence of choices
[TABLE]
and in each step uses the obtained solution as a warm start to the optimization of . The first point is selected so that is convex, and the last point is selected so that is strictly concave and thus the obtained local minima is guaranteed to be a permutation.
The first point can be selected to be zero to ensure that is convex. However a better selection is where is the minimal eigenvalue of the quadratic form
[TABLE]
when restricted to the subspace
[TABLE]
Similarly the last point is selected to be (slightly larger than) the maximal eigenvalue of the same quadratic form over the same subspace. This choice ensures that is (strictly) concave. The remaining points can be uniformly sampled in the interval (for lack of a better strategy).
Note that if and are isomorphic, then for any the global minimizers of are precisely (while for the global minimizers are ). This observation suggests the ”convex to concave” method may be successful in retrieving isomorphisms even for symmetric problems, and possibly could return integer solutions even for . We now give a theoretical justification for these observations.
We assume that we obtain each from a local minimization algorithm with the following properties:
Monotonicity: . 2. 2.
The first-order necessary condition (KKT conditions) for local minima is satisfied at . 3. 3.
The second-order necessary condition for local minima of is satisfied at . That is
[TABLE]
Here is the Hessian of the quadratic form .
Under these assumptions we prove
Theorem 5**.**
Assume and are isomorphic and the DS relaxation is convex exact at . Assume satisfy conditions (1)-(3). Then if is sufficiently close to
[TABLE]
The theorem is proved in Appendix A.
Isomorphism retrieval for noisy problems
We examine the behavior of the DS relaxation coupled with the projections described above for noisy symmetric problems by conducting the following experiment:
We construct a random bilaterally symmetric graphs and choose . We then perturb these graphs by two randomly selected symmetric matrices , and solve the DS relaxation using both projection methods. We do this for and for matrices with Frobenius norm where we use ten values of uniformly chosen from the interval . The graph is chosen by computing an isometry where is a permutation subgroup with two elements, and then sampling a vector uniformly from the unit sphere to obtain . For each fixed value of we repeat different instances of the experiment, and compute the retrieval ratio of both methods, which we define as the number of times the method returned a permutation from divided by the number of experiments (100). The results are shown in Figure 5.
It can be seen that both methods succeed in retrieving a correct permutation at low noise levels, but the convex-to-concave method (denoted by DS++) is more successful than the projection method (denoted by DS) at higher noise levels. In the case we also add the ”groud truth retrieval ratio”, that is the number of instances in which the global minimizer of the graph matching energy was indeed in divided by the number of experiments. It can be seen that as the noise level approaches the noise ”takes over the problem” and the members of are no longer the global minimizers. The ground truth solutions was obtained by the semi-definite relaxation of [Kezurer et al., 2015] which is known to be very tight, though computationally expensive. We verify that the solution obtained from the semi-definite relaxation is indeed the correct solution by checking that the difference between the lower bound provided by the relaxation and the upper bound provided by projecting the solution of the relaxation are negligible.
As a side note, we observe that at low noise levels DS++ obtains a solution in after two iterations in accordance with Theorem 5, and that even at high noise levels a permutation solution is usually attained after four iterations. This indicates that it might be worthwhile to choose a smaller , or alternatively to consider less steps in the convex-to-concave process.
Appendix A Convex to concave
Proof of Theorem 5.
We begin with some preliminaries. First note that if is an isomorphism, then due to the monotonicity condition and thus is an isomorphism.
In the asymmetric case the claim is trivial: Since is convex its local minimizers are also global minimizers. Since in the asymmetric case is the unique isomorphism between and is the only global minimizer for any it follows that is that unique minimizer. Therefore in this proof we will focus on the symmetric case only.
In the symmetric case there are at least two isomorphisms . Thus is an eigenvector of the energy with eigenvalue and so .
Our claim follows easily from the following lemma:
Lemma 5**.**
There exists an open set containing such that for all , The only points satisfying the first and second order conditions for local minimization of are the members of .
To obtain the theorem from the lemma, let be the minimum of on the compact set . For any sufficiently small so that we obtain
[TABLE]
It follows that and since it satisfies the first and second order conditions for local minimization of it follows that .
Proof of Lemma 5.
We construct for each an open set satisfying the properties required from and then choose
[TABLE]
If is a convex isomorphism but not a permutation we choose
[TABLE]
Fix some , we claim that the second-order necessary condition for minimizing is not satisfied at for any . Since is a convex combination of isomorphisms we can choose an isomorphism such that , and so . Since are both zeros of the convex quadratic form , it follows that the second-order condition does not hold since (denoting by the Hessian of )
[TABLE]
For isomorphisms we choose as follows:
For any the concavity of implies
[TABLE]
In particular this is true for any in the compact set
[TABLE]
Since is the function
[TABLE]
is continuous. Thus, there is a neighborhood of on which
[TABLE]
Fix some . Define
[TABLE]
Note that and therefore there is some such that . It follows from (19) that
[TABLE]
and therefore
[TABLE]
The convexity of implies that for all ,
[TABLE]
From the last two equations it follows that for any the energy has a descent direction at any point . Not that this direction is orthogonal to the gradients of the constraints defining DS, since is feasible if is small enough. Thus the first-order condition does not hold at . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Aflalo et al., 2015] Aflalo, Y., Bronstein, A., and Kimmel, R. (2015). On convex relaxation of graph isomorphism. Proceedings of the National Academy of Sciences , 112(10):2942–2947.
- 2[Babai, 2016] Babai, L. (2016). Graph isomorphism in quasipolynomial time [extended abstract]. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing , pages 684–697. ACM.
- 3[Babai et al., 1982] Babai, L., Grigoryev, D. Y., and Mount, D. M. (1982). Isomorphism of graphs with bounded eigenvalue multiplicity. In Proceedings of the fourteenth annual ACM symposium on Theory of computing , pages 310–324. ACM.
- 4[Conte et al., 2004] Conte, D., Foggia, P., Sansone, C., and Vento, M. (2004). Thirty years of graph matching in pattern recognition. International journal of pattern recognition and artificial intelligence , 18(03):265–298.
- 5[Dym and Lipman, 2016] Dym, N. and Lipman, Y. (2016). Exact recovery with symmetries for procrustes matching. ar Xiv:1606.01548 .
- 6[Dym et al., 2017] Dym, N., Maron, H., and Lipman, Y. (2017). Ds++: A flexible, scalable, and provably tight relaxation for matching problems. ar Xiv:0902.0885 .
- 7[Falconer, 2004] Falconer, K. (2004). Fractal geometry: mathematical foundations and applications . John Wiley & Sons.
- 8[Fiori and Sapiro, 2015] Fiori, M. and Sapiro, G. (2015). On spectral properties for graph matching and graph isomorphism problems. Information and Inference , 4(1):63–76.
