Feasible bases for a polytope related to the Hamilton cycle problem
Thomas Kalinowski, Sogol Mohammadian

TL;DR
This paper investigates the structure of a polytope related to the Hamilton cycle problem, revealing its independence from certain parameters, its interpretation as a network flow polytope, and providing combinatorial insights and computational evidence.
Contribution
It advances understanding of the feasible bases of a Hamilton cycle-related polytope, including structural properties and a combinatorial interpretation, supporting a conjecture about Hamiltonian cycles.
Findings
Feasible bases are independent of the parameter near 1.
The polytope is a generalized network flow polytope.
A combinatorial interpretation of feasible bases is provided.
Abstract
We study a certain polytope depending on a graph and a parameter which arises from embedding the Hamiltonian cycle problem in a discounted Markov decision process. Eshragh \emph{et al.} conjectured a lower bound on the proportion of feasible bases corresponding to Hamiltonian cycles in the set of all feasible bases. We make progress towards a proof of the conjecture by proving results about the structure of feasible bases. In particular, we prove three main results: (1) the set of feasible bases is independent of the parameter when the parameter is close to 1, (2) the polytope can be interpreted as a generalized network flow polytope and (3) we deduce a combinatorial interpretation of the feasible bases. We also provide a full characterization for a special class of feasible bases, and we apply this to provide some computational support for the conjecture.
| 2 | 3 | 4 | 5 | 6 | total | |
|---|---|---|---|---|---|---|
| total | ||||||
| 2 | 3 | 4 | 5 | 6 | 7 | 8 | total | |
|---|---|---|---|---|---|---|---|---|
| total | ||||||||
| 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | total | |
|---|---|---|---|---|---|---|---|---|---|---|
| total | ||||||||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Feasible bases for a polytope related to the Hamilton cycle problem
Thomas Kalinowski1,2
and
Sogol Mohammadian2
1School of Science and Technology, University of New England, NSW, Australia
2School of Mathematical and Physical Sciences, University of Newcastle, NSW, Australia
Abstract.
We study a certain polytope depending on a graph and a parameter which arises from embedding the Hamiltonian cycle problem in a discounted Markov decision process. [11] conjectured a lower bound on the proportion of feasible bases corresponding to Hamiltonian cycles in the set of all feasible bases. We make progress towards a proof of the conjecture by proving results about the structure of feasible bases. In particular, we prove three main results: (1) the set of feasible bases is independent of the parameter when the parameter is close to 1, (2) the polytope can be interpreted as a generalized network flow polytope and (3) we deduce a combinatorial interpretation of the feasible bases. We also provide a full characterization for a special class of feasible bases, and we apply this to provide some computational support for the conjecture.
Key words and phrases:
Hamilton cycle, polytope, feasible basis, random walk
2010 Mathematics Subject Classification:
90C27, 90C35
1. Introduction
The Hamilton Cycle Problem (HCP) is one of the classical problems in combinatorics. Given a graph , the problem is to decide if contains a cycle that visits each node exactly once. Cycles that pass through every node of a graph exactly once are called Hamilton cycles. If a graph contains at least one Hamilton cycle, then it is called Hamiltonian. Otherwise, it is non-Hamiltonian. The HCP is NP-complete even for planar graphs with maximum degree three for undirected graphs, and maximum degree two for directed graphs [16, 18], so it is unlikely that there is an exact algorithm which terminates in polynomial time and solves the problem in general.
The Traveling Salesman Problem (TSP) asks for a Hamilton cycle of minimum weight in an arc-weighted graph, and therefore the HCP is a special case of the TSP where all the weights are either zero or one. Due to its importance in many applications and its rich mathematical structure the TSP has attracted the attention of many researchers. In particular, it has been one of the major driving forces for the development of polyhedral techniques in combinatorial optimization (see [2] for a nice overview). The underlying idea is to identify a subset of the arc set of a graph on vertices with its characteristic vector in . Then the TSP is asking for the minimum of a linear function over the convex hull of the set of Hamilton cycles, and this convex hull is known as the traveling salesman polytope.
In 1994, [14] proposed a new approach to the HCP, based on the theory of Markov Decision Processes (MDPs). An MDP comprises a state space, an action space, transition probabilities between states (which depend on the actions taken by the decision maker) and a reward function. In the basic setting, the decision maker takes an action, receives a reward from the environment, and the environment changes its state. Next, the decision maker identifies the state of the environment, takes a further action, obtains a reward, and so forth. The state transitions are probabilistic, and depend solely on the state and the action taken by the decision maker. The reward obtained by the decision maker depends on the action taken, and on the current state of the environment. The decision maker’s actions in each environmental state are prescribed by a policy. The model introduced by [14] initiated a new line of research and has attracted growing attention (see, for instance, [3, 4, 6, 7, 8, 9, 13, 15, 17]). In particular, [13] investigated the relationship between the HCP and discounted MDPs. In discounted MDPs, a discount factor , which represents the difference in importance between future and present rewards, is used to discount rewards. Feinberg presented a polytope, which we shall refer to as , constructed from an input graph . He showed that the Hamilton cycles of correspond to certain extreme points of the polytope , called Hamiltonian extreme points. [8] described some geometric properties of and [12] transformed to a combinatorially equivalent polytope to avoid certain numerical issues. Moreover, they constructed a new polytope by adding new linear constraints, called wedge constraints, to the polytope .
In 2011, [10] partitioned the extreme points of into five types, consisting of Hamiltonian extreme points and four types of non-Hamiltonian extreme points. They showed that for a discount factor sufficiently close to one, the wedge constraints cut off the non-Hamiltonian extreme points of types 2, 3 and 4, while preserving the Hamiltonian extreme points. In addition, they proposed to use a random walk on the extreme points (or on the feasible bases) of the polytopes and to search for a Hamilton cycle, and they showed that for a Hamiltonian graph , the random walk algorithm detects a Hamiltonian extreme point with probability one in a finite number of iterations.
For a more precise analysis of the efficiency of this random walk approach it is necessary to understand the combinatorial structure of the polytopes. More precisely, it is required to analyze the prevalence of Hamiltonian extreme points within the set of all extreme points, as well as the mixing properties of the random walk. This was the motivation for the work of [11] who established results on the combinatorial structure of the polytope . They characterized feasible bases of the polytope for a general input graph , and determined the expected numbers of different types of feasible bases when the underlying graph is random. They showed that for a random graph, the number of feasible bases corresponding to Hamiltonian extreme points of is exponentially small compared to the total number of feasible bases. Moreover, they demonstrated that the wedge constraints eliminate a large number of non-Hamiltonian feasible bases, and they provided computational evidence for the efficiency of the random walk on the feasible bases of . Based on their computational and theoretical results, they conjectured that for a random graph on nodes the ratio between the number of feasible bases corresponding to Hamilton cycles and the total number of feasible bases is asymptotically bounded below by for some positive constants and .
In this paper, we continue this line of research by studying the structure of the polytope . In particular, we are interested in characterizing feasible bases for this polytope. In Section 2, we introduce some notation and provide relevant background from the literature. Section 3 is about general results on feasible bases for . In particular, we prove that there is a constant such that for all graphs on vertices, the set of feasible bases for does not depend on as long as . Then we establish a close relationship between and a generalized network flow polytope for a graph that is obtained from by splitting each node into two nodes. The third result of Section 3 is a proof that any feasible basis contains a set of arcs forming a collection of node-disjoint cycles, such that in the corresponding basic feasible solution, the values of the variables corresponding to these arcs tend to as , while the values of the remaining basic variables tend to [math]. In Section 4, we use the relation to the generalized network flow polytope to give a complete characterization of feasible bases of which correspond to a Hamilton cycle in the generalized network flow setting.
2. Notation and background
Consider a digraph without loops and parallel arcs, where is the set of nodes and is the set of arcs. Throughout this paper, refers to such a digraph on nodes, unless otherwise stated. For each node , the in-neighborhood , and the out-neighborhood are the sets
[TABLE]
Furthermore, we use the following notations to denote the total inflow and total outflow for node , respectively:
[TABLE]
As indicated earlier, [13] defined a polytope depending on the graph , and showed that finding a Hamilton cycle is equivalent to finding an extreme point of this polytope whose support corresponds to a Hamilton cycle. The support of an extreme point is defined to be the set of its non-zero coordinates. More precisely, he proved the following theorem.
Theorem 2.1** ([13]).**
Consider a digraph , a parameter with , and let be the polytope defined by the constraints
[TABLE]
The graph is Hamiltonian if and only if there exists an extreme point of which has exactly positive coordinates tracing out a Hamilton cycle in .
[12] modified by a coordinate transformation for all . The resulting polytope is defined by the constraints
[TABLE]
Since values of close to one were shown to be important in [10, 12], this transformation eliminates numerical instability in (2.3). The following definition is motivated directly from Theorem 2.1.
Definition 1**.**
Let be an extreme point of the polytope . If the positive coordinates of trace out a Hamilton cycle in the graph , is called a Hamiltonian extreme point. Otherwise, it is called a non-Hamiltonian extreme point.
[12] observed that, if is the Hamiltonian extreme point corresponding to a Hamilton cycle then its components are given by if is the -th arc in starting from node 1, and if is not contained in . In particular, the Hamiltonian extreme points satisfy the wedge constraints
[TABLE]
which cut off some non-Hamiltonian extreme points. Adding the wedge constraints and introducing slack variables , we obtain the polytope described by the following constraints:
[TABLE]
A basis for this polytope can be specified by a triple , where is the set of basic variables, and the sets and are non-basic -variables at their lower and upper bounds, respectively. In other words, is the partition of the set , such that for and for . By a slight abuse of notation, we will simply call the triple a basis. The set can be identified with the union , where is the set of arcs such that is a basic variable, and is the set of nodes such that is a basic variable. A basis can then be interpreted as a node-colored digraph on the node set : the arc set is and the color classes are , , and . If the unique solution of the system of equations (2.10)–(2.13) corresponding to satisfy the lower and upper bound constrains (2.14)–(2.15), then the basis is feasible, otherwise it is infeasible.
Remark 1*.*
For the extreme point corresponding to a basis , the total in and outflows of the nodes in are as follows
[TABLE]
[12] introduced the concept of quasi-Hamiltonian extreme points (bases) to search for Hamilton cycles among the extreme points (or the feasible bases) of the polytope .
Definition 2**.**
An extreme point of is called quasi-Hamiltonian if any walk , where for , is a Hamilton cycle in . A feasible basis corresponding to a quasi-Hamiltonian extreme point is called quasi-Hamiltonian basis.
For a positive integer and a probability , (which may depend on ), let be the graph on vertices obtained by adding each arc independently with probability , and then adding the arcs of a randomly chosen Hamilton cycle (this is very similar to the random graph model studied in [5]). Motivated by a random walk approach to the HCP for sparse random graphs, the following conjecture was made in [11].
Conjecture 1**.**
There exist positive constants , and such that for all , with high probability, the expected proportion of feasible bases of that are quasi-Hamiltonian is at least .
Proving this conjecture would be a first step towards a random walk based algorithm for the HCP on for very small .
3. General results
In this section we establish general results about feasible bases of . In Subsection 3.1, we show that the set of feasible bases for does not depend on the parameter for values of sufficiently close to one. Then, in Subsection 3.2, we demonstrate that can be interpreted as a generalized network flow polytope for a certain network on nodes obtained by splitting each node of into two nodes. The polyhedral theory of the generalized network flow problem implies that the bases in this setting correspond to subgraphs in which every connected component is an augmented tree, that is, a graph obtained from adding a single arc to a tree, thus creating a unique cycle. We prove that the feasible bases in the generalized network flow interpretation of are always connected, that is, a feasible basis corresponds to a spanning augmented tree. Finally, in Subsection 3.3, we show that in the original graph every feasible basis contains a spanning collection of node-disjoint cycles, such that the values of the variables corresponding to the arcs of these cycles are close to one, while the values of the remaining variables are close to zero.
3.1. Ultimate bases
Let be the set of feasible bases for . We first show that there exists such that whenever , that is, the set does not depend on the parameter , as long as it is sufficiently close to . Consider a triple with , where , and , and is a partition of the set . Let be the submatrix of the constraint matrix of the polytope corresponding to , set for and for , and let be the vector that is obtained from the right hand sides of (2.10) through (2.13), where for the in (2.13) is replaced by . The triple is a feasible basis for if and only if the following two conditions are satisfied:
- (1)
is invertible, that is, , and 2. (2)
the -components of the vector satisfy (2.14), and the -components satisfy (2.15).
We refer to these conditions as independence and feasibility, respectively. In the next two lemmas we show that these two properties do not depend on for sufficiently close to . The determinant is a polynomial in . If this is the zero polynomial, then the independence condition is not satisfied for any . On the other hand, if the determinant is not the zero polynomial, then is independent for all sufficiently close to . For the formal argument we let be the set of all sets of variables such that the corresponding columns are independent for some , that is,
[TABLE]
Since is finite, there is an such that none of the polynomials , , has a root in the open interval . This implies the following lemma.
Lemma 3.1**.**
There exists such that for all , and for all , , , with , the matrix is invertible for every .
Proof.
Define a function as follows:
[TABLE]
Then has the claimed property. It follows immediately from the construction that , and that for every and every with , . ∎
The next step is to show that feasibility is also independent of for values sufficiently close to .
Lemma 3.2**.**
There exists such that for all , and for all , the vector satisfies (2.14) and (2.15) if and only if satisfies (2.14) and (2.15).
Proof.
For , and let , let . The components of this vector are rational functions of with denominator , say
[TABLE]
Let . We define
[TABLE]
if is not identically zero, but vanishes for some , and otherwise. Similarly, we define
[TABLE]
if is not identically zero, but vanishes for some , and otherwise. Let
[TABLE]
and . Let . It follows from Lemma 3.1 and , that is invertible for every and every . Furthermore, as , none of the functions , , and changes sign on the interval , and this implies that has the claimed property. ∎
Combining Lemmas 3.1 and 3.2 we obtain the following result which justifies the subsequent definition.
Proposition 1**.**
There exists such that for all , .
Definition 3**.**
We call the elements of for ultimate bases for , and we denote the set of ultimate bases for by .
Henceforth, we consider ultimate bases of and we set where tends to zero. We omit the argument whenever there is no danger of confusion.
3.2. A generalized network flow formulation
In this subsection, we show that the polytope can be interpreted as a generalized network flow (GNF) polytope and we establish results about feasible bases of in the GNF setting. We first recall some definitions and results pertaining to the GNF polytope, see [1, Chapter 15] for details. Let be a digraph with capacities and positive rational multipliers for each arc , and demands/supply for each node . The GNF polytope for this data is defined by the following constraints:
[TABLE]
A possible interpretation is that for every unit of flow that enters arc in node , units arrive at node . The polytope given by (2.10) through (2.15) is a GNF polytope for the digraph obtained from by splitting each node into two nodes and , replacing arcs by , and adding arcs for . Then is the flow on arc , the multipliers are equal to for all arcs of the form , and equal to for all arcs of the form . More precisely, the digraph has node set and arc set given by
[TABLE]
We denote the two parts in the partition of as and , that is, and . This construction of is illustrated in Figure 1.
The constraints (2.10) to (2.14) determine the supply/demand vector as follows:
- (1)
Constraint (2.12) says that has a supply of , that is, . 2. (2)
Together with (2.10), this implies that has a demand of , that is, . 3. (3)
By constraint (2.13), node for has a supply of , that is . 4. (4)
From (2.13) and (2.11), it follows that, for ,
[TABLE]
hence node has demand , that is, .
The correspondence between the GNF in and the model (2.10) through (2.15) is illustrated in Figure 2. To summarize, the vector is given by
[TABLE]
The interpretation of as a GNF polytope allows us to apply the known results about the structure of bases for GNF polytopes to and identify bases of with subgraphs of . For this purpose, we introduce some terminology (following [1, Section 15.3]), and then state the characterization of bases of GNF polytopes in Theorem 3.1. For a cycle (not necessarily directed) with a given orientation, let and denote the sets of forward an backward arcs in . The cycle multiplier is
[TABLE]
If one unit of flow is sent along , starting at some node , then units return to this node. A cycle is called a breakeven cycle if .
Definition 4**.**
An augmented tree is a connected graph with exactly one cycle, called the extra cycle. An augmented forest is a collection of node-disjoint augmented trees.
Definition 5**.**
An augmented tree is called a good augmented tree if its extra cycle is not a breakeven cycle. An augmented forest as a good augmented forest if each of its components is a good augmented tree.
In our specific setting it turns out that the breakeven condition is equivalent to having the same number of forward and backward arcs.
Definition 6**.**
An oriented cycle is called balanced if it has the same number of forward and backward arcs.
Proposition 2**.**
A cycle in is breakeven if and only if it is balanced.
Proof.
Let be a cycle in , and let and denote the set of arcs of in and , respectively. As the multipliers of the arcs in are all equal to and the multipliers of the arcs in are all equal to , we have , where is the difference between the number of forward arcs and the number of backward arcs in . As a consequence, is a breakeven cycle if and only if contains the same number of forward and backward arcs. Let be the elements of in the order in which they are traversed by , and let be the paths into which is cut by deleting the arcs in . More precisely, is the path from to for , and is the path from to . Let be the difference between the number of forward and backward arcs on . If starts and ends at a forward arc then , if it starts and ends at a backward arc, then , and otherwise . Now we conclude, that is a breakeven cycle if and only if contains the same number of forward and backward arcs if and only if if and only if contains the same number of forward and backward arcs if and only if is balanced. ∎
From Proposition 2 we deduce that an augmented forest in is good if and only if none of its extra cycles is balanced. This is illustrated in Figure 3 where (3b) and (3c) depict an augmented tree and an augmented forest, respectively, for the digraph corresponding to the digraph in (3a). The tree in Figure 3(3b) is not a good augmented tree because the extra cycle is balanced. The forest in Figure 3(3c) is a good augmented forest as the extra cycles are not balanced.
Theorem 3.1** (Section 15.5 in [1]).**
Let be a directed graph. For a partition , the triple is a basis for a GNF polytope with underlying digraph if and only if is a good augmented forest which spans all the nodes of .
We denote the GNF interpretation of the polytope by . In Theorem 3.2 we strengthen Theorem 3.1 for the polytope by showing that in order to obtain a feasible basis, the augmented forest has to be connected, that is, it has to be a good augmented tree.
Theorem 3.2**.**
If is a feasible basis for , then is a good augmented tree.
This will be proved by showing that the assumption of more than one connected components leads to a contradiction. For this purpose we need some preliminary results which are stated in Lemmas 3.3–3.6. For the rest of this subsection, we assume that is a basis for . In particular, is a good augmented forest, and we assume it has connected components . We shall use to denote both the arc set and the node set of the good augmented tree . The intended meaning will be clear from the context. Without loss of generality, . For convenience, we set
[TABLE]
Thus, correspond to the arcs in which have both endpoints in , corresponds to the arcs in which enter , and corresponds to the arcs in which leave . The next lemma states that every component must contain both and for some .
Lemma 3.3**.**
For every , the extra cycle in contains an arc from . In particular, and , for .
Proof.
Fix , let be the extra cycle in , and assume that every arc in is in . Thus, in the nodes only have outgoing arcs and the nodes only have incoming arcs. This implies that is balanced which is a contradiction. ∎
In the next lemma we analyze the interaction between components. The arcs in can carry flow only if they correspond to basic variables. As a consequence, the flow transfer between connected components is only through arcs in , that is, arcs of the form .
Lemma 3.4**.**
[TABLE]
Proof.
Let . Summing (2.11) over all and then swapping the indices and , and splitting the sum into the terms for and the terms for , we obtain
[TABLE]
All the nonzero terms on the left hand side have , so we can rearrange the left hand side as follows:
[TABLE]
where for the second equality we used that for and , the variable can be nonzero only if . Therefore,
[TABLE]
which is equivalent to (3.1). For , we proceed similarly. We start by summing constraints (2.10) and (2.11) over all :
[TABLE]
As above, the left hand side is equal to , and therefore,
[TABLE]
which is equivalent to (3.2). ∎
Next we show that every component must have the same number of incoming and outgoing arcs in .
Lemma 3.5**.**
For every , .
Proof.
For every , the left hand side of (3.1) and (3.2), respectively, is
[TABLE]
For , the right hand side of (3.1) is
[TABLE]
and the right side of (3.2) is
[TABLE]
In both cases, we conclude that , and the claim follows because the left hand side is an integer. ∎
For , we have . As a consequence, which implies that . For , set
[TABLE]
and for , , or equivalently, if , and otherwise.
Lemma 3.6**.**
[TABLE]
Proof.
We start by bounding the left hand sides of the identities (3.1) and (3.2) from below:
[TABLE]
where the second inequality follows from Lemma 3.3. Next we express the sums on the right hand sides of (3.1) and (3.2) in terms of the quantities , , and . First consider the case . Then
[TABLE]
The right hand side of (3.1) is
[TABLE]
and with (3.5) we obtain (3.3). For ,
[TABLE]
The right hand side of (3.2) is
[TABLE]
and with (3.5) we obtain (3.4). ∎
We are now ready to finish the proof of Theorem 3.2.
Proof of Theorem 3.2.
It follows from (3.3) that , for , because implies which forces the left hand side of (3.3) to be zero. Then (3.3) implies , hence . On the other hand, by definition we have
[TABLE]
Therefore, , and (3.4) implies
[TABLE]
which simplifies to . If , then (3.2) becomes
[TABLE]
But for every , so the left hand side is , and this implies which contradicts the assumption that is not connected. Therefore, and , (3.4) becomes , and together with , we obtain and . Furthermore, , this implies and , and then for the unique node , and for the unique node . Now (3.2) becomes
[TABLE]
From , we have , and with , this implies . But then , which contradicts Lemma 3.3. ∎
3.3. Thick and thin arcs
In this subsection, we show that for a feasible basis of the polytope , the values of the basic variables either tend to one or zero as . More precisely, we show that there are exactly arcs corresponding to basic variables that tend to 1, and that these arcs form a collection of node-disjoint cycles which we call thick cycles. Furthermore, we provide upper bounds for the number of non-basic -variables which are at their lower bounds and the number of non-basic -variables which are at their upper bounds.
Definition 7**.**
Let , where and let . The arc is called thick with respect to , if . The arc is called thin with respect to if . Otherwise, the arc is called intermediate with respect to .
We start by expressing the sum of all flow variables in terms of .
Lemma 3.7**.**
For every point ,
[TABLE]
Proof.
The first two equalities follow from the observation that and are two different ways of computing the sum of all the , first by grouping the arcs according to their end node, and second by grouping them according to their start node. With , and for all , we obtain
[TABLE]
which implies
[TABLE]
Theorem 3.3**.**
Let , where and let , denote the sets of thick and thin arcs, respectively. Then
- (i)
, 2. (ii)
* forms a spanning collection of node-disjoint directed cycles,* 3. (iii)
for every , the digraph with arc set contains a directed path from node to node , 4. (iv)
* and .*
Proof.
- (i)
For the sake of contradiction, assume that there exists an intermediate arc . It follows from (2.10)–(2.13) that, for every , the total outflow and the total inflow tend to one as . Thus, there is at least one more intermediate arc leaving node , say . Similarly, nodes and have another intermediate incoming arc. This argument shows, that every node with an outgoing intermediate arc has at least two outgoing intermediate arcs, and every node with an incoming intermediate arc has at least two incoming intermediate arcs. As a consequence contains a cycle which alternates between forward and backward arcs, which implies that the corresponding cycle in is balanced. This contradicts the assumption that is a basis for . 2. (ii)
Using (2.13), (2.14), and (2.12), for every ,
[TABLE]
Thus, there is at most one thick arc leaving node . If there is a node with no thick leaving arc, then using part (i), the lower bound from the wedge constraints (2.9) and (2.10), we have
[TABLE]
which is a contradiction. Hence, in the subgraph every node has exactly one leaving arc. A similar argument shows that every node has exactly one entering thick arc, and therefore, is a spanning collection of node-disjoint directed cycles. 3. (iii)
For the sake of contradiction, let us assume that there is a node such that there is no directed path from node 1 to . Let be the set of nodes such that there exists a directed path from to . Since , (2.11) implies
[TABLE]
for all . Summing over all , we obtain
[TABLE]
The sum on the left hand side is over all arcs starting in and we split it into the sum over the arcs with both nodes in and the arcs starting in and ending in . The sum on the right hand side is over all arcs ending in , and by the definition of these arcs also start in . This gives
[TABLE]
which simplifies to
[TABLE]
With and the non-negativity of the , it follows that
[TABLE]
and this implies that for all , . In particular,
[TABLE]
which contradicts the wedge constraint (2.9). 4. (iv)
We use Lemma 3.7, together with for , and for :
[TABLE]
On the other hand, from and for every , we obtain
[TABLE]
and therefore,
[TABLE]
The first of these inequalities simplifies to , and the second one to .∎
Combining Theorems 3.2 and 3.3 we establish the following results on the structure of feasible bases of in the GNF setting.
Theorem 3.4**.**
Let where and let be the corresponding arc sets in , that is,
[TABLE]
- (i)
* is a good augmented tree spanning all nodes of .* 2. (ii)
* is the unique perfect matching in .* 3. (iii)
* is a collection of node-disjoint paths.*
Proof.
Item (i) follows from Theorem 3.2. Theorem 3.3 (ii) implies that the arcs in form a perfect matching in . With Lemma 3.3, there is at least one arc of the form in the extra cycle of . As a consequence, is a forest. Since a forest has at most one perfect matching, is the unique perfect matching. The last statement can be verified by adding the arcs in one-by-one to . Clearly, we start with node-disjoint paths, each of them being a single arc. And then in each step the new arc from connects the endpoints of two paths, thereby reducing the number of paths by 1. ∎
The relation between the characterizations of feasible bases for the polytope provided in Theorems 3.3 and 3.4 is illustrated in Figure 4.
4. A class of feasible bases
For the approach to the Hamiltonian Cycle Problem based on sampling extreme points of , an essential ingredient is a lower bound for the fraction of Hamiltonian extreme points in the set of all extreme points. In order to derive such an estimate it will be useful to understand the structure of the feasible bases corresponding to a fixed subgraph of or . By Theorem 3.3, a subgraph of which corresponds to a feasible basis contains a spanning collection of node-disjoint directed cycles. In , this corresponds to a perfect matching between the sets and , and Theorem 3.4 provides the additional information that the subgraph corresponding to the basic variables is a good augmented spanning tree.
In this section we make a further step by focusing on the feasible bases corresponding to a fixed (isomorphism class of) good augmented tree in . More precisely, we characterize the feasible bases whose corresponding good augmented tree is a Hamilton cycle in . This does not imply that the basis is Hamiltonian in the sense that it corresponds to a Hamilton cycle in (see Figure 5a for an illustration). From Section 3.2 we know that at least one -variable needs to be in the basis, that is, the good augmented tree must contain at least one arc of the form . We restrict our attention further to the case that the basis contains exactly one -variable, say . Starting with the thick arcs , and adding the arc corresponding to the basic -variable, we obtain isolated arcs and one path of length which consists of the thick arc into , the arc and the thick arc leaving . In order to obtain a Hamilton cycle, the thin arcs have to connect these components in such a way that for every , the node has exactly one outgoing arc, and node has exactly one incoming arc. In , this corresponds to every node having exactly one thick and one thin outgoing arc, and exactly one thick and one thin incoming arc, while the only arcs incident with node are the thick arcs into and out of . Up to relabeling the variables, we can then assume that the thick arc into node is the arc , and that the thick arc out of node is the -th thick arc if we traverse the Hamilton cycle in starting with the thick arc out of . More precisely, up to relabeling the variables, we may assume that there is a fixed-point free permutation of such that the following conditions are satisfied:
- •
The set of thick arcs is .
- •
The set of thin arcs is .
- •
if and only if .
In the last condition above, and throughout this section, whenever , refers to the node . For a basis , the set of basic variables is determined by the permutation , and we still have to choose a partition of the set of non-basic -variables such that for and for . In Figure 5, we illustrate how a basis can be completely specified by a picture, using different types of nodes to indicate if , or is a basis variable. The general structure of the bases considered in this section is illustrated in Figure 6.
Our aim is to characterize the 4-tuples of a node , a fixed-point free permutation with if and only if , and a partition , such that the basis given by the triple is feasible, where . The cycles of the permutation correspond to the thick cycles in . In particular, the basis is quasi-Hamiltonian if and only if the permutation is a single cycle.
In order to simplify the notation, we introduce new variables. Let , be the flow on the thick arc leaving node , and let , be the flow on the thin arc entering node . This is illustrated in Figure 7. For all , , and , and substituting the values of and leads to the following representation of the system (2.10)–(2.13):
[TABLE]
Lemma 4.1**.**
The quadruple encodes a feasible basis if and only if the unique solution of system (4.1)–(4.8) satisfies for all , and .
Proof.
If the quadruple encodes a feasible basis, then for all , and , or equivalently, . For the converse, let be the solution of the system (4.1)–(4.8) for , and assume that for all , and . To verify feasibility, we only need to show that for all . For , this is an immediate consequence of the assumption which implies . Using that the right hand sides of equations (4.2) to (4.7) are all , we obtain, for every
[TABLE]
and by induction for all . ∎
Lemma (4.1) implies that in order to characterize the feasible bases, we need to derive necessary and sufficient conditions for for all and . We start by finding necessary conditions. Expressing in terms of and (Lemma 4.2) allows us to specify and (Lemma 4.3), and deduce that for every feasible basis (Lemma 4.4). As a consequence, we find asymptotic expressions for (Lemma 4.5). Eliminating the -variables from the system (4.1)–(4.8) leads to a recursion for the , where the base case is given by the value determined in Lemma 4.5. The asymptotic expression obtained from solving this recursion is given in Lemma 4.6. Assuming that and have the right cardinalities and , the second order term of is positive whenever the first order term vanishes (Lemma 4.7). Theorem 4.1 combines the necessary conditions obtained so far into a characterization of feasible by three properties: (i) the cardinality condition for and , (ii) , and (iii) the non-negativity of the linear terms in the asymptotic expansions of the variables .
Lemma 4.2**.**
Let be the solution of the system (4.1)–(4.8) for . Then
[TABLE]
Proof.
We have
[TABLE]
where the first equation comes from Lemma 3.7, the second one follows from , , for and for , the third one from substituting for , and the last one from . The claim follows by solving for . ∎
Lemma 4.3**.**
If encodes a feasible basis, then and .
Proof.
We start with . If is even, then Theorem 3.3(iv) implies and , and the claim follows. For odd , Theorem 3.3(iv) implies only , and therefore . For the sake of contradiction, assume and . By Lemma 4.2,
[TABLE]
which is the required contradiction. ∎
Lemma 4.4**.**
If encodes a feasible basis then .
Proof.
Suppose . From Lemmas 4.2 and 4.3, we have that if is even, and if is odd. From (4.2) and (4.3), it follows that
[TABLE]
which is the required contradiction. ∎
In the next lemma, we use Lemma 4.2 to determine the asymptotics for the variables and assuming the necessary conditions from Lemma 4.3 and 4.4.
Lemma 4.5**.**
Suppose that and and .
- (i)
If is even then . 2. (ii)
If is odd then .
Proof.
We apply Lemma 4.2. For even ,
[TABLE]
and for odd ,
[TABLE]
In both cases, the claim follows since by (4.4) for . ∎
We set , and partition the set into nine sets with , where . In particular, . For every , we have and . Eliminating the -variables, we obtain the recursion
[TABLE]
where,
[TABLE]
We substitute for and take the limit for to obtain , where
[TABLE]
The base of the recursion is given by the values for in Lemma 4.5, and in order to capture this, we set , if is even and , if is odd. For , we define the set by
[TABLE]
and put for all . The following lemma is obtained from Lemma 4.5 and repeated application of (4.9).
Lemma 4.6**.**
If , and , then for all ,
[TABLE]
We want to derive necessary and sufficient conditions for . Clearly, it is necessary that , and if the inequality is strict then this is also sufficient. The next lemma deals with the equality case.
Lemma 4.7**.**
Let and suppose , and . For all ,
[TABLE]
Proof.
By assumption
[TABLE]
and thus
[TABLE]
For the second order term, we obtain
[TABLE]
Substituting the values for and we obtain for even ,
[TABLE]
and for odd ,
[TABLE]
Theorem 4.1**.**
For , the basis encoded by is feasible if and only if the following conditions are satisfied:
- (i)
* and ,* 2. (ii)
, 3. (iii)
For every , .
Proof.
Suppose that the basis encoded by is feasible. Conditions (i) and (ii) follow from Lemmas 4.3 and 4.4, respectively, and Condition (iii) from Lemmas 4.3 and 4.6 together with . Conversely, suppose that the three conditions in the theorem are satisfied. By Lemma 4.1, it is sufficient to verify and for all . The first of these conditions follows immediately from Lemma 4.5, and the second one from (iii) together with Lemmas 4.6 and 4.7. ∎
Next, we simplify Theorem 4.1(iii) and provide another characterization of feasible bases in Theorems 4.2 and 4.3. To do this, by using Lemmas 4.3 and 4.4 we deduce a restriction for the possible values of (Lemma 4.8). Furthermore, in Lemmas 4.9 and 4.10, we deduce conditions on the number of different types of arcs in various parts of the good augmented tree, where the type of an arc is determined by where it starts (in , , or ) and where it ends (in , or ).
Lemma 4.8**.**
If encodes a feasible basis, then if is odd, and if is even.
Proof.
By Lemmas 4.3 and 4.4, , and then (4.1) implies . If then (4.6) implies
[TABLE]
If is odd then , and for , (4.7) implies
[TABLE]
Lemma 4.9**.**
If and then .
Proof.
We start with which implies
[TABLE]
On the other hand, , hence
[TABLE]
From (4.13) and (4.14), we obtain
[TABLE]
Now
[TABLE]
where the last equality comes from the observation that if and if . ∎
Remark 2*.*
Lemma 4.9 implies that if and , then there exists a unique index such that and for all .
Lemma 4.10**.**
Suppose the quadruple encodes a feasible basis, and let . Then
- (i)
If is odd, then . 2. (ii)
If is even, then
- •
* for all , and*
- •
.
Proof.
Theorem 4.1(iii) together with (4.11) implies
[TABLE]
hence
[TABLE]
and therefore,
[TABLE]
If is odd then for all , and together with , we deduce . Now let be even, and assume that there exists such that . As before we apply Theorem 4.1(iii) together with (4.11) and :
[TABLE]
which is the required contradiction. ∎
In the next two theorems we show that the necessary conditions from Lemmas 4.3, 4.4, 4.8, and 4.10 are also sufficient.
Theorem 4.2**.**
Let . The quadruple encodes a feasible basis if and only if the following conditions are satisfied.
- (i)
* and ,* 2. (ii)
, 3. (iii)
, 4. (iv)
.
Proof.
If the basis corresponding to is feasible then conditions (i) to (iv) are implied by Lemmas 4.3, 4.8, 4.4, and 4.10. For the converse, suppose that the conditions are satisfied. From Theorem 4.1 together with (i)–(iii) it follows that it is sufficient to verify for all . From Lemma 4.9 with (ii) and (iii), it follows that is well defined, and from (iii), . Using this together with (v) and the definition of
[TABLE]
Now
[TABLE]
for all , and
[TABLE]
where the last equality is a consequence of . ∎
Theorem 4.3**.**
Let and let . The quadruple encodes a feasible basis if and only if the following conditions are satisfied.
- (i)
, 2. (ii)
, 3. (iii)
, 4. (iv)
, 5. (v)
* for all .*
Proof.
If the basis corresponding to is feasible then conditions (i) to (iv) are implied by Lemmas 4.3, 4.8 and 4.4, 4.10, and 4.10. For the converse, suppose that the conditions are satisfied. From Theorem 4.1 together with (i) and (ii) it follows that it is sufficient to verify for all . From Lemma 4.9 with (ii) and (iii), is well defined. Using (v) and the definition of
[TABLE]
Now
[TABLE]
We illustrate Theorems 4.1, 4.2, and 4.3 by checking the feasibility of the three bases represented in Figure 8.
It is immediately clear that all three examples satisfy the first two conditions in Theorem 4.1: the number of empty squares is at least the number of filled squares and at most the number of filled squares plus one, and the arc leaving the filled circle ends in a filled square.
Example 1** (Figure 8(8a)).**
We have , , , , and therefore
[TABLE]
which implies that the basis is infeasible. The fact that a negative partial sum occurs in the very end corresponds to the fact that the third condition in Theorem 4.3 is violated as .
Example 2** (Figure 8(8b)).**
Using , , , and , we obtain that all partial sums in are non-negative, and the basis is feasible. We get the same result by verifying the conditions in Theorem 4.2:
- (i)
There are filled squares and empty squares. 2. (ii)
The arc leaving the filled circle ends in a filled square. 3. (iii)
There is a thick arc from node to node . 4. (iv)
With , we have , hence .
Example 3** (Figure 8(8c)).**
Using , , , , and , we obtain that all partial sums in are non-negative, so the basis is feasible. We get the same result by verifying the conditions in Theorem 4.3:
- (i)
There are filled squares and empty squares. 2. (ii)
The thick arc leaving the filled circle ends in a filled square. 3. (iii)
The thick arc entering the filled circle starts in a filled square or the empty circle.
With , we have , hence , and then
- (iv)
, and 2. (v)
for all .
For small , we can use the characterizations given in Theorems 4.1, 4.2 and 4.3 to count feasible bases of the type considered in this section. For , this can be done by hand: By Theorem 4.2, and , in particular . Together with for all , and for all , this leaves only two permutations: and . Then is fixed because and , hence for and for . For this implies , and , and the basis is feasible because the partial sums of are all non-negative. For , and , so the basis is infeasible as . For , we implemented an algorithm that runs through all combinations and checks the feasibility conditions, using Theorems 4.1, 4.2 and 4.3 to limit the search space. The results for are presented in Tables 2 to 5. Here the columns (except the first, and for even the last) correspond to the values of , and the rows correspond to cycle types, where a tuple indicates that has cycles with lengths , and is the length of the cycle containing node . In particular, the first row corresponds to the quasi-Hamiltonian feasible bases. The shaded cells indicate the number of quasi-Hamiltonian feasible bases and the total number of feasible bases.
In these tables, we observe that the number of feasible bases decreases as the number of cycles increases, and that for a fixed number of cycles the number of feasible bases is larger if node is on a long cycle. In view of Conjecture 1, we are interested in the ratio between the number of quasi-Hamiltonian bases and the total number of feasible bases, and we would like this ratio to be bounded below by 1 divided by a polynomial function of . Defining and to be the numbers of quasi-Hamiltonian feasible bases and the total number of feasible bases in the considered class, respectively, we can summarize our counting results as shown in Table 6.
5. Summary and conclusion
In this paper we continue to study the polytope which was introduced by [12] in connection with a random walk based approach to the Hamilton Cycle Problem, and investigated further in [10, 11]. Two ingredients are needed in order to make this approach work: (1) if the input graph is Hamiltonian then there need to be sufficiently many feasible bases corresponding to Hamiltonian cycles, and (2) the graph of the polytope needs to have good mixing properties. In order to make progress towards establishing these two properties, a good understanding of the combinatorial structure of is needed, and in this paper we present significant results in this direction:
- (1)
The set of feasible bases does not depend on the value of as long as it is sufficiently close to 1. 2. (2)
can be interpreted as a generalized network flow polytope, and this interpretation leads to a nice graph theoretical interpretation of the structure of feasible bases. 3. (3)
For a special class of bases, we prove a complete characterization of the feasible bases. 4. (4)
We illustrate the characterization of feasible bases for small values of and present computational results supporting Conjecture 1.
Acknowledgement
We thank Hamish Waterer for pointing out the connection to generalized network flows.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ravindra K. Ahuja, Thomas L. Magnanti and James B. Orlin “Network flows: Theory, Algorithms and Applications” Pearson, 1993
- 2[2] David L. Applegate, Robert E. Bixby, Vašek Chvátal and William J. Cook “The traveling salesman problem: a computational study” Princeton University Press, 2011
- 3[3] Konstantin Avrachenkov, Ali Eshragh and Jerzy A. Filar “On transition matrices of Markov chains corresponding to Hamiltonian cycles” In Annals of Operations Research 243.1-2 Springer, 2016, pp. 19–35
- 4[4] Vivek S. Borkar and Jerzy A. Filar “Markov chains, Hamiltonian cycles and volumes of convex bodies” In Journal of Global Optimization 55.3 Springer Nature, 2011, pp. 633–639 DOI: 10.1007/s 10898-011-9819-6 · doi ↗
- 5[5] Andrei Z. Broder, Alan M. Frieze and Eli Shamir “Finding hidden Hamiltonian cycles” In Random Structures & Algorithms 5.3 Wiley Online Library, 1994, pp. 395–410
- 6[6] Vladimir Ejov, Jerzy A. Filar and Jacek Gondzio “An interior point heuristic for the Hamiltonian cycle problem via Markov decision processes” In Journal of Global Optimization 29.3 Springer, 2004, pp. 315–334
- 7[7] Vladimir Ejov, Jerzy A. Filar, Walter Murray and Giang T. Nguyen “Determinants and longest cycles of graphs” In SIAM Journal on Discrete Mathematics 22.3 SIAM, 2008, pp. 1215–1225
- 8[8] Vladimir Ejov, Jerzy A. Filar, Michael Haythorpe and Giang T. Nguyen “Refined MDP-based branch-and-fix algorithm for the Hamiltonian cycle problem” In Mathematics of Operations Research 34.3 Informs, 2009, pp. 758–768
