Numerical explorations of feasibility algorithms for finding points in the intersection of finite sets
Heinz H. Bauschke, Sylvain Gretchko, and Walaa M. Moursi

TL;DR
This paper investigates the effectiveness of various projection algorithms in solving feasibility problems involving finite sets, highlighting how algorithm choice and tuning parameters influence convergence in nonconvex scenarios.
Contribution
It provides a numerical analysis of four projection methods applied to finite set feasibility problems, exploring their behavior across different configurations.
Findings
Algorithm choice significantly affects convergence behavior.
Tuning parameters play a crucial role in the success of the methods.
Different problem configurations exhibit varied local and global convergence patterns.
Abstract
Projection methods are popular algorithms for iteratively solving feasibility problems in Euclidean or even Hilbert spaces. They employ (selections of) nearest point mappings to generate sequences that are designed to approximate a point in the intersection of a collection of constraint sets. Theoretical properties of projection methods are fairly well understood when the underlying constraint sets are convex; however, convergence results for the nonconvex case are more complicated and typically only local. In this paper, we explore the perhaps simplest instance of a feasibility algorithm, namely when each constraint set consists of only finitely many points. We numerically investigate four constellations: either few or many constraint sets, with either few or many points. Each constellation is tackled by four popular projection methods each of which features a tuning parameter. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Variational Analysis · Advanced Optimization Algorithms Research · Multi-Criteria Decision Making
11institutetext: Heinz H. Bauschke 22institutetext: Mathematics, UBCO, ASC 352, 3187 University Way, Kelowna, B.C. V1V 1V7, Canada, 22email: [email protected] 33institutetext: Sylvain Gretchko 44institutetext: Mathematics, UBCO, ASC 352, 3187 University Way, Kelowna, B.C. V1V 1V7, Canada, 44email: [email protected], 55institutetext: Walaa M. Moursi 66institutetext: Electrical Engineering, Stanford University, 350 Serra Mall, Stanford, CA 94305, USA,
and Mansoura University, Faculty of Science, Mathematics Department, Mansoura 35516, Egypt
66email: [email protected]
Numerical explorations of feasibility algorithms for
finding points in the intersection of finite sets
Heinz H. Bauschke
Sylvain Gretchko
and Walaa M. Moursi
Abstract
Projection methods are popular algorithms for iteratively solving feasibility problems in Euclidean or even Hilbert spaces. They employ (selections of) nearest point mappings to generate sequences that are designed to approximate a point in the intersection of a collection of constraint sets. Theoretical properties of projection methods are fairly well understood when the underlying constraint sets are convex; however, convergence results for the nonconvex case are more complicated and typically only local. In this paper, we explore the perhaps simplest instance of a feasibility algorithm, namely when each constraint set consists of only finitely many points. We numerically investigate four constellations: either few or many constraint sets, with either few or many points. Each constellation is tackled by four popular projection methods each of which features a tuning parameter. We examine the behaviour for a single and for a multitude of orbits, and we also consider local and global behaviour. Our findings demonstrate the importance of the choice of the algorithm and that of the tuning parameter.
Keywords:
c
yclic Douglas–Rachford algorithm, Douglas–Rachford algorithm, extrapolated parallel projection method, method of cyclic projections, nonconvex feasibility problem, optimization algorithm, projection
AMS 2010 Subject Classification: 49M20, 49M27, 49M37, 65K05, 65K10, 90C25, 90C26, 90C30
.1 Introduction
\runinhead
Background
Let be a Euclidean space (i.e., a finite-dimensional Hilbert space), with inner product and norm . The feasibility problem is a common problem in science and engineering: given finitely many closed subsets of , it asks to
[TABLE]
We henceforth assume that the intersection is nonempty. Algorithms for solving (FP) exist when the constraint sets allow for simple projectors (i.e., nearest-point mappings). When is convex, then the projector is a nice (firmly nonexpansive and single-valued) operator defined on the entire space ; when is not convex, then is nonempty and set-valued. For notational simplicity, we will use to denote an arbitrary but fixed selection of the set-valued projector. (If is a subset of , then is a minimizer of the function , where . For other notions not explicitly defined in this paper, we refer the reader to BGMbib-BBor .)
Assuming that the operators are readily available and implementable, one may try to solve (FP) iteratively by generating a sequence of vectors in that employs the projection operators in some fashion to produce the next update. There are hundreds of papers dealing with algorithms for solving convex or nonconvex feasibility problems. Thus, we refrain from providing a comprehensive list of references and rather point to the following recent books and “meta” papers as starting points: BGMbib-BBor ; BGMbib-BC ; BGMbib-BK ; BGMbib-Ceg ; BGMbib-CZak ; BGMbib-CZ ; BGMbib-Com97a ; BGMbib-Com97b . (We note that the recent manuscript BGMbib-BDL deals with a feasibility problem where one set is a doubleton.) The convergence theory in the nonconvex case is much more challenging and usually of local character.
\runinhead
Goal of this paper
The goal of this paper is to showcase the surprising numerical complexity of the most simple instance of (FP); namely, when each constraint set {svgraybox}
[TABLE]
In this case, the projection operator is very easy to implement — this is achieved by simply measuring the distance of the point to each point in and returning the closest one. Furthermore, we will restrict ourselves to the simple case when the underlying space {svgraybox}
[TABLE]
is simply the Euclidean plane. Even in this setting, the difficulty and richness of the dynamic behaviour is impressively illustrated.
It is our hope that the complexity revealed will spark further analytical research in feasibility algorithms with the goal to explain the observed complexity and ultimately to aid in the design of new algorithms for solving difficult feasibility problems.
\runinhead
Organization of the paper
The remainder of the paper is organized as follows. In Section .2, we present the four constellations we will use for our numerical exploration throughout the remainder of the paper. These constellations correspond to feasibility problems that we will attempt to solve using the algorithms listed in Section .3. Section .4 provides details on the implementation and execution of the numerical experiments. The “best” tuning parameter is determined in Section .5. We then track typical orbits of the algorithms in Section .6. Local and global behaviour is investigated in Section .7. Some interesting (and beautiful) behaviour outside of the main numerical experiments are collected in Section .8. The final Section .9 contains some concluding remarks.
.2 The four constellations
Even though we restrict ourselves already to finitely many constraint sets with finitely many points in the Euclidean plane, the infinitely many possibilities to experiment make it a daunting task to explore this space. We opted to probe this universe as follows.
The points in each constraint set are chosen randomly. We will ensure that the origin belongs to each set {svgraybox}
[TABLE]
to have a consistent feasibly problem with {svgraybox}
[TABLE]
We will focus on two alternatives for the number of constraint sets, either “few” or “many”. We will also consider constraint sets with a maximum number of points in the constraint sets, either “few” or “many”. From now on, we will use the following language: {svgraybox}
- •
The number of few sets is .
- •
The number of many sets is .
- •
The number of few points is .
- •
The number of many points is .
This will give rise to four constellations: few sets with few points, few sets with many points, many sets with few points, and many sets with many points. The four constellations used in our numerical experiments are shown in Figure 1.
.3 The four feasibility algorithms
We will numerically solve instances of (FP) using four algorithms which we briefly review in this section. While there is a myriad of competing algorithms available, our selection consists of trustworthy “work horses” that have been employed elsewhere and for which the convergence theory in the convex case is fairly well understood. Each of these algorithms has a “tuning” parameter in the range . The default value is . Guided by experiments, we will also (numerically) look for the “best” value . We now turn to these four algorithms. Each algorithm will have a governing sequence driving the iteration, and a (possibly different) monitored sequence which is meant to find a solution of (FP).
\runinhead
Cyclic Projections (CycP)
Given , the governing sequence is defined by {svgraybox}
[TABLE]
The default parameter is , from the range . The sequence monitored is \big{(}\tfrac{1}{m}\sum_{i=1}^{m}P_{C_{i}}x_{k}\big{)}_{k\in\mathbb{N}}. Selected references: BGMbib-BBor ; BGMbib-BC ; BGMbib-BK ; BGMbib-Ceg ; BGMbib-CCCDH ; BGMbib-CZ ; BGMbib-Com97b .
\runinhead
Extrapolated Parallel Projections (ExParP)
Given , the governing and monitored sequence is defined by {svgraybox}
[TABLE]
if ; otherwise. The default parameter is , from the range . Selected references: BGMbib-BC ; BGMbib-BCK ; BGMbib-Com97a .
\runinhead
Douglas–Rachford (DR)
Given , , , and , the next iterate is , where {svgraybox}
[TABLE]
The default parameter is , from the range . The sequence monitored is . Selected references: BGMbib-BC ; BGMbib-BM ; BGMbib-EckBer ; BGMbib-ERT ; BGMbib-LM .
\runinhead
Cyclic Douglas–Rachford (CycDR)
Given , the governing sequence is defined by {svgraybox}
[TABLE]
The default parameter is , from the range . The sequence monitored is \big{(}\tfrac{1}{m}\sum_{i=1}^{m}P_{C_{i}}x_{k}\big{)}_{k\in\mathbb{N}}. Selected references: BGMbib-Luke ; BGMbib-DP18a ; BGMbib-LST . (For other cyclic version of DR, see BGMbib-BNP ; BGMbib-BT . Also, if and , then is actually unbounded when and .)
.4 Setting up the numerical explorations
\runinhead
Stopping criteria The feasibility measure
[TABLE]
where , vanishes exactly when . We stop each algorithm with monitored sequence either when
[TABLE]
or when the maximum number of iterations, which we set to 1000, is reached. These values were chosen to allow a reasonable exploration of the feasibility problem while maintaining computational efficiency.
\runinhead
Details on program A program was developed in C++ to run the different experiments, see Figure 2 for two screenshots which we describe next. In the main tab of the user interface one can select the algorithm to be used and set up the problem to be solved by choosing the number of sets and the maximum number of elements per set. By clicking on the diagram showing the current constellation of points, the user can select a starting point and immediately observe the resulting orbit being rendered over the constellation. The graph of the feasibility measure , corresponding to the current orbit, is also displayed.
The Cartographer tab allows the exploration of a very large number of starting points to construct a picture of the performance of a given algorithm. This two-dimensional plot shows for each starting point the number of iterations required to solve the problem, ranging from zero (black) to the maximum number of iterations allowed (white). The plot is generated progressively and uses Quasi-Monte Carlo sampling for the selection of the starting points. This is the most computationally intensive part of the software, and it is fully multi-threaded to take advantage of modern processor architectures.
.5 Determining the “best” parameter
In this section, we consider our four constellations (see Section .2) and run on each of them the four algorithms (see Section .3) with the parameter ranging over . The curves shown in Fig. 3, 4, 5, and 6 give an estimate of the success rate of each algorithm, evaluated for 200 evenly-spaced values of . For each value of , 5000 starting points are drawn from using Quasi-Monte Carlo sampling, and the success rate is estimated by dividing the number of times the algorithm is successful by this number of starting points. Thus, a “best” parameter is determined. It is this parameter that we will use to compare with the default parameter , which is in all cases.
\runinhead
Discussion
For each of the four constellations considered above, we visually inspected the -curves indicating success rates. We then picked for each algorithm a parameter called which improved performance over the default parameter . The results are recorded in the following table.
We will use the parameters for the experiments in subsequent sections.
.6 Tracking orbits
In this section, we consider our four given constellations (see Section .2). For each constellation, which is organized in a separate subsection, the same starting point is used. We then consider each of our four fixed algorithms (see Section .3) and show orbits for and for (see Section .5), and the corresponding feasibility measure (see Section .4).
.6.1 Few sets with few points
.6.2 Few sets with many points
.6.3 Many sets with few points
.6.4 Many sets with many points
.6.5 Discussion
The numerical results in this subsection suggest the following: The most challenging constellation is the one with few sets and many points. The least challenging constellation is the one with many sets and few points for which all algorithms are successful.
The worst algorithm is CycP. ExParP with solves all four constellations. DR solves all constellations but has to be chosen appropriately. CycDR works well with in terms of number of iterations required; however, it was not able to solve the constellation with few sets and many points.
The experiments in this section suggest that (i) ExParP, DR, and CycDR are algorithms worthwhile exploring and that (ii) experimenting with may lead to improved convergence.
Because the results in this section feature a fixed starting point, we will explore in the next section the four constellations for a multitude of starting points.
.7 Local and global behaviour
In this section, we continue to consider our four constellations (see Section .2) which our four algorithms attempt to solve (see Section .3).
In contrast to Section .6 where we tracked a single orbit, we here illustrate local and global behaviour of the algorithms for a multitude of starting points, sampled from and , respectively. We do this for and for (see the table in Figure 7 in Section .5);
For each starting point in the given range, these plots display as gray levels the number of iterations the algorithm needed in its attempt to solve the problem represented by the given constellation. Black corresponds to the minimum number of iterations (zero), and white to the maximum number of iterations (1000). The latter is obtained when the algorithm is unsuccessful. Therefore, the darker the image, the better the performance.
To quantitatively assess the performance of each algorithm, success rates are also provided. These are obtained by dividing the number of times the algorithm is successful by the number of starting points used.
Each of these images was generated using at least 15 million starting points. Depending on the constellation, the time required to generate these pictures ranged between a few minutes to about 3 hours using a quad-core computer.
.7.1 Few sets with few points
.7.2 Few sets with many points
.7.3 Many sets with few points
.7.4 Many sets with many points
.7.5 Discussion
Comparing the success rates reported in the figures above, it appears that ExParP, DR, and CycDR are good choices; we recommend that CycP be not used. The effect of the tuning parameter is very striking for most algorithms when comparing performance of with .
.8 Divertissements
We experimented also with other constellations and encountered some interesting behaviour of ExParP. This algorithm seems to exhibit fractal-like behaviour for some constellations — whether they are created randomly or not. In the following, we present three images that we found particularly delightful in Figure 16 and Figure 17.
.9 Concluding remarks
We encountered a somewhat surprising complexity in the behaviour of four algorithms for solving feasibility problems in a simple nonconvex case. The importance of the tuning parameter is apparent as is the proximity to solutions (local vs global behaviour). Further studies are needed to find effective guidelines for users in terms of choice of algorithms and the choice of the parameter . Finally, and similarly to BGMbib-Boretal , we encountered beauty in our numerical explorations. It is our hope that others will join us and explore theoretically and numerically this fascinating universe of constellations.
Acknowledgements
We thank the referee for constructive comments and suggestions. The research of HHB was partially supported by NSERC.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Bauschke, H.H., Borwein, J.M.: On projection algorithms for solving convex feasibility problems. SIAM Rev. 38 , 367–426 (1996)
- 2(2) Bauschke, H.H., Combettes, P.L.: Convex Analysis and Monotone Operator Theory in Hilbert Spaces , second edition. Springer, Cham (2017)
- 3(3) Bauschke, H.H., Combettes, P.L., Kruk, S.G.: Extrapolation algorithm for affine-convex feasibility problems. Numer. Algorithms 41 , 239–274 (2006)
- 4(4) Bauschke, H.H., Koch, V.R.: Projection methods: Swiss army knives for solving feasibility and best approximation problems with halfspaces. Contemp. Math. 636 , 1–40 (2015) doi: 10.1090/conm/636/12726
- 5(5) Bauschke, H.H., Lindstrom, S.B., Dao, M.N.: The Douglas–Rachford algorithm for a hyperplane and a doubleton. J. Glob. Optim. , to appear. https://arxiv.org/abs/1804.08880 [math.OC] (2018)
- 6(6) Bauschke, H.H., Moursi, W.M.: On the Douglas–Rachford algorithm. Math. Prog. (Ser. A) 164 , 263–284 (2017)
- 7(7) Bauschke, H.H., Noll, D., Phan, H.M.: Linear and strong convergence of algorithms involving averaged nonexpansive operators. J. Math. Anal. Appl. 421 , 1–20 (2015)
- 8(8) Borwein, J.M., Lindstrom, S.B., Sims, B., Schneider, A., Skerritt, M.P.: Dynamics of the Douglas–Rachford method for ellipses and p 𝑝 p -spheres. Set-Valued Var. Anal. 26 , 385–403 (2018)
