Identification of points using disks
Valentin Gledel, Aline Parreau

TL;DR
This paper investigates the minimal number of disks needed to uniquely identify points in the plane, providing bounds, complexity results, and efficient algorithms under certain conditions.
Contribution
It establishes tight bounds on the number of disks for point identification, proves NP-completeness for fixed-radius disks, and offers a linear-time solution for colinear points.
Findings
Approximately n/3 disks suffice under general position
NP-completeness of fixed-radius disk identification
Linear-time algorithm for colinear points
Abstract
We consider the problem of identifying n points in the plane using disks, i.e., minimizing the number of disks so that each point is contained in a disk and no two points are in exactly the same set of disks. This problem can be seen as an instance of the test covering problem with geometric constraints on the tests. We give tight lower and upper bounds on the number of disks needed to identify any set of n points of the plane. In particular, we prove that if there are no three colinear points nor four cocyclic points, then roughly n/3 disks are enough, improving the known bound of (n+1)/2 when we only require that no three points are colinear. We also consider complexity issues when the radius of the disks is fixed, proving that this problem is NP-complete. In contrast, we give a linear-time algorithm computing the exact number of disks if the points are colinear.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Identification of points using disks
Valentin Gledel111Univ Lyon, Université Claude Bernard Lyon 1, LIRIS - CNRS UMR 5205, F69622 (France). E-mail: [email protected]
Aline Parreau222Univ Lyon, Université Claude Bernard Lyon 1, CNRS, LIRIS - CNRS UMR 5205, F69622 (France). E-mail: [email protected]
Abstract
We consider the problem of identifying points in the plane using disks, i.e., minimizing the number of disks so that each point is contained in a disk and no two points are in exactly the same set of disks. This problem can be seen as an instance of the test covering problem with geometric constraints on the tests. We give tight lower and upper bounds on the number of disks needed to identify any set of points of the plane. In particular, we prove that if there are no three colinear points nor four cocyclic points, then disks are enough, improving the known bound of when we only require that no three points are colinear. We also consider complexity issues when the radius of the disks is fixed, proving that this problem is NP-complete. In contrast, we give a linear-time algorithm computing the exact number of disks if the points are colinear.
1 Introduction
Let be a set of points of the plane . What is the minimum number of disks so that each point is contained in a disk and no two points are in exactly the same set of disks? In other words, we want to find a minimum set of disks such that every point is in a disk and the disks that contain a given point uniquely determine it. Such a set of disks (not necessarily minimum) is said to identify . See Figure 1 for an example of an identifying set of disks.
The motivation of this problem comes from the localization of indviduals and more generally from the following setting of identification problems: Given a set of individuals with binary attributes that each individual can have or not, the goal is to choose a minimum number of attributes in such a way that each individual has a unique set of attributes. This problem is known in the literature as the test covering problem [16] or identifying codes problem in hypergraph [15] since one can represent the data by a hypergraph where individuals are vertices and attributes are hyperedges. It has many application in particular in medical diagnostics and pattern recognition [13, 16, 20].
In a context of localization, the attributes are defined by the metric of the space where individuals live. As an example, in the context of identifying codes [12], individuals are vertices of a graph. Then the attributes are defined by the closed neighbourhoods, meaning “to be closed to”. Choosing some attributes is equivalent to setting detectors on some vertices that are able to detect errors in their neighbourhood. Then the set of detectors is able to detect any intrusion in the graph. Indeed, if there is an intrusion on a vertex, then the set of detectors that have detected something uniquely determines where the intrusion is. Locating-dominating sets [22, 23] and open locating dominating sets [21] are defined in a similar way. These concepts are studied by various authors since the 1970s and 1980s, and have been applied to various regions such as fault-detection in networks [12, 25] or graph isomorphism testing [2].
In this paper, we consider that individuals (which are the points in our problem) are living in . A detector can be placed anywhere, with any radius of detection and thus is represented by a disk. It can be formulated as a test covering instance: a set of individuals share an attribute if they can be isolated from the other individuals by a disk. It is also related to identifying codes in graphs: if the detectors must be located on points and have a fixed radius, a natural graph structure emerges. Then our problem is equivalent to the problem of identifying codes in unit disk graphs (in the general case) or in unit interval graphs (if points are colinear).
Another motivation comes from the notion of geometric separator in computational geometry [7]. Let be finite disjoint sets of . A finite set of curves in the plane is a separator for the sets ,…, if every connected component in contains points from only one set . Finding separators is a classical problem of computationnal geometry, in particular when considering image analysis. The most studied case is and separation with lines or circles [1]. Our problem – if we forget the condition that each point must be in a disk – can be considered as a separating problem where each set contains only one point and is a union of circles. This problem has been mentionned by Gerbener and Tóth [10] who have considered more generally separation with convex sets. They in particular proved that circles are enough to separate points even if they are in a general configuration (no three colinear points). Separators of single points have also been studied for lines. Bolland and Urrutia [19] gave an algorithm of time complexity to find a family of lines that separates any set of points in a general configuration. Cǎlinescu, Dumitrescu and Wan [5] proved that in the particular case where the lines are parallel to the axis, the problem is NP-complete and gave a constant approximation polynomial algorithm for this case. A natural extension in higher dimension, called multi-modal sensor allocation problem, has been defined in [14], making links with identification problems. Note that the separating problem with lines is a subproblem of ours. Indeed, if the points are given, one can consider a line as a very large circle.
In Section 2, we give formal definitions and background that will need along the paper. In Section 3, we study particular configurations of points: colinear or forming a grid. For colinear points, we give the exact number of disks needed if any radius can be used. If the points are on a grid, we give exact values for height 2 and bounds for larger heights. In Section 4, we give tight lower and upper bounds: we prove that at least and at most disks are necessary ( is the number of points). If moreover there are no three colinear points nor four cocyclic points, then we prove, using Delaunay triangulation, that we need at most disks. Finally, in Section 5, we discuss the complexity of the problem when the radius is fixed , we prove that it is NP-complete in the general case but that there is a linear algorithm to solve it when the points are colinear.
2 Definition and background
2.1 Formal definition
Let be a set of points of . A disk of radius and center is the set of points of at distance at most of . A point is covered by a disk if it belongs to it. Two points and of are separated by a disk if exactly one of them is covered by . A set of disks is identifying if it is covering all the points of and separating all the pairs of points of . We denote by the minimum number of disks needed to identify . Let , we denote by the minimum number of disks of radius needed to identify . When is large enough compare to the distances between the points of , any disk of radius is separating the same pairs of points as some half-plane. Hence, identification with half-planes is a particular case of identification with disks of fixed size. We will denote by the corresponding number.
Remark.
In our definition, we ask that every point of must be covered by at least one disk. This choice could be discussed. Indeed, it is not the case for similar notions like separating families or test covers. We choose this definition to be consistent with our first motivation: in a context of localization, our detection system must be able to detect if there is an intrusion or not, which is possible only if all the points are covered. This is the reason why in identifying codes there is the condition of domination (see Section 2.2 for formal definition of identifying codes). However, if a set of disks is separating all the pairs of points of , at most one point is not covered (otherwise all the points that are not covered will not be separated). Therefore, we need at most one more disk to obtain an identifying set. Hence the difference between the two values is at most one and our results can be easily adapted for only-separating sets.
For any radius and any points and of , there is always a disk of radius that separates them. Hence, and are well-defined and always smaller than . Moreover, if the radius is not fixed or small enough, one can take for each point a disk containing only this point and then forms an identifying set of disks. Thus, we have . About the lower bound, consider a set of disks identifying . Since each point is contained in a unique non-empty subset of , there are at most points in , leading to the following lower bound on :
Lemma 1**.**
Let be a set of points of , then .
These trivial lower and upper bounds are not tight and will be improved in Section 4.
Finally, since a set of disks identifying is identifying any subset of , we have the following lemma:
Lemma 2**.**
Let and be two sets of points of with , then .
2.2 Related work
Among the related notions given in the introduction, we give formal definitions for three of them that we will need in the rest of the paper.
Separating families of disks.
If is only separating any pair of vertices of , is a separating family of disks, studied by Gerbner and Tóth [10] in the more general context of convex sets. They in particular consider the parameter and ) which stand for the maximum number of disks that are needed to separate any -point set and any -point set in general position (no three of its points are on a line). They prove that . Since at most one more disk is necessary to obtain an identifying set of disks from a separating set of disks, it means that is at most . We will improve this bound in Section 4 to if moreover no four points are cocyclic.
Identifying codes in unit interval and unit disk graphs.
Let be a graph. A vertex dominates a vertex if is in the closed neighbourhood of (i.e: and its neighbours). It separates two vertices and if it is dominating exactly one of them. An identifying code of is a subset of vertices such that each vertex is dominated by some vertex of and each pair of vertices of is separated by some vertex of . We denote by the minimum number of vertices in an identifying code of . Note that is not always well-defined since might have two vertices with exactly the same neighbourhood and thus no vertex can separate them. Such vertices are called twin vertices. If a graph does not have any twins, then it has an identifying code (take for example all the vertices in ).
Identifying codes are closely related to identifying sets of disks when considering graphs of geometric intersections. Given a set of geometric objects, one can define its intersection graph as follows. Vertices are the objects and there is an edge between two objects if they intersect. A class of graphs of particular interest for us is the class of unit disk graphs that are the intersection graphs of disks of radius 1. Let be a unit disk graph and denote by the set of centers of the disks forming . Then an identifying code of is equivalent to an identifying set of using disks that have radius 2 and are centered on points of . Indeed, a disk of radius 2 centered on a point of contains all the points that are centers of disks of the closed neighbourhood of the disk corresponding to in . Identifying codes in unit disk graphs have been studied by Müller and Sereni [17] who prove, in particular, that the minimization problem in NP-complete. If the points of are colinear, then is a unit interval graph. The complexity of identifying codes in unit interval graphs is surprisingly still open [9] (but has been proved to be NP-complete for interval graphs).
Junnila and Laihonen [11] studied identifying codes in the grid using Euclidean balls. The underlying graph has the set as vertices and the closed neighbourhood are given by the Euclidean balls of a fixed radius . This graph can also be seen as an (infinite) unit disk graph. They give lower and upper bounds on the density of minimum identifying codes in this graph in function of .
Identifying codes in hypergraphs.
The notion of identifying codes can be extended to hypergraphs. Let be a hypergraph. An identifying code of is a set of hyperedges such that:
- •
each vertex of is in at least one element of ;
- •
for each pair of vertices of , there is an element of containing exactly one element of the pair.
An identifying code in a graph is equivalent to an identifying code in the hypergraph of the closed neighbourhoods of . As said in the introduction, this notion is known under different names and has actually been introduced before identifying codes in graphs, see [15, 16]. Our problem can be reduced to identifying codes in hypergraph. Indeed, let be a set of points of . Let be the hypergraph with vertex set and a set of points is a hyperdege if there exists a disk such that . Then finding an identifying set of disks identifying is equivalent to finding an identifying code in . Note that an hyperedge of of size corresponds to a nonempty cell in the iterated Voronoï diagram of size of and can be computed in time [18]. The whole hypergraph can be obtained by computing all iterated Voronoï diagrams of . This can be done in time and the number of hyperedges of is of order [8].
3 Particular configurations
3.1 Colinear points
When points are located on a single line, the problem is completly solved with the following theorem.
Theorem 3**.**
Let be a set of colinear points, then .
Proof.
Let be a set of colinear points located on a line . We denote by the points, respecting their order on .
Let be a set of disks identifying . For any , and are separated by . It means that there is a disk , such that its perimeter intersects between and . Moreover, and are covered by , thus there is a disk whose perimeter intersects before and after . In total, there are at least intersections between and some disks’ perimeters. Since a circle intersects a line into at most two points, we necessarily have .
To prove the equality, note that for any subset of consecutives points of , there exists a disk such that . Then the set of disks
[TABLE]
has size and is identifying . See Figure 2 for an illustration with nine points.
∎
In the solution given in the proof of Theorem 3, some disks might have a big radius. Actually, if the radius of the disks is bounded by a constant , disks are sometimes needed and can take any value between and . In Section 5, we give an algorithm that computes in linear time.
3.2 Points located on a grid
We now consider points located on a regular grid. Given two integers and , we denote by the set of points of such that and .
3.2.1 Grids of height 2
When the grid contains only two lines, one can identify the points using the same number of disks than on a single line, except in few cases:
Theorem 4**.**
Let be an integer. We have:
[TABLE]
Proof.
We can first see that for all , . Indeed, to identify one can use the method proposed in Theorem 3 and add an half-plane (which can be seen as a very large disk) to separate the lines as in Figure 3.
For grids with , this solution is optimal by Lemma 1. In Section 4 Proposition 7, we show that at least five disks are needed to identify a set of 14 points, so there is no better solution for . For all the other cases, we show that we only need disks. We only have to study the case where is odd or equal to 6. Indeed, by Lemma 2, solution for is also a solution for by removing the points of the last column.
We first give a characterization for a set to be the intersection of and a disk. Let be such a set, is the union of two sets of consecutive points of the first line , … , and of the second line , … , , with , and . We must have either or and the difference between each extremities must differ of at most 1: . This condition is sufficient since for every verifying this relation, there exist a disk that contains exactly these consecutive points.
An explicit solution for the grids , and are the following disks :
- •
can be identified by the set of disks : , , and .
- •
can be identified by the set of disks : , , , and .
We now give a solution for grids , with . This solution use three different steps. Figure 4 gives an illustration of these three steps.
The first step is to use the disks , , and . After adding these disks, the points of each line are separated from the other line. Indeed, the points of the first line in the intervals and are in the disk and are not in the disk which separate them from all the points of the second line. Similarly the points of the first line and in the intervals and are in the disk and are not in the disk , which separate them from all the points of the second line.
In the second step, we add the disks and . These two disks separate the points on the columns , and , which weren’t until now.
After this, all the points are covered by at least one disk and the points that are no separated from each other are the same line and on the intervals , , and (the last two intervals occurs if ).
In the third step, we can now finish identifying the points by adding the following concentric disks :
, , … , , , , … , .
We use four disks in the first part, then two disks and finally . So in total we use disks, which is equal to disks.
For the grids , we can remove the points of the columns 1 and and the disk .
So we can indeed identify the grids , with with disks.
∎
3.2.2 General case
We now consider the general case of grids , . We first solve the case of identification with half-planes - which can be considered as disks with infinite radius.
Theorem 5**.**
Let be two integers. Then, .
Proof.
We denote by the points on the convex hull of , respecting their order.
Let be a set of half-planes identifying . For any , and are separated by (with associated with ). It means that the there is a half-plane whose boundary line intersects the convex hull of between and . In total, there are at least intersections between the convex hull of and boundary lines of some half-planes of . Since a line intersects a convex polygon into at most two points, we necessarily have .
Consider any vertical lines between adjacent points and horizontal lines between adjacent points (see Figure 5 for an example). Then every pair of points is separated by a line. To obtain a solution, one just need to choose half-planes with these lines as boundary and in such a way that every point is covered. ∎
This theorem gives a bound for the general case: . This bound is not tight, especially when is large enough compared to . Next theorem gives a better (but still not tight) bound in this case:
Theorem 6**.**
Let and be two integers such that and . Then .
Proof.
The idea is to use a method similar to the one described in Figure 3. We use half-planes to separates the lines and disks to separates the columns. When is large enough the disks act on each line in the same way they act when there is only one line.
Since we use half-planes to separate the lines and include all the points into a disk: the bottom half-plane includes all the points above itself and the top half-plane includes all the points below itself.
We now use disks of radius and centered on with an integer between 0 and . Since , those disks contains points on each line and they separates all the columns. An example of such disks can be seen in Figure 6. Since all the points are inside a half-plane, there is no need for all the columns to be inside a disk. That is why we only need disks instead of as in the case of one line.
There is disks to separate the columns and half-planes to separate the lines, this gives us in total.
∎
4 Extremal cases
In this section, we give tight lower and upper bounds on using the number of points of when the points are in general configuration.
4.1 Lower bound
The logarithmic lower bound given in Lemma 1 is the natural lower bound for identifying codes in hypergraphs. It is tight if any hyperedge is allowed. But if there is some structure on the hyperedges, this is not always true. In particular, if the hyperedge set has bounded dual VC-dimension , then the lower bound is at least of order [3]. This is the case for our problem since the hypergraph induced by disks have bounded dual VC-dimension equals to 3, leading to a lower bound of order . However, this bound is still not tight. Indeed, we provide in this section a lower bound of order . This bound comes from the fact that an arrangment of disks can create at most inner faces. This classical result can be proved by induction with the argument that each time one add a circle to a set of circles it cross each circle at most twice (see [24] for more details and references). Since if some disks are identifying a set of points, there is at most one point in each faces of the intersections of the disks, we have the following bound:
Proposition 7**.**
Let be a set of points of . Then, This bound is tight.
To obtain a set of points reaching the bound, one can use an arrangment of disks making inner faces and set one point in each face. Such an arrangment can be obtained by taking disks of radius , centered on vertices of a -regular polygon that is inscribed in a circle of radius 1. See Figure 7 for a construction with k=5.
Since an identifying code of a unit disk graph can be seen as an identifying set of special disks, the lower bound of Property 7 is still true for identifying codes in unit disk graphs, improving the bound given in [4]. Moreover, this bound is also tight for this case. Indeed, the construction of Property 7 can be adapted for identifying codes of unit disk graphs since all the disks have the same radius and their center can be points.
4.2 Upper bound
We now consider the worse configurations of points. Otherwise said, what is the minimum number of disks that is enough to identify any set of points? This question has already been solved by Gerbner and Tóth when one just wants to separate points [10]. They prove that disks are always enough and that this is the best value one can obtain since there are point sets needing this number of disks. Since one more disk is enough to obtain an identifying set, it gives us the bound . Actually, we can slightly improve it by noticing that in the proof of Gerbner and Tóth [10], all the points are covered if there is an odd number of points. For the sake of completeness we give the proof, it follows the one of [10].
Proposition 8**.**
Let be a set of points of . Then, . This bound is tight.
Proof.
Since is finite there exists a direction we can choose as abscissa, such that no pair of points have the same abscissa. Then there is a line of constant abscissa which separates the points in two parts of the same size up to one. Let be the half-plane defined by and containing the biggest part of points as one of the disks. We choose a direction perpendicular to the abscissa as the ordinate.
At first, all the points are part of a set and the set of identifying disks contains only . Then we repeat the following operation times. Consider the convex hull of , exactly two of its edges cross . Let be the edge which intersects on the largest ordinate. Since it is an edge of the convex hull, there is no point of with a larger ordinate than and . Therefore there is a disk that contains only and among the points of . Add this disk to the set of identifying disks and remove and from . Iterate the process.
This algorithm gives a set of disks that identifies of size . Indeed, at each step, and are separated from all the other points of by and since all the other points have been considered previously they are also separated from them. Moreover, since is an edge that crosses , and are separated from each other by . At the end, if there is an odd number of points, there is a point that is only in , since this is the only point that is only in , it is identified.
Figure 8 illustrates some steps of the algorithm. This bound is tight when all the points are colinear (see Theorem 3). ∎
All the values between the lower bound of Property 7 and the upper bound of Property 8 are reached:
Theorem 9**.**
Let and be such that .
There exists an -point set of such that .
Proof.
Consider the optimal arrangement of disks based on a regular polygon given on Figure 7. There is a line cutting this construction into regions. Indeed, let be a line going through an intersection of disks and the center of the polygon, for symmetry reason, this line goes through regions and intersections. So by shifting the line infinitesimally and parrallely, it is still going through the previous regions but for each intersection we have a new region. Therefore, there is indeed regions crossed by this new line .We set points on in the different regions of the arrangement of disks. Since we can set the remaining points in the other regions of the arrangement. At the end, the set of disks of the arrangment identifies the points that we have put in its different regions and, since there are colinear points, no smaller set of disks can identify these points.
∎
4.3 Improved upper bound for general configurations
The upper bound of Proposition 8 is tight for colinear points. A natural question is whether the bound is still tight if there are no three colinear points among . Actually, the bound is also tight if the points are cocyclic. But, if there are no three colinear points nor four cocyclic points in , the upper bound is not tight anymore. In this section, we said that a set of points of is in general configuration if there are no three points of on a line nor four points of on a circle.
Theorem 10**.**
Let be a set of points in general configuration. Then
The idea of the proof of this theorem is to give an algorithm that constructs an identifying set of disks of size . The algorithm is based on the same principle that we used in the not restricted case:
Divide in three equal parts using lines; 2. 2.
Choose a disk that contains exactly one point in each part, remove these points and repeat the operation.
The crucial part is to find the disk of the second step. For that, we use Delaunay triangulations - that is a triangulation of the points in such a way that the circumcircle of each triangle only contains its vertices. Since there are no three colinear nor four cocyclic points, such a triangulation always exists (and is unique). To find the disk of Step 2, we then need to find a Delaunay triangle that has a vertex in each part, as illustrated in Figure 9. To insure the existence of such a triangle, we need to be more precise at Step 1.
Before going into details, we need two preliminary results.
Theorem 11** (Ceder [6]).**
For points of with no three colinear points, there is a way to divide the plan in six regions containing each between and points using three concurrent lines.
Lemma 12**.**
Let be a set of points of , a line and a half-line with origin on . If each of the three regions , and made by and contains one point of and if is in the convex hull of , then every triangulation of contains a triangle that has a vertex in each region.
Proof.
Let be a triangulation of . Since the intersection of and is inside the convex hull, there is at least a segment of between any pair of regions.
Consider the segments between the regions separated by , namely and . Let be the segment which cut the closest from . Since is in the convex hull of , there is at least one point of such that is a triangle of in the direction of from this segment. If is in or , then the segment or would intersects closer to than , which contradicts the hypothesis that is the closest segment to . Therefore is in and form a triangle with one vertex in each region.
∎
Proof of Theorem 10.
Using Theorem 11, there exist three concurrent lines , and that divides the plane into six regions of the same size up to one. Let be their common intersection. Let , and be three half-planes defining by , and such that every point is in at least one half-plane. Let , , , , and be the six regions of the plane created by these lines, as illustrated in Figure 10.
First consider the regions , and . Each of these regions contains between and points. For construction reasons, if there is a point in each region then is inside the triangle formed by these points, so Lemma 12 always applies.
Consider the following process. Add , and to the future set of identifying disks. Set in all the points of , and . Then repeat the following operation times. Consider the Delaunay triangulation of , at least one triangle has its vertices in the three different regions. Since it is a triangle of a Delaunay triangulation, its circumscribed circle contains no other remaining points. Add , the disk of perimeter , to the set of identifying disks, remove , and from and iterate the process.
We do the same iterated operation for the regions , and .
At the end of each step of the process all the considered points are separated from all the other points. Indeed, at each step , and are separated from the points of the other regions by the half-planes and, since each considered triangle comes from a Delaunay triangulation, their circumscribed circle contains no other points of .
If a point has not been considered at the end of the processes, it is alone in its regions and therefore is isolated from all the other points. Moreover, by the selection of the half-planes, it is inside at least one half-plane so it is covered.
Therefore, this algorithm constructs an identifying set of disks of size . ∎
The previous bound is tight, up to a constant 2, when points are located on an half-parabola, the curve constituted of one side of a parabola symmetry axis:
Proposition 13**.**
Let be a set of points placed on an half-parabola. Then,
Proof.
Let be a set of points located on a half-parabola . We denote by the points, respecting their order on , with the closest to the extrema of .
Let be a set of disks identifying . For any , and are separated by . It means that there is a disk whose perimeter intersects between and . Moreover, is covered by , thus there is a disk whose perimeter intersects after . In total, there are at least intersections between and some perimeters of disks of .
We now prove that a circle can intersect into at most three points. Let . Without loss of generality, satisfies the following set of equations, with , , that are constant.
[TABLE]
In particular, is a solution of:
[TABLE]
There is no term in in the previous equation. Thus, if , , and are solutions of (1, we have . Since , there are at most three possible values for .
Since there are at least intersections between and an identifying set of disks and since a circle intersects a half-parabola at most three times, we necessarly have .
∎
5 Complexity when the radius is fixed
In this section, we consider the complexity of the following decision problems (with ):
Identification-Disk(r)
Instance: A finite set , an integer .
Question: Is it true that ?
Colinear Identification-Disk(r)
Instance: A finite set of colinear points, an integer .
Question: Is it true that ?
Theorem 14**.**
Identification-Disk*(r) is -complete.*
Proof.
We prove the result for which is not restrictive. We reduce this problem from the problem of partitioning a grid graph into path on three vertices. A grid graph is a graph with vertex set included in and two vertices are adjacent if they are at Euclidean distance 1.
-Partition-Grid
Instance: A grid graph .
Question: Is there a partition of the vertices of in such a way that each part induces a path on three vertices?
Bevern et al. [26] proved that this problem is -complete.
Let be an instance of -Partition-Grid and . The instance of the identification problem is and . We have to prove that has a -partition if and only if is identified by disks of radius .
Assume first that there is a partition of into path on three vertices. For each part of the partition, add to the identifying set the disk of radius that contains and and the disk that contains and . Since and are edges of , these disks exist and contains exactly two points. Furthermore, is the only point that is contained only in , is the only point that is contained only in and is the only point that is contained exactly in and . Hence, we obtain an identifying set of disks of size .
Assume now that there is an identifying set of disks that identify with disks. Since the points are at distance at least 1, every disk contains at most two points. Without loss of generality, we can suppose that if a disk contains only one point, then this point is not included in any other disk. Indeed, assume there are two points and two disks and such that contains only and contains both and , then we can replace by that contains only and the situation is similar, is still identified and the number of disks is the same.
Let be the number of disks containing only one point. Let be the points not covered by these disks. Let be the graph with vertices and edges if are contained together in a disk of . Note that is a subgraph of . The graph has vertices and its connected components have at least three vertices. Indeed, if a component as only two vertices then these vertices are not identified. So there are connected components. We name these connected components . The number of edges of is .
Since a disk is either containing one point (and there are such disks) or corresponds to an edge of , there are at least disks in . Therefore we necessarily have and there are exactly connected components in , each of them being of size 3. This is a -partition of . ∎
However if all of the points are colinear then this problem can be solved in linear time:
Theorem 15**.**
Colinear Identification-Disk*(r) can be solved in linear time.*
Note that if the disks are required to be centered on the points, this problem is equivalent to the problem of identifying codes in unit interval graphs, whose complexity is surprisingly still open.
To prove Theorem 15, we introduce few definitions and preliminary results. Let be a set of colinear points on a line and is a set of disks of radius identifying . Let ,.. be the points of . We confuse with its abscissa on and we assume that .
Note that, since the points are colinear and the centers of the points can be chose anywhere on the plane, a set of points of can be the intersection of and a disk of radius if and only if there are consecutive points of at distance at most . In the following, we will refer often directly to the set of points contained in a disk instead of itself.
The set is optimal if , it is perfect if is odd and if (in particular is optimal).
The disks of partitions the points of on connected components. More formally, we define an equivalence relation meaning that and are connected by a path of disks. For and two points of , we have if and only if there is a disk in that contains both and , or there is a point in such that and .
The equivalence classes of are made of consecutive points. Let be the disks containing points the points of .
A set of disks is piece-wise perfect if it is optimal and if each perfectly identifies .
Lemma 16**.**
For any set of colinear points , there is a set of disks of radius that identifies and is piece-wise perfect.
Proof.
Let be a set of disks that identifies optimally and such that is minimal. We will prove that this set is piece-wise perfect.
Assume the contrary. It means that there is a set which is not perfectly identified by . Following the proof of Theorem 3, this means that there are two disks and whose perimeters intersect between the same pair of adjacent points of or both before the first point of or both after the last point of . Let be the points covered by and the points covered by .
**Case 1 **: (the case is similar). Suppose, without loss of generality, that . Let be a disk that contains the points from to , such a disk exist because its intersection with is included in . Then identifies . Indeed, the only point of for whom the situation is different for and is . For it is the only point of that is inside and not inside . So we have a new set of disks that identifies and such that the sum of the number of points contained in each disk is smaller. This is a contradiction to the minimal property of .
Case 2 : (the case is similar). Since is an equivalence class for the relation , we must have and there must be a disk such that contains both and . Let the first point of and its last point.
Subcase 2.1 : .
Let be a disk that contains the point from to , such a disk can exist because it is smaller than , and let . identifies and contradict the minimal property of .
Subcase 2.2 : (the case is similar).
Let be a disk that contains the point from to , such a disk can exist because it is smaller than and be a disk that contains the points from to , such a disk can exist because it is smaller than . The set of disks identifies . Indeed, the only points of for whom the situation is different for and are those between and . They are still separated from each other by the disks that separates them in and they are separated from the other points because they are the only points that are in and not in . The sum of the number of points contained in the disk is the same for and .
Finally, the sum of the number of points contained in the disks remains the same, but now the disk that contains both and does not contain the disks that intersect the region between and . So we are now in Subcase 2.1 and we can apply the method used in this case, concluding the proof.
∎
A set of disks identifying a set of colinear points is in normal form if is odd and if :
- •
and is composed of a unique disk containing the point of .
or
- •
, , and with containing the points and , containing the points and and, for , containing the points , and .
In particular, is perfect.
Lemma 17**.**
For any set of colinear points , if there is a set of disks that perfectly identifies , then there exists a set that identifies and is in normal form.
Proof.
If there is only one point in , then the only way to identify perfectly is to have a set that contains exactly one disk which contains the point of , it always exists and it is already in normal form.
Suppose that is of size with . We show that if there is no set of disks identifying in normal form then there is no set perfectly identifying .
Assume that there is no set of disks identifying in normal form but that a set perfectly identifies . Necessarily, has only one equivalnce class and all the adjacent points of are distant at most .
Since there is no possible set in normal form, there exists such that the distance between the points and is greater than . Let be the set and be the set . Let (respectively ) be the subset of disks of that contains at least one point of (resp. ). The intersection between and is empty since the distance between and is at least . By Theorem 3, since identifies and identifies , and . So . Hence does not identify perfectly, a contradiction. ∎
Proof of Theorem 15.
By Lemma 16, there exist a set identifying that is piece-wise perfect. By Lemma 17, every perfect part of that set can be identified by a set of disks in normal form. So there is a piece-wise perfect set of disks such that every is in normal form.
We now give an algorithm that finds an optimal solution to identify with connected sets of disks that are in normal form :
This algorithm takes the biggest connected sets of disks in normal form starting with the first point of . We prove that this is optimal. Assumme it is not the case. Let be a set of points such that the set given by the algorithm is not an optimal solution. We choose with a minimum number of points. Let be an optimal set in normal form. Its first connected component is smaller than the first connected component of , . Indeed, it cannot be bigger since the algorithm take the biggest connected component and it cannot be the same since, by minimality of , the algorithm is optimal on the rest of the points. So identifies , the points of that are not in the disks of with less disks than uses to identify , the points of that are not in . Since , also identifies , and thus with less disks than . This contradicts the minimality of .
This algorithm is linear since we consider each point at most once.
So there is a linear algorithm to find the maximum number of disks needed to identify a set of points if each connected part must be in normal form. By Lemma 16 and Lemma 17, this algorithm also gives a solution to Colinear Identification-Disk(r). ∎
6 Conclusion
We conclude with some open problems. About complexity issues, we do not know if computing a minimum identifying set of disks when the radius is not fixed is -complete, but the contrary would be surprising. The complexity of identification with lines seems to be also open. An intersecting question is what is the number of disks needed if the points are randomly chosen in a square. It would also be interesting to consider identifications with other sets or in higher dimensions using balls instead of disks.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Esther M Arkin, Ferran Hurtado, Joseph SB Mitchell, Carlos Seara, and Steven Skiena. Some separability problems in the plane. In Euro CG , pages 51–54, 2000.
- 2[2] László Babai. On the complexity of canonical labeling of strongly regular graphs. SIAM Journal on Computing , 9(1):212–216, 1980.
- 3[3] Laurent Beaudou, Florent Foucaud, Peter Dankelmann, Michael A Henning, Arnaud Mary, and Aline Parreau. Bounding the order of a graph using its diameter and metric dimension: a study through tree decompositions and vc dimension. ar Xiv preprint ar Xiv:1610.01475 , 2016.
- 4[4] Nicolas Bousquet, Aurélie Lagoutte, Zhentao Li, Aline Parreau, and Stéphan Thomassé. Identifying codes in hereditary classes of graphs and vc-dimension. SIAM Journal on Discrete Mathematics , 29(4):2047–2064, 2015.
- 5[5] Gruia Călinescu, Adrian Dumitrescu, Howard Karloff, and Peng-Jun Wan. Separating points by axis-parallel lines. International Journal of Computational Geometry & Applications , 15(06):575–590, 2005.
- 6[6] Jack G Ceder. Generalized sixpartite problems. Bol. Soc. Mat. Mexicana (2) , 9:28–32, 1964.
- 7[7] Olivier Devillers, Ferran Hurtado, Merce Mora, and Carlos Seara. Separating several point sets in the plane. In In Proc. 13th Canadian Conference on Computational Geometry . Citeseer, 2001.
- 8[8] Herbert Edelsbrunner, Joseph O’Rourke, and Raimund Seidel. Constructing arrangements of lines and hyperplanes with applications. SIAM Journal on Computing , 15(2):341–363, 1986.
