Temporal Clustering
Tamal K. Dey, Alfred Rossi, Anastasios Sidiropoulos

TL;DR
This paper introduces the concept of temporal clustering for sequences of unlabeled point sets, proposing optimization problems and algorithms that balance cluster count, spatial cost, and displacement, with theoretical bounds and limitations.
Contribution
It formulates a new framework for clustering evolving data, generalizing classical clustering objectives to temporal settings and providing algorithms with theoretical guarantees.
Findings
Developed algorithms balancing cluster number, cost, and displacement.
Established inapproximability results for temporal clustering.
Generalized classical clustering objectives to temporal data.
Abstract
We study the problem of clustering sequences of unlabeled point sets taken from a common metric space. Such scenarios arise naturally in applications where a system or process is observed in distinct time intervals, such as biological surveys and contagious disease surveillance. In this more general setting existing algorithms for classical (i.e.~static) clustering problems are not applicable anymore. We propose a set of optimization problems which we collectively refer to as 'temporal clustering'. The quality of a solution to a temporal clustering instance can be quantified using three parameters: the number of clusters , the spatial clustering cost , and the maximum cluster displacement between consecutive time steps. We consider spatial clustering costs which generalize the well-studied -center, discrete -median, and discrete -means objectives of classical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Temporal Clustering
Tamal K. Dey
Dept. of Computer Science and Engineering, and Dept. of Mathematics, The Ohio State University. Columbus, OH, 43201. [email protected]
Alfred Rossi
Dept. of Computer Science and Engineering, The Ohio State University. Columbus, OH, 43201. [email protected]
Anastasios Sidiropoulos
Dept. of Computer Science and Engineering, and Dept. of Mathematics, The Ohio State University. Columbus, OH, 43201. [email protected]
Abstract
We study the problem of clustering sequences of unlabeled point sets taken from a common metric space. Such scenarios arise naturally in applications where a system or process is observed in distinct time intervals, such as biological surveys and contagious disease surveillance. In this more general setting existing algorithms for classical (i.e. static) clustering problems are not applicable anymore.
We propose a set of optimization problems which we collectively refer to as temporal clustering. The quality of a solution to a temporal clustering instance can be quantified using three parameters: the number of clusters , the spatial clustering cost , and the maximum cluster displacement between consecutive time steps. We consider spatial clustering costs which generalize the well-studied -center, discrete -median, and discrete -means objectives of classical clustering problems. We develop new algorithms that achieve trade-offs between the three objectives , , and . Our upper bounds are complemented by inapproximability results.
Contents
1 Introduction
Clustering points in a metric space is a fundamental problem that can be used to express a plethora of tasks in machine learning, statistics, and engineering, and has been studied extensively both in theory and in practice [3, 7, 12, 18, 19, 20, 22, 23, 25, 26, 28, 31]. Typically, the input consists of a set of points in some metric space and the goal is to compute a partition of minimizing a certain objective, such as the number of clusters given a constraint on their diameters.
We study the problem of clustering sequences of unlabeled point sets taken from a common metric space. Our goal is to cluster the points in each ‘snapshot’ so that the cluster assignments remain coherent across successive snapshots (across time). We formulate the problem in terms of tracking the centers of the clusters that may merge and split over time while satisfying certain constraints. Such instances are common in the study of time-evolving processes and phenomena under discrete observation. As an example consider a hypothetical study which aims to track the spread of a certain genetic mutation in plants. Here, data collection efforts center on annual field surveys in which a technician collects and catalogs samples. The location and number of mutation positive specimens change from year to year. Clustering such spaces is clearly a generalization of classical (static) clustering, which we refer to as temporal clustering. In this dynamic variant of the problem, apart from the number of clusters and their radii, we also wish to minimize the extent by which each cluster moves between consecutive snapshots.
Related work
Clustering of moving point sets has been studied in the context of kinetic clustering [1, 5, 16, 17, 15, 13, 30]. In that setting points have identities (labels) which are fixed throughout their motion, the trajectories of the points are known beforehand, and the goal is to design a data structure which can efficiently compute a near-optimal clustering for any given time step. In our setting, since the points are not labeled there is, a priori, no explicit motion. Instead we are given a sequence of unlabeled points in a metric space and are required to assign the points of each to a limited number of temporally coherent clusters. Motion emerges as a consequence of cluster assignment. Consequently, kinetic clustering algorithms cannot be used in our setting. Another related problem concerns clustering time series under the Fréchet distance [9], with the clusters being constrained to move along polygonal trajectories of bounded complexity. This constraint is used to avoid overfitting, and is conceptually similar to our requirement that the clusters remain close between snapshots.
1.1 Problem formulations
Let us now formally define the algorithmic problems that we study in this paper. Perhaps surprisingly, very little is known for temporal clustering problems. There are of course different optimization problems that one could define; here we propose what we believe are the most natural ones.
We first define how the input to a temporal clustering problem is described. Let be a metric space. Let be a sequence of finite, non-empty metric subspaces (points) of . We refer to individual elements of this sequence (the ‘snapshots’) as levels, and collectively to as a temporal-sampling of of length . The size of is the total number of points over all levels, that is . Let be a sequence of points such that is a single point. We say that is a trajectory of , and we let denote the set of all possible trajectories of . For some , we denote by the set of points of the trajectories in which lie in . In other words, . The set of trajectories induces a clustering on each level by assigning each to the trajectory that minimizes . We refer to the points of as the centers of level . Intuitively, this formulation allows points in different levels of which are assigned to the same trajectory to be part of the same cluster; see Figure 1. Further, observe that trajectories may overlap allowing clusters to merge and split implicitly; see Figure 3(a). We refer to as a temporal-clustering of .
We now formalize the clustering objectives. Our approach is to treat temporal clustering as a multi-objective optimization problem where we try to find a collection of trajectories such that their induced clustering ensures three conditions: (i) points in the same cluster remain near between successive levels (locality), (ii) the restriction of the clustering to any single level fits the shape of the data (spatial constraint), and (iii) we do not return excessively many clusters (complexity). To measure how far some trajectory jumps, we define its displacement, denoted by , to be We also define the displacement of to be Finally, we consider three different objectives for the spatial cost, which correspond to generalization of the -center, -median, and -means respectively. The first one, corresponding to -center, is the maximum over all levels of the maximum cluster radius; formally where . The second and third spatial cost objectives, which corresponding to discrete -median, and discrete -means (respectively), are defined to be , and .
Definition 1.1**.**
Let , . We say that a set of trajectories is a temporal ,,-clustering of if , , and . (See Figure 1 for an example.) We further define temporal -median-clustering and -means-clustering analogously by replacing by and respectively.
We now formally define the optimization problems that we study. In the case of static clustering, a natural objective is to minimize the maximum cluster radius, subject to the constraint that only clusters are used; this is the classical -Center problem [22]. Another natural objective in the static case is to minimize the number of clusters subject to the constraint that the radius of each cluster is at most , for some given ; this is the -Dominating Set problem [21]. Our definition of temporal clustering includes the temporal analogues of -Center and -Dominating Set as special cases.
Definition 1.2** (Temporal ,,-Clustering problem).**
An instance of the Temporal ,,-Clustering problem is a tuple , where is a metric space, is a temporal-sampling of , , , and . The goal is to decide whether admits a temporal ,,-clustering.
Definition 1.3** (Temporal ,,-Clustering approximation).**
Given an instance of the Temporal ,,-Clustering problem consisting of a tuple , a ,,-approximation is an algorithm which either returns a temporal ,,-clustering of , or correctly decides that no temporal ,,-clustering exists. In general , , and can be functions of the input.
We analogously define the Temporal ,,-Median Clustering problem and approximation, and the Temporal ,,-Means Clustering problem and approximation by replacing in Definitions 1.2 and 1.3 -clustering by -median-clustering and -means-clustering respectively.
1.2 Our contribution
To the best of our knowledge, this is the first study of the above models of temporal clustering. Our main contributions consist of polynomial-time approximation algorithms for several temporal clustering variants, and hardness of approximation results for others.
Temporal clustering. We begin by discussing our results on Temporal ,,-Clustering. We first consider the problem of minimizing and while keeping fixed. This is a generalization of the static -Center problem. The classical greedy algorithm that yields the -approximation to the static case, which is known to be optimal assuming [22], does not appear to be applicable in the temporal setting. The reason is that a static solution cannot account for the requirement that the cluster centers of the same trajectory cannot be too far apart in consecutive levels. We present a polynomial-time -approximation algorithm where using a different method. More specifically, our result is obtained via a reduction to a network flow problem. We show that the problem is \NP-hard to approximate to within polynomial factors even if we increase the radius by a polynomial factor. Formally, we show that it is \NP-hard to obtain a -approximation.
Next we consider the problem of minimizing the number of clusters , while fixing and . This is a generalization of the static -Dominating Set problem. We obtain a polynomial-time -approximation algorithm. For the static case, the polynomial-time -approximation algorithm follows by a reduction to the Set-Cover problem, and is known to be best-possible [8, 29, 11]. However, in the temporal case, this reduction produces an instance of Set-Cover of exponential size. Thus, it does not directly imply a polynomial-time algorithm for Temporal -Dominating Set. We bypass this obstacle by showing how to run the greedy algorithm for Set-Cover on this exponentially large instance in polynomial-time, without explicitly computing the Set-Cover instance. We also argue that -approximation is best possible by observing that -approximation is \NP-hard for any .
We further present a result that can be thought of as a trade-off between the above two settings by allowing both the number of clusters and the radius to increase. More precisely, we obtain a polynomial-time -approximation algorithm where . Interestingly, we can show that obtaining a -approximation is \NP-hard.
The following summarizes the above approximation algorithms.
Theorem 1.4**.**
Temporal ,,-Clustering* admits the following algorithms:*
- 1.4.1.
,,-approximation where , 2. 1.4.2.
,,-approximation, 3. 1.4.3.
,,-approximation where ,
where is the size of the temporal-sampling. Moreover, the running time of all of these algorithms is .
We prove Theorems 1.4.1, 1.4.2, 1.4.3 in Sections 2.1, 2.2, 2.3, respectively.
It is important that the approximation in displacements for Theorem 1.4.1 and Theorem 1.4.3 takes into account the factor if a polynomial time algorithm is aimed for. This is because our inapproximability results as summarized below show that the problem is NP-hard otherwise.
Theorem 1.5**.**
The status of Temporal ,,-Clustering with temporal-samplings of size is as follows:
- 1.5.1.
There exist universal constants , such that ,,-approximation is \NP-hard for any where and . 2. 1.5.2.
,,-approximation is \NP-hard for any fixed , . 3. 1.5.3.
There exists a universal constant such that ,,-approximation is \NP-hard for any fixed , .
Moreover, items 1.5.1 and 1.5.3 remain \NP-hard even for temporal-samplings in -dimensional Euclidean space.
We discuss Theorem 1.5.1 in section 3.1. The discussion of Theorem 1.5.2, and Theorem 1.5.3 are deferred to section 3.2, and section 3.3, respectively.
Temporal median clustering. We next discuss our result on the Temporal ,,-Median Clustering problem. The static -Median problem admits a -approximation via local search [4, 27]. In Section 2.4 we show that the local search approach fails in the temporal case, even on temporal samplings of length two. We present an algorithm that achieves a trade-off between the number of clusters and the spatial cost. The result is obtained via a greedy algorithm, which is similar to the one used for the -Set Cover problem. The result is summarized in the following theorem.
Theorem 1.6**.**
For any fixed , there exists a , -median-approximation algorithm with running time , on an instance of size and a metric space of spread .
The result is obtained by iteratively selecting a trajectory which minimizes a certain potential function. The proof uses submodularity and monotonicity of the potential function. These properties remain true if the potential function is modified by replacing with , and thus an identical theorem holds for Temporal -Means.
We complement the above algorithm by showing the following hardness result.
Theorem 1.7**.**
The status of Temporal ,,-Median Clustering with temporal-samplings of size is as follows:
- 1.7.1.
There exist universal constants , such that ,,-approximation for Temporal -Median is \NP-hard for any where and . 2. 1.7.2.
Let , be the constants from Theorem 3.6. Let . Then -approximation is \NP-hard for any fixed and some constants , .
Moreover, item 1.7.1 remains hard even for temporal-samplings from -dimensional Euclidean space.
The clustering instances used in the proofs of Theorem 1.7.1 in section 3.4 and Theorem 1.7.2 in section 3.5 involve clusterings which use only a constant number of points per cluster, thus the same constructions suffice to prove hardness of Temporal ,,-Means Clustering with only slight modification of the distances.
Additional notation and preliminaries. Let . An -net in some metric space is some maximal , such that for any , with , we have . Let be a temporal-sampling of length in some metric space . Let for all . For any trajectory , and for any , the tube around of radius , denoted by , is defined to be , where for , , we use the notation to denote a closed ball of radius . Let . The directed graph has as vertices for all . For any there is an edge between and whenever (see Figure 2).
2 Algorithms
2.1 Exact number of clusters: ,,-approximation
In this section, we consider the problem of computing a temporal clustering by relaxing the radius and the displacement, while keeping the number of clusters exact. This is a temporal analogue of the -Center problem. We first present a polynomial time ,,-approximation where . In section 3.1 complement this with an inapproximability result.
An auxiliary network flow problem. The high-level idea of the polynomial time algorithm is to use a reduction to a specific network flow problem. Specifically, we seek a minimum flow which satisfies lower bound constraints along certain edges. This is the so-called minimum flow, or minimum feasible flow problem [2, 14]. We now formally define this flow network. For each , let . Let . We construct a flow network, denoted by where is the sequence of centers for . We start with the graph . In level , we replace each vertex for by a pair of vertices and , and we connect them by an edge . For vertices where we define . Now for any vertex , all incoming edges to become incoming edges to , and all outgoing edges from become outgoing edges from . We add a source vertex and a sink vertex . For all , we add an edge from to . Similarly, for all , we add an edges from to . We set the capacity of each edge to be . Finally, we set a lower bound of to the capacity of every edge , for all , , (see Figure 3(b)).
Algorithm. We first compute a net at every level of the temporal-sampling and then we reduce the problem of computing a temporal clustering to a flow instance, using the network flow defined above. By computing an integral flow and decomposing it into paths, we obtain a collection of trajectories. The lower bound constraints ensure that all net points are covered; this allows us to show that all points are covered by the tubular neighborhoods of the trajectories. Formally, the algorithm consists of the following steps:
Step 1: Computing nets. For each , compute a -net of . If for some , , then return .
Step 2: Constructing a flow instance. We construct the minimum flow instance .
Step 3: Computing a collection of trajectories. If the flow instance is not feasible, then return . Otherwise, find a minimum integral flow in , satisfying all the lower bound constraints. Decompose into a collection of paths, each carrying a unit of flow. The restriction of each path in is a trajectory. Output the set of all these trajectories.
Throughout the rest of this section let be a temporal-sampling. We now show that if there exists a temporal ,,-clustering, then the above algorithm outputs a temporal -clustering where .
Lemma 2.1**.**
Suppose that admits a temporal ,,-clustering, . For each let denote the level centers of , and let be a -net of . Then the map which sends each -net center to a nearest center in is injective.
Proof.
First, observe that for each , because -balls centered at the points in cover and hence . For injectivity of , observe that, for because otherwise the inequality holds violating the property that is a -net. ∎
Since for each , the map is injective, it follows that . So, we have the following immediate Corollary.
Corollary 2.2**.**
If admits a temporal ,,-clustering then for any , any -net of has .
Lemma 2.3**.**
If admits a temporal ,,-clustering then for any level-wise -net , the flow instance admits a feasible flow of value .
Proof.
Fix a temporal ,,-clustering and let denote one of its trajectories. The graph contains a path corresponding to as the distance between any pair of consecutive points in is at most . For each , let denote a map which sends each -center of to a nearest center in . We modify to produce some path in as follows: for every level such that for some net-point we let , otherwise we set . We observe that in the worst case the distance between consecutive points, say and , is at most because of the following inequality (see Figure 4) . It follows that is indeed a path in . Further, by the injectivity of each map (Lemma 2.1) which is used in deforming to , we have that for every net point, there exists some that contains it. In other words, all net points are covered by the paths . For each optimal trajectory , let be the path in obtained from by connecting to the first vertex in , and the last vertex in to . By routing a unit of flow in along each such we obtain a flow of value at most that meets all the demands along the edges corresponding to net points , concluding the proof. ∎
Lemma 2.4**.**
Given , , , and a temporal-sampling , with , there exists an -time algorithm that either correctly decides does not admit a temporal ,,-clustering, or outputs some temporal ,,-clustering.
Proof.
Lemmas 2.2 and 2.3 imply that if a temporal ,,-clustering exists, then the algorithm does not return , and thus outputs a set of at most trajectories. Let be the temporal clustering corresponding to . Each trajectory in corresponds to a path in , thus has displacement at most . Therefore . Since is a feasible flow, it follows that all lower bound constraints in are satisfied. Thus for all , for all , there exists at least one unit of flow along the edge corresponding to the vertex ; it follows that there exists some trajectory containing in level . Since for all , is a -net of , it follows that . Thus , which implies that . We thus obtain that is a temporal ,,-clustering. Finally, we bound the running time. Computing the -nets over all levels, checking their sizes can be done in time. Building and can be done in time. Finding an integral solution to takes time using the algorithm of Gabow and Tarjan [14]. Decomposing the resulting flow takes time. We conclude that the entire procedure completes in time. ∎
Writing , we immediately obtain Theorem 1.4.1 from Lemma 2.4.
2.2 Exact radius and displacement: ,,-approximation
In this section we consider the case where the number of clusters is allowed to be approximated in analogy to the static -Dominating Set problem. We present a polynomial-time ,,-approximation algorithm. In Section 3.2 we argue that this result is tight in the sense that obtaining a ,,-approximation is \NP-hard for any fixed .
Let be a temporal-sampling of length . For any , we denote by the set of all trajectories of displacement at most . Given an instance of the Temporal ,,-Clustering problem consisting of a tuple , the high level idea is to express the problem as an instance of Set-Cover. Recall that an instance of Set-Cover consists of a pair , where is a set, and is a collection of subsets of . The goal is to find some , minimizing , such that , if such exists. We set and We will show that a solution to the Set-Cover instance can be used to obtain a temporal ,,-clustering. Note that can have cardinality exponential in the size of the input. However, as we shall see, we can still obtain an approximate solution for in polynomial-time.
We first establish that any -approximate solution to can be converted, in polynomial-time, to a temporal ,,-clustering. Let denote the minimum cardinality of any feasible solution for when it exists. Similarly, let denote the smallest value of such that admits a temporal ,,-clustering.
Lemma 2.5**.**
.
Proof of Lemma 2.5.
Let be a temporal ,,-clustering of , for some . Let be the natural decomposition of into a collection of trajectories, where , for all . Let . Since , it follows that , for all . Thus, , for all , which implies that . Since is a temporal ,,-clustering, it follows that for all , for all , there exists some such that . It follows that
[TABLE]
We have established that is a valid solution for , with , which implies that .
Conversely, let be a solution for . Let . Fix an arbitrary ordering . We may now define the clustering , where , for all , . Since is a feasible solution for , it is immediate that is a temporal ,,-clustering, which implies that . ∎
We next establish the following result which allows us to run the greedy algorithm for Set-Cover on the instance in polynomial-time, even though can be exponentially large.
Lemma 2.6**.**
Let . There exists an time algorithm which computes some , maximizing . Moreover, the algorithm outputs some trajectory , such that .
Proof of Lemma 2.6.
We first establish some notation. Let . Note that due to the orientation of the edges, any path in has at most one vertex at level for any . Recall that this vertex is of the form for some . For convenience we let whenever such a vertex exists. Otherwise, we say that . We extend the notion of a tube to paths in , by defining
[TABLE]
We also denote those elements of which are not covered by as . In other words,
The algorithm computes the desired via dynamic programming, as follows. The dynamic programming table is indexed by . For each , we compute a path in that we store at location of the table. More precisely, if there is no path from to some vertex in , where , then we set . Otherwise, we set to be a path that starts from , and terminates at some vertex , with , maximizing the quantity For every , the only choice for is . Thus we may fill in the entries of the table indexed by , for all . Next, for each , for each , we compute the path to be stored at location , assuming that all the entries indexed by , for all have already been computed. It is immediate for each that starts from and terminates at , we have where is the suffix of obtained after removing . Thus, in order to compute the desired path for that maximizes , it suffices to find the path that maximizes , and starts from some neighbor of in . This completes the description of the algorithm for filling in the values of the dynamic table. We may now set , where is the path stored in the entry , for some that maximizes . We now show how to complete this procedure in time. During a precomputation phase we can construct in time. Further, we can check whether an uncovered point intersects the in constant-time. This allows us to precompute and store for each node in time linear time, for a total of over all nodes. Now note that in populating the table, the algorithm visits each node in and evaluates the choice of taking the path associated with each of its successors in as a suffix. By also keeping the value of a path in the table this decision can be made in constant time using only stored or precomputed information. Thus the algorithm takes time over all. ∎
We are now ready to prove Theorem 1.4.2.
Proof of Theorem 1.4.2.
Recall that the classical greedy algorithm for Set-Cover computes a solution , if one exists, as follows: Initially, we set . At every iteration, we pick some such that is maximized, and we add to . The algorithm stops when either is covered by , or when no further progress can be made, i.e. when ; in the latter case, the instance is infeasible. It is well-known that this algorithm achieves an approximation ratio of for Set-Cover [24]. Now if is infeasible the above procedure detects this and terminates. Otherwise, let be the feasible solution found by repeatedly using the procedure described in Lemma 2.6. The corresponding trajectories returned by this procedure form a temporal ,,-clustering of , for some . By Lemma 2.5 it follows that . Thus we obtain an ,,-approximation. Finally, to bound the running time note that in the worst case, the total number of calls to the procedure in Lemma 2.6 is since at every step we cover at least uncovered point. The theorem now follows by the fact that each call takes time. ∎
2.3 Approximating all parameters: ,,-approximation
So far we have constrained either the number of clusters or the radius and the displacement to be exact. We now describe an algorithm that relaxes all three parameters simultaneously. We present a polynomial-time -approximation algorithm where . We complement this solution later in section 3.3 by showing that it is \NP-hard to obtain a -approximation for any .
Lemma 2.7**.**
If admits a temporal ,,-clustering then for any level-wise -net , the flow instance admits a feasible flow of value .
Proof.
Fix a temporal ,,-clustering . We inductively define a sequence , where for each , is a multiset of paths in . We set , where for each , we have . Next, we inductively define , for some . Starting with , we proceed to modify . By induction, it follows that the paths , , and share the same suffix at levels . Thus, and . Now, for the modification, we consider each , and proceed as follows (see Figure 5 for an illustration). Since is a valid temporal ,,-clustering, it follows from Lemma 2.1 that there exists an injective map from to the set so that for some and . We consider the following two cases:
Case 1: If is odd and for some , then we modify by replacing the vertex with .
Case 2: If is even and for some , then we modify by replacing the vertex with .
We next argue that the result is indeed a path in . Suppose that in the above step, we modify the path , for some so that . It follows by induction on that the path was not modified when constructing ; thus . Since , it follows by the triangle inequality that It follows that , which implies that each element of is indeed a path in . This completes the inductive definition of the multisets . It is immediate by induction that for each , for each , there exist some path that visits . We next transform the collection into a flow in . For each path , we obtain a path in the network starting from the source , then replacing for each , each by the edge , for , then terminating at the sink ; we route a unit of flow along the resulting path. Since for each , there exists some path in visiting each , it follows that all lower-bound constraints in are satisfied by . Since contains paths, it follows that the value of the resulting flow is , as required. ∎
We are now ready to prove Theorem 1.4.3.
Proof of Theorem 1.4.3.
For each , compute a -net of , and construct the flow network . Compute a minimum flow in satisfying all lower-bound constraints. If is infeasible (i.e. if there is no flow satisfying all lower bound constraints), or if the the value of the minimum flow in is greater than , it follows by Lemma 2.7 that does not admit a temporal ,,-clustering. Thus, in this case the algorithm terminates. Otherwise, we compute a minimum flow in . Since all capacities and lower-bound constraints in are integers, it follows that can be taken to be integral. We decompose into a collection of at most paths, each carrying a unit of flow. Arguing as in Lemma 2.4 we have that the restriction of these paths on is a set of trajectories that induces a valid temporal ,,-clustering of . This provides a ,,-approximation where . Finally, the running time is easily seen as by the same argument that appears in Lemma 2.4, concluding the proof. ∎
2.4 Approximation algorithm for temporal median clustering
In this section we consider variants of Temporal Clustering which evaluate the spatial cost of clustering by taking the level-wise maximum of discrete -median and discrete -means objectives. A natural question is whether or not the problem admits a -approximation via local search, as in static case [4, 27]. In Figure 6 we show that the local search approach fails, even on temporal samplings of length two. Instead, the result is obtained by iteratively selecting a trajectory which most improves a certain potential function. The result in this section is presented for the Temporal ,,-Median Clustering problem, and follows by submodularity and monotonicity of the potential function. These properties remain if is replaced with the , and thus holds identically for Temporal -Means.
We now present an approximation algorithm for the Temporal ,,-Median Clustering problem. Let be an input to the problem, where is a temporal-sampling of length . Let denote the size of the . Let also denote the spread of . That is, Since we only consider finite metric spaces, and since the single point case is trivial, w.l.o.g. we may assume that the diameter of is and minimum interpoint distance in is . For a set of trajectories we define We also define Intuitively, the quantity measures how far the solution is from the optimum; in particular, if then the spatial cost is within the desired bound.
Lemma 2.8**.**
The set function is submodular.
Proof.
Since the sum of submodular functions is submodular, it is enough to show that is submodular. Thus it suffices to show that is submodular, and thus it suffices to show that , for all , which is immediate since . ∎
Algorithm. Our goal is to compute some set of trajectories such that is sufficiently small, while minimizing . The algorithm consists of the following steps:
Step 1. Let be a set containing a single arbitrary trajectory.
Step 2. For any , let be a minimizer of . Set
Step 3. Return .
The parameter will be determined later.
The following Lemma bounds the running time of Step 2.
Lemma 2.9**.**
Given a clustering , we can find minimizing , in time .
Proof.
This can be done via dynamic programming. The proof is essentially the same as in Lemma 2.6 and is thus omitted. ∎
We next show that for some value of the parameter , the algorithm computes a solution with low cost. To that end, we show in the next Lemma that at each iteration of the main loop the quantity decreases by a significant amount.
Lemma 2.10**.**
If admits a temporal -median-clustering, then for any , there exists some feasible trajectory such that .
Proof.
Let be a set of at most trajectories that yields a -median temporal clustering. W.l.o.g. we may assume that . Let , and for any , let Since is a -median temporal clustering, it follows that For any , we also define . By Lemma 2.8 we have that for all , . That is, . Let It follows that Let . It immediately follows that , concluding the proof. ∎
We are now ready to prove Theorem 1.6.
Proof of Theorem 1.6.
We first note that if , then a solution with trajectories can be computed, if one exists, as follows: Since , it follows that every level of has at most points. We construct the flow network instance , as in Section 2.1. It is immediate that the flow instance is feasible iff there exists a solution with trajectories. For the remainder of the proof we may thus assume that . Since the minimum distance in is 1, it follows that . In a generic step , let denote the trajectory returned by the dynamic program of Lemma 2.9, which minimizes . By Lemma 2.10 we have that if admits a temporal -median-clustering, then there exists some trajectory such that . Thus . Since the diameter of is , we get . Setting , we obtain Thus which implies . It follows that either the computed solution is a -approximation, or does not admit a temporal -median-clustering, as required. Finally, the bound on the running time follows by the fact that we perform iterations of the main loop; the running time of each iteration is bounded by Lemma 2.9. ∎
3 Inapproximability
In section 3.1 we show that temporal-clustering with an exact number of clusters is \NP-hard to obtain a -approximation (Theorem 1.5.1), complementing Theorem 1.4.1. In section 3.2 we argue that the -approximation given in Theorem 1.4.2 is best possible by observing that -approximation is \NP-hard (Theorem 1.5.2), though the construction involves a somewhat unnatural metric space. In section 3.3 we show that -approximation is \NP-hard even for points sampled from -dimensional Euclidean space (Theorem 1.5.3). In section 3.4 (Theorem 1.7.1) and section 3.5 (Theorem 1.7.2), we adapt the hardness results for -approximation, and -approximation to the Temporal -Median setting. We observe that these constructions involve clusterings which use only a constant number of points per cluster, thus the same constructions suffice to prove hardness of Temporal ,,-Means Clustering with only slight modification of the distances.
3.1 Inapproximability with exact number of clusters
We complement Theorem 1.4.1 by showing that it is \NP-hard to obtain a -approximation. Further, this inapproximability result holds even for a temporal sampling in . Let be such a sample consisting of points. We show that given any fixed and there exists universal constants , such that the ,,-approximation problem is -hard. We describe a polynomial-time reduction from instances of - to temporal-samplings over . In particular, given any positive real numbers , , , and , and some -variable instance of -, we construct a temporal sampling such that the following conditions hold:
admits a temporal ,,-clustering if is satisfiable. 2. 2.
does not admit a temporal ,,-clustering otherwise.
Suppose we are given an instance of - with variables and clauses. W.l.o.g., we assume that every clause contains no repeated variables. We now describe how to produce the corresponding temporal-sampling :
Variable gadgets. For each variable of we introduce a pair of points in each level of . We denote these points by and and the pair by . We will sometimes refer to these points as literals. Initially, we lay out the variable gadgets in the plane such that for all , and for all . Here, denotes a yet to be determined function of the number of variables (see Figure 7(c)). We refer to this configuration as initial position.
Clause gadgets. Order the clauses of arbitrarily as . For each clause we build a series of levels for the temporal-sampling where the variable gadgets of that clause appear to undergo rigid motion see (Figure 7(a)). By motion we mean that points in any pair of subsequent levels which correspond to the same literal are within a bounded distance of each other. Further, we say it is rigid because in every level we maintain the distance of between literals of the same variable gadget. Note that we may enforce a consistent assignment of a center to a literal between consecutive levels by ensuring that the distance between a literal and its copy in the next level is at most , and that the distance between points corresponding to distinct literals exceeds . We describe this motion in three phases:
Phase 1: Assembly. All points start in the initial position (see Figure 7(c)). The variable gadgets which are used in the specified clause are brought together one by one under rigid motion (see Figure 7(a)). This motion brings the ends of the variable gadgets which correspond to the literals that appear in the clause to a single common point. The unused literals appear on a circle of radius about the common point, arranged such that they form an equilateral triangle (see Figure 7(c)). This phase takes steps per clause.
Phase 2: Satisfiability check. An extra point is introduced at the location where the variable gadgets meet. This point then undergoes motion directly away from one of the unused literals for a distance of , before reversing course and returning to the common point at the center of the gadget (see Figure 7(c)). It subsequently disappears. This phase takes steps per clause.
Phase 3: Disassembly. The motion of Phase 1 is reversed and the variable gadgets are returned to initial position in steps per clause.
Analysis. Let denote the temporal-sampling of given by the above construction, and let denote the smallest value of such that admits a temporal ,,-clustering.
Lemma 3.1**.**
If is satisfiable then .
Proof.
Since is satisfiable there exists a satisfying assignment. We now exhibit a solution of cost at most . For each of the variable gadgets, we set one of the two points to be a center in the initial configuration. If is True in the satisfying assignment then the point which corresponds to the literal is selected as a center. Otherwise, is selected. Note that the construction of the gadget ensures that the same choice of literals can be maintained throughout the entire motion. We maintain these choices during Phase 1 and Phase 3 where the only points which appear are from variable gadgets. Since at least one side of each variable gadget is covered and the distance between the points of these gadgets are at all times, the covering cost of each level in these phases is . By satisfiability there is at least one center at the common point where the clause literals meet. We take one such center and assign it to the extra point. Throughout Phase 2 the extra point is covered and the clause gadget retains at least one center (somewhere) at a coverage cost of at most . ∎
Lemma 3.2**.**
If is not satisfiable then .
Proof.
We will show that a satisfying assignment can be inferred from a clustering with cost below . First, we argue that the variable gadgets are consistent in initial position. Since , it follows that whenever the point configuration is in initial position, every variable gadget has exactly one center. Thus we do not simultaneously select a literal and its negation, as otherwise it follows by the pigeonhole principle that at least one variable gadget is uncovered and . Next, we argue that these choices must remain consistent within a clause gadget . The only opportunity for a trajectory to change literals is when two literals are within a distance of , and the only literals in the gadget which pass within are , , and . To see that inconsistency is expensive, assume that is a center in the first level of the gadget but not in the last, then the variable gadget which contains literal is uncovered in the last level for the clause, implying . Moreover, since the choice of cluster centers at the end of clause gadget must be the same as at the beginning of for . We now argue that at least one of the literals for a clause is a center. Suppose none of them are a center, then the extra point from Phase 2 never coincides with a center as and the closest possible location for a center to any point along its motion is at a distance of away. At its maximum displacement the extra point is at a distance of from where the literals meet, which is at least from any other center. Thus, . We produce a satisfying assignment by setting true those literals which are assigned centers in . ∎
Remark 3.3**.**
*We remark that Lemma 3.1 and Lemma 3.2 continue to hold even for for any . Essentially, there are two important distance scales given by and . Points in successive levels closer than can be assumed to be and vice versa. *
Lemma 3.4**.**
*Given an instance of - with variables and clauses. It is possible to construct the above temporal-sampling of size in -time. *
Proof.
This is immediate by construction, as each clause gadget consists of levels of size and there are clauses. ∎
We are now ready to prove Theorem 1.5.1.
Proof of Theorem 1.5.1.
Let . Let be an instance of - with variables and clauses. We invoke Lemma 3.4 with , and to yield, in polynomial-time, a temporal-sampling over of size . Note here we have used the fact that . Suppose the existence of some polynomial-time ,,-approximation where , and . There exists a universal constant (where by construction) such that . Thus we let , Remark 3.3 indicates that Lemma 3.1 and Lemma 3.2 still apply. We run this approximation on with , , and . In polynomial-time the algorithm either produces output or fails. If the algorithm fails it follows by Definition 1.3 that does not admit a temporal ,,-clustering and thus is not satisfiable by Lemma 3.1. Otherwise, the algorithm produces some clustering of radius . Lemma 3.2 ensures that if is not satisfiable then for some universal constant . It follows that if the algorithm produces output if and only if is satisfiable, giving a polynomial-time test for satisfiability. ∎
3.2 \NP-hardness of ,,-approximation
We now argue that the ,,-approximation is tight. The -Dominating Set problem for a metric space is to find a smallest set of points such that every point in is at most a distance from a point in . It is known that Set-Cover reduces to -Dominating Set in polynomial-time [10]. For completeness we now give such a reduction:
Folklore 3.5**.**
Set-Cover* reduces to -Dominating Set in polynomial-time. Moreover, any -approximate solver for -Dominating Set yields an -approximation for Set-Cover.*
Proof.
Let be an instance of Set-Cover, where . We define a metric space where contains points corresponding to the elements of , and , and where satisfies the following:
for all , . 2. 2.
for any , such that . 3. 3.
Otherwise, for any remaining pair of distinct points .
These constraints on , together with the requirement that it be a metric on , completely determine . (See Figure 8 for an illustration.)
Let be a feasible solution to the Set-Cover instance. Note that induces a -Dominating Set solution of size by taking the set of points in which correspond to the points in . On the other hand, any feasible solution of -Dominating Set can be converted into a feasible solution of Set-Cover in linear time without increasing its size. To see this fix a feasible solution , and let . Note that a -ball of in consists only of and points for elements of which cover it. Instead we can cover by any other point in its -ball, . Since the feasibility of the solution is preserved. Performing this replacement for all points in induces a feasible solution to Set-Cover of size no more than . It follows that any -approximate solver for -Dominating Set yields an -approximation for Set-Cover. ∎
We now prove Theorem 1.5.2.
Proof of Theorem 1.5.2.
As a corollary to Folklore 3.5, any polynomial-time ,,-approximation for -Dominating Set yields a polynomial-time -approximation for Set-Cover problem, by taking to be a single level temporal-sampling with and invoking it with , , and successive values of until it succeeds in producing a clustering. Since the first non-failing value of is at most the size of an optimum solution to Set-Cover, the resulting clustering is at most times larger. The hardness now follows for any by a result of Dinur and Steurer [8]. ∎
3.3 Inapproximability in -dimensional Euclidean space
While Theorem 1.5.2 shows that an increase in the number of clusters should be expected if we demand to have a polynomial-time algorithm that closely approximates the optimal radius and displacement, the construction involves a somewhat unnatural metric space. We show that this condition remains even for -dimensional Euclidean space.
Theorem 3.6** (Hardness of MAX--[6]).**
There exist constants , satisfying and , such that it is \NP-hard to decide whether a given -CNF formula admits an assignment which satisfies at least -fraction of the clauses, or whether any assignment satisfies at most -fraction of clauses.
Let be an instance of MAX-- consisting of variables and clauses. Given , our goal is to construct a temporal-sampling in polynomial-time such that:
admits a temporal ,,-clustering if there exists a truth assignment that satisfies at least clauses of . 2. 2.
does not admit a temporal ,,-clustering for any and , if every truth assignment satisfies at most clauses,
where and are the constants from Theorem 3.6
Variable gadgets.
For each variable of , let be the number of literals where it appears. We introduce pairs of points into each level of , which we denote by and for . Our initial arrangement of variable gadgets in the first level of is as in the top row of Figure 9.
Clause gadgets.
We arrange a configuration of points for each clause of by rigidly transporting one of the variable gadget copies of each of its variables to a predetermined location. In doing so we overlay them so that the points of each gadget corresponding to and overlap on one side (see Figure 9). The distance between two neighboring clause gadgets is .
Phase 1: Consistency checking and clause assembly. Each variable in corresponds to one or more variable gadgets (which we think of as copies). We check each pair of variable gadget copies to ensure that all copies have at least one side selected as a center. That is, either has a center for all , or all do (or both). Initially, we declare all variable gadget copies corresponding to the same variable as “unchecked”. For each variable we perform the following procedure: The unchecked copy with smallest index (the left most one, see Figure 9) undergoes rigid motion so that it is aligned end to end with a higher indexed copy . The two copies align with a distance of in-between. Moreover, the gadgets are consistently oriented with on the left and on the right. A consistency check is performed between the pair consisting of the following steps: An additional point is introduced for one level at the mid-way point between the two gadgets. It subsequently disappears and the two variable gadgets simultaneously rotate in place in the same direction about their respective midpoints so that they are both oriented in with on the left. The additional point reappears for a single level. After this the variable gadget copies rotate once more in place so that is on the right. The -th copy now undergoes rigid motion to check against the next copy (-st, if it exists). After all has checked with all , it is declared “checked” and undergoes rigid motion to take its place in the clause gadget. This process repeats until all copies of the variable are checked. Finally, the lone remaining copy undergoes rigid motion to its place in some clause gadget. The total traveled distance for any copy of a variable gadget is . Since it may only do so in steps of size , the total number of steps per copy is for a total of over all variable copies.
Phase 2: Satisfiability check. An extra point is introduced in each clause gadget. For each clause , this occurs at the point where the sides of the variable gadget copies corresponding to and have been identified. These points undergo motion along a line orthogonal to the variable gadget copies for a distance of (see Figure 9). This phase takes steps.
Lemma 3.7**.**
If there exists a truth assignment satisfying at least clauses then admits a temporal ,,-clustering.
Proof.
Suppose such a truth assignment, , exists. We use this assignment to determine of the cluster centers. For each variable , if is True we select the points for all as centers in all levels up to and including the first Phase 2 level. Otherwise we select all for all as centers in those same levels. With these selections each satisfied clause has at least one center present at the extra point of the clause gadget at the start of Phase 2. Hence, for satisfied clauses we arbitrarily select one of these centers and assign it to the extra point for all remaining levels. For the unsatisfied clauses we assign one of the remaining centers. Note that the ends of the variable gadgets that meet at the extra point of each unsatisfied clause correspond to (thus far) unselected literals. We select one of those literals from each clause to be a center in all levels up to and including the first Phase 2 level, and select each unsatisfied clause’s extra point in the remaining levels. The remaining centers can be assigned arbitrarily to some in all levels. Let the clustering induced by this choice of centers be denoted as .
We now argue that . First note that for each variable, , for any either the point or is a center by assignment from . Since the variable gadgets only undergo rigid motion, any level of with only the points from the variable gadgets is covered at a cost of at most . We need only consider the levels of where additional points exist. Since the assignment from is consistent, it follows that the extra points introduced during the consistency check involving and for are always within a distance of of their corresponding centers. For the levels in Phase 2 all extra points are covered and at least one center remains on each clause gadget, which has diameter . ∎
Lemma 3.8**.**
If every truth assignment satisfies at most clauses then does not admit a temporal ,,-clustering for ,
Proof.
We argue that if admits temporal ,,-clustering for some , , then there exists a truth assignment that satisfies more than clauses. For any variable, all copies of its gadget in the first level contain at least one center, as otherwise one of them is completely uncovered and the nearest center is at least a distance of away. Note that the only points of a variable gadget which pass within a distance of another are points which are eventually identified within a clause gadget (just before Phase 2). Thus, the choice of centers in the first level completely determines the choice of centers at the start of Phase 2. Further, this assignment is consistent in the sense that for each variable either for all are centers, or all of for all are (this is not mutually exclusive, some variable gadgets might have both). To see why suppose that for some variable there is a pair of copies corresponding to which disagree. In this case each copy has exactly one center. Without loss of generality suppose and have centers. There is a level where the extra point of the consistency check is exactly midway between the opposite ends of these copies. That is, the extra point is midway between and . It follows that the nearest centers to the extra point are at a distance of contradicting that . Finally, note that the extra point of each clause gadget is covered. This is only possible if at least one of the literals that meet at the extra point have a center. Now consider the truth assignment , where is True if and only if all for all are centers in Phase 1. Let denote the number of clauses satisfied by . We know that this assignment together with additional centers is sufficient to satisfy all clauses. That is, . Since there are only centers in total, it follows that the number of additional centers required is strictly less than . Thus, , and we conclude that . ∎
Lemma 3.9**.**
*Given an instance of MAX-- with variables and clauses, it is possible to construct the above temporal-sampling of size in -time. *
Proof.
This is immediate by construction. ∎
We are now ready to prove Theorem 1.5.3.
Proof of Theorem 1.5.3.
Let with , be given. Let . Let be an instance of MAX-- with variables and clauses which is promised to either admit a solution which satisfies at least clauses (high satisfiability) or does not admit any solution which satisfies more than clauses (low satisfiability), for and from Theorem 3.6. Let be the result of the above construction with . It follows from Lemma 3.9 that the size of is since . From this and the choice of there exists a universal constant such that . Suppose there exists some ,,-approximation for some with . Run the approximation on with , , . Then either it fails and Lemma 3.7 implies there does not exist a truth assignment satisfying at least clauses (thereby deciding that it is a low satisfiability instance in polynomial-time), or it outputs a clustering . Since , it is easy to see that Lemma 3.7 and Lemma 3.8 still apply. If is a high satisfiability instance then it follows by Lemma 3.7 that the resulting clustering consists of at most clusters with . Otherwise by Lemma 3.8 it cannot consist of fewer than centers with . Thus it is possible to distinguish high satisfiability instances from low satisfiability instances provided that and . The former inequality holds by choice of . From the later inequality it follows that the input classes will be distinguishable provided that . The theorem follows by setting to be the infimum of the right hand side subject to the constraints placed on and in Theorem 3.6. ∎
3.4 Inapproximability with exact number of clusters for Temporal -Median
We remark here that the construction of section 3.1 which demonstrates \NP-hardness of -approximation works for Temporal -Median with only minor modification.
Given any positive real numbers , , , and , and some -variable instance of Exact-3-SAT, using essentially the same construction as used in section 3.1 we show how to construct a temporal sampling such that the following conditions hold:
admits a temporal ,,-clustering if is satisfiable. 2. 2.
does not admit a temporal ,,-clustering otherwise.
The only difference will be in our selection of the value of the constant , which we determine later.
Lemma 3.10**.**
If is satisfiable then admits an temporal ,,-clustering.
Proof.
We make the identical assignments as Lemma 3.1, the only difference here is that we use the -median objective. The cost of covering the variable gadgets in Initial Position, Phase 1, and Phase 3 are . In Phase 2 one of the remaining centers near where the clauses meet needs to cover the points of the variable gadget which gives its center to the extra point. The cost of this level is . ∎
Lemma 3.11**.**
If is not satisfiable then does not admit an temporal ,,-clustering.
Proof.
The proof is identical to Lemma 3.2, except we use the -median objective and require that so that the spacing of the variable gadgets in initial position exceeds the cost of their -median clustering. ∎
Since the construction is the same as in section 3.1 up to the choice of , we can apply Lemma 3.4 to conclude .
We are now ready to prove Theorem 1.7.1.
Proof of Theorem 1.7.1.
Let . Let be an instance of - with variables and clauses. We invoke the construction of section 3.1 with , and yield, in polynomial-time, a temporal-sampling over of size . Note here we have used the fact that . Suppose the existence of some polynomial-time ,,-approximation where , and . There exists a universal constant (where by construction) such that . Thus we let , As with the remark of section 3.1, Lemma 3.10 and Lemma 3.11 still apply. We run this approximation on with , , and . In polynomial-time the algorithm either produces output or fails. If the algorithm fails it follows by Definition 1.3 that does not admit a temporal ,,-clustering and thus is not satisfiable by Lemma 3.10. Otherwise, the algorithm produces some clustering of radius . Lemma 3.11 ensures that if is not satisfiable then . Thus provided that , the algorithm produces output if and only if is satisfiable, giving a polynomial-time test for satisfiability. That is, whenever
[TABLE]
for some constant . ∎
3.5 \NP-hardness of -approximation for Temporal -Median
In this section we show that it is is \NP-hard to simultaneously approximate both the number of clusters and the spatial cost arbitrarily well.
Definition 3.12**.**
A -regular graph is an -expander if for every set where at least edges connect and .
Let be an instance of MAX-- consisting of variables and clauses. Given , our goal is to construct a temporal-sampling in polynomial-time such that:
admits a temporal ,,-clustering if there exists a truth assignment that satisfies at least clauses of . 2. 2.
does not admit a temporal ,,-clustering for any and , for some universal constant , and any , if every truth assignment satisfies at most clauses.
Here, and are the constants from Theorem 3.6.
Variable gadgets.
For each variable of , let be the number of literals where it appears. We introduce pairs of points into each level of , which we denote by and for . We set the distance for all in all levels. We think of each pair as a variable gadget for the variable so that any level contains “copies” of this gadget.
Clause gadgets.
We arrange a configuration of points for each clause of by transporting one of the variable gadget copies of each of its variables to a predetermined location. Ultimately, we overlay them so that the points of each gadget corresponding to and overlap on one side, with , overlapping on the other.
Phase 1: Initial layout of . Fix an . To determine the initial distances among the variable copies we generate a -vertex -regular -expander, for each variable . Fix bijections from to the set of copies of variable gadgets for in . For each edge we introduce a pair of auxiliary points , into and set the distances between these points and the points of variable gadgets , and according to Figure 10. For points of variable gadget copies from distinct variables , , we set for some to-be-determined constant, . We think of this process of specifying distances as building a weighted graph on the points of , where the weights are given by the distance values. Thus we allow any remaining unspecified distances of to be given by the shortest paths distance in this graph.
Phase 2: Isolation of variable gadgets. In all variable gadget copies are separated by distances of at least . In this phase we simultaneously expand the distances between the points of any pair of variable gadget copies to in steps of size at most . The only points present in these levels are the points of the variable gadget copies. The number of levels required for this stage is
Phase 3: Condensation of clause gadgets. For each clause . Let , denote the variables referenced by literals , and , respectively. We select , corresponding to yet unused copies of variable gadgets and , respectively. By a slight abuse of notation we label the points corresponding to and with the labels , . In other words corresponds to if it is negated, otherwise with defined analogously. We transport each variable gadget so that the points of each which correspond to and overlap on one side, with , overlapping on the other. We call the point where and overlap the location of the clause. The number of levels required for this stage is
Phase 4: Clause verification. Once the clause gadgets have been assembled, we introduce an extra point at the location of each of clause. These extra points move directly away from their respective clauses for a distance of . The number of levels required for this stage is
Let denote the temporal-sampling given by the above construction.
Lemma 3.13**.**
If there exists a truth assignment satisfying at least clauses, then admits a temporal ,,-clustering.
Proof.
Let be a satisfying assignment which satisfies at least -fraction of the clauses. We now use to construct a clustering with , . Introduce trajectories such that
[TABLE]
for all , for all , and for all starting at the first level of Phase through the last level of Phase . Note that since satisfies , every satisfiable clause has at least one trajectory at its location at the end of Phase . We extend one such trajectory to cover the clause’s extra point throughout Phase 4. The other trajectory which is near to the clause maintains the value that it had at the end of Phase 3 through all levels of Phase 4. This assigns of the at most trajectories. We will now use the remaining to pay for the unsatisfiable clauses. For each clause gadget corresponding to some unsatisfied clause, select one of the literals at its location (that is, is either or for some , ). We introduce a trajectory into the clustering such that from the first level of Phase through the last level of Phase , and which covers the clause’s extra point throughout each level of Phase . This completes our assignment. Since there are no more than unsatisfied clauses, the size of the clustering is at most , as desired. Further, by construction the displacement between trajectory centers on adjacent levels is at most . Thus .
We now argue that . Note that in all levels, all points are within a distance of from some center. In particular, by assignment from , it holds that for any either has a center for all or has a center for all for all levels of the first three phases. Thus each variable gadget has a center at at least one end, so we can cover all variable gadget points at a cost of . This same cost of covering the variable gadgets also holds in Phase , as at least one trajectory remains incident to the overlapping pair of variable gadgets. Since the second through last levels of Phase , all levels of Phase 2, and all levels of Phase 3 only contain points which correspond to variable gadgets, these levels be covered at a cost of . This bound also extends to the points of Phase 4 since any extra points which appear in those levels are centers of some trajectory by construction. It only remains to bound the cost of . Recall that there are pairs of extra points in which correspond to edges of each expander. The fact that all of either the True or False sides of each variable gadget receive a center, implies that all extra points are at a distance of to a center. Thus the cost of covering the first level is at most . Since each is -regular, . Thus, . ∎
Lemma 3.14**.**
If every truth assignment satisfies at most clauses then, for any , does not admit a temporal ,,-clustering.
Proof.
Let . We will argue that if admits a temporal ,,-clustering, , for some , and , then there exists a truth assignment which satisfies more than clauses. First note that the distance from any extra point of to its nearest neighbor in is . Thus, since , no such point has a feasible successor. It follows that none of these points appear in the trajectories of . Further, there is a unique valid successor for all variable gadget points in all levels of Phase , Phase , and all but the later the levels of Phase when the corresponding ends of the overlaid variable gadgets come within of each other. Since , it must be the case that the farthest distance to any cluster center is within . In particular this means that every variable gadget has been assigned a center. As otherwise some variable gadget at the end of Phase is covered by a center of another and the nearest one is at a distance of away. Further, since the extra point of Phase also moves a distance of , it must also be the case that some trajectory is incident to the location of each clause.
We would now like to infer a truth assignment from , but the main obstacle is that both and can be trajectories. We will show that the total discrepancy is bounded, and that even after accounting for “unfairly” satisfied clauses, it is possible to construct a satisfying assignment which satisfies more than clauses. To see this, note that for each pair of trajectories corresponding to the same variable which disagree and share an edge in , the gadget in Figure 10 costs an additional to cover. For each let denote the subset of vertices of which correspond to the minority assignment. That is, is the subset of vertices of which correspond to the subset of either True or False variable gadgets, whichever has smaller cardinality. It follows that is an upper bound on the total fraction of clauses satisfied by inconsistent trajectories. The additional coverage cost of the inconsistencies is equal to the number of edges which cross the -cut, which, since each is a -regular -expander, is at least . Since , we have that . Thus, . By setting , is an upper bound on the fraction of clauses flipped by inconsistent trajectories. Let such that
[TABLE]
Consider a truth assignment such that , and let denote the number of clauses satisfied by . Since it must be the case the extra points for all clauses are covered. That is, together with additional trajectories is sufficient for satisfying all clauses, so that . Since we have that . Combining both inequalities we see that . It immediately follows that . ∎
We are now ready to prove Theorem 1.7.2.
Proof of Theorem 1.7.2.
Let be given and suppose that there exists a polynomial-time -approximation. Let be an instance of MAX-- which is promised to either admit an assignment which satisfies at least fraction of the clauses, or does not admit an assignment which satisfies more than fraction. Let . Construct from in time. Note that we are free to take in the construction of as small as we like, provided that the number of levels of remains polynomial in the size of . Using this freedom we take such that . Run this approximation on with , , . If the approximation fails then it must be the case that is a No instance. Otherwise, the approximation produces a clustering, , with at most trajectories, , and . Note that Lemma 3.14 and Lemma 3.13 still hold since . Thus, provided that , and it is possible to distinguish Yes MAX-- instances from No instances. That is, for and . Taking completes the proof. ∎
4 Conclusion
Our results show that in many cases temporal clustering problems are hard to approximate. On the other hand, our polynomial time algorithms show that in some cases if we allow approximations in terms of parameters like or the spread , the approximation becomes tractable. We wish to better understand the boundary between these cases. Another direction comes from altering the model. We currently consider trajectories consisting of points in the input; an alternative formulation could allow centers from the ambient metric space. We plan to investigate this model in future research.
Acknowledgements
This work was partially supported by the NSF grants CCF 1318595, CCF 1423230, DMS 1547357, and NSF award CAREER 1453472.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. K. Agarwal, L. J. Guibas, H. Edelsbrunner, J. Erickson, M. Isard, S. Har-Peled, J. Hershberger, C. Jensen, L. Kavraki, P. Koehl, M. Lin, D. Manocha, D. Metaxas, B. Mirtich, D. Mount, S. Muthukrishnan, D. Pai, E. Sacks, J. Snoeyink, S. Suri, and O. Wolefson. Algorithmic issues in modeling motion. ACM Comput. Surv. , 34(4):550–572, December 2002.
- 2[2] A. V. Aho and D. Lee. Efficient algorithms for constructing testing sets, covering paths, and minimum flows. Bell Lab. Tech. Memo , 159, 1987.
- 3[3] D. Arthur and S. Vassilvitskii. k-means++: The advantages of careful seeding. In Proc. 18th Annu. ACM-SIAM Sympos. Disc. Alg. , pages 1027–1035. SIAM, 2007.
- 4[4] V. Arya, N. Garg, R. Khandekar, A. Meyerson, K. Munagala, and V. Pandit. Local search heuristics for k-median and facility location probs. SIAM J. Comput. , 33(3):544–562, 2004.
- 5[5] J. Basch, L. J. Guibas, and J. Hershberger. Data structures for mobile data. In Proc. 18th Annu. ACM-SIAM Sympos. Disc. Alg. , SODA ’97, pages 747–756. SIAM, 1997.
- 6[6] M. Bellare, O. Goldreich, and M. Sudan. Free bits, pcps, and nonapproximability—towards tight results. SIAM J. Comput. , 27(3):804–915, June 1998.
- 7[7] S. Cabello, P. Giannopoulos, C. Knauer, D. Marx, and G. Rote. Geometric clustering: fixed-parameter tractability and lower bounds with respect to the dimension. ACM Trans. Algs. (TALG) , 7(4):43, 2011.
- 8[8] I. Dinur and D. Steurer. Analytical approach to parallel repetition. In Proc. 46th Annu. ACM Sympos. Thry. Comput. , STOC ’14, pages 624–633. ACM, 2014.
