On the optimality of some group testing algorithms
Matthew Aldridge

TL;DR
This paper analyzes the optimality of group testing algorithms, showing that the DD algorithm is suboptimal for certain parameters and that SCOMP and LP-based algorithms are at least as effective for large defect rates.
Contribution
The paper provides new bounds on the rate of the DD algorithm and establishes the optimality of SCOMP and LP-based algorithms for specific defect proportions.
Findings
DD is suboptimal for .41 < heta < 0.5
SCOMP and LP algorithms achieve at least the same rate as DD for heta .5
New upper bounds on the rate of the DD algorithm are derived.
Abstract
We consider Bernoulli nonadaptive group testing with defectives, for . The practical definite defectives (DD) detection algorithm is known to be optimal for . We give a new upper bound on the rate of DD, showing that DD is strictly suboptimal for . We also show that the SCOMP algorithm and algorithms based on linear programming achieve a rate at least as high as DD, so in particular are also optimal for .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Optimality of Some
Group Testing Algorithms
Matthew Aldridge
Department of Mathematical Sciences
University of Bath, Bath, U.K.
Heilbronn Institute for Mathematical Research
Bristol, U.K.
Email: [email protected]
Abstract
We consider Bernoulli nonadaptive group testing with defectives, for . The practical definite defectives (DD) detection algorithm is known to be optimal for . We give a new upper bound on the rate of DD, showing that DD is strictly suboptimal for . We also show that the SCOMP algorithm and algorithms based on linear programming achieve a rate at least as high as DD, so in particular are also optimal for .
I Introduction
Group testing is a sparse inference problem that involves detecting the defective items in a population through the means of pooled tests. By testing a ‘pool’ – that is, a subset – of items, we discover whether that pool is free of defective items or whether it contains at least one defective item. After a sequence of such tests, we must infer (or ‘detect’) the true set of defective items. The goal is to find the defective set with low probability of error using as few tests as possible.
This naturally splits into two problems: first, designing the testing strategy, and second, detecting the defective items, given the test design and the results of the test.
For the design, here we consider Bernoulli nonadaptive testing. In nonadaptive testing, the test pools are all defined in advance, so the tests can be carried out in parallel. Bernoulli nonadaptive testing is the simplest form of nonadaptive testing, where each item is placed in each test independently with some fixed probability . Bernoulli nonadaptive testing is simple, order optimal for all (see [1, 2, 3, 4]), even has optimal constants for (see [4, 5, 6]), and has been studied by many authors (see [1, 7, 2, 3, 4, 5, 6, 8, 9, 10, 11] for just a few examples).
For the detection problem, we consider three detection algorithms: one called DD (for ‘definite defectives’), one called SCOMP (which can be seen as a Sequential version of the earlier COMP algorithm), and approaches based on linear programming. In contrast to the theoretical algorithms often used to prove achievability results, these three algorithms are practical, in that they are computationally efficient and do not require knowledge of the true number of defectives. It is important to know when these practical algorithms are as good as the theoretically optimal but impractical detection algorithms. Here, we study this question in the context of Bernoulli nonadaptive testing. It is known that DD is optimal for denser cases, where we have more defectives. Our first result here is that DD is strictly suboptimal for sparser cases. Our second result is that SCOMP and linear programming approaches are also optimal in denser cases.
We begin with some notation. There are items, of which are defective, and we use tests. We assume that defectivity is rare, in that , and concentrate on the case where scales like for some sparsity parameter . (The case behaves much the same, but one has to be a little careful over limiting arguments, as does not tend to as grows. We omit this case for brevity, although the results of this paper do carry over.) We assume that the true defective set is chosen uniformly at random for all subsets of size .
A useful way to keep track of the design of the test pools is to write to denote that item is in the pool for test , and to denote that item is not in the pool for test .
Definition 1
In Bernoulli nonadaptive group testing we set with probability and with probability , independently over and , for some fixed .
Recall that in standard group testing, a test outcome is positive if at least one item from is in the test, and negative if no items from are in the test. That is, writing for the outcome of test , we have
[TABLE]
In general, since there are possible defective sets and possible outcomes from tests, we see that we have a lower bound on the number of tests required to reconstruct the defective set of
[TABLE]
One way of interpreting this is to say that specifying the defective set requires bits of information, and each test, with a yes–no answer, can impart at most bit of information. (From now on, always means .) Given this, for a scheme that uses tests, we can regard the ‘bits learned per test’ as the rate of group testing. (This terminology follows that of Baldassini, Johnson, and Aldridge [12].)
Definition 2
The average error probability is defined as follows: if our estimate of the defective set is , then the error probability is
[TABLE]
where the probability is over the random choice of test design.
Definition 3
Given a detection algorithm, we say a rate is achievable if, for any , then for sufficiently large and , there exists a Bernoulli parameter and a number of tests with rate
[TABLE]
and Bernoulli nonadaptive testing having error probability less than .
The capacity of Bernoulli nonadaptive group testing (that is, the maximum achievable rate for any detection algorithm) is
[TABLE]
Achievability was shown by Scarlett and Cevher [4] and the converse by Aldridge [6]. It is easy to check that for most this can be simplified to
[TABLE]
so the - is only needed in the small interval .
However, the achievability result of Scarlett and Cevher [4] does not give a practical decoding algorithm to achieve the rate (1).
Aldridge, Baldassini and Johnson [2] showed that a simple algorithm called DD (for ‘definite defectives’) is optimal for . We define the DD algorithm in Section II. Also, here ‘optimal’ is used throughout to mean ‘achieves the capacity (1).’ One might wonder if DD is also optimal for . Our first main result mostly answers this in the negative: we give a new upper bound for the rate of DD, showing that for DD is strictly suboptimal (Theorem 4).
Two decoding algorithms seem to perform better than DD in practice – thus far without theoretical justification. These are the SCOMP algorithm of Aldridge, Baldassini, and Johnson [2], and approaches based on linear programming (see, for example, [10, 7]). We define these algorithms in Section III-B and III-C. Figure 1 shows a simulation of all three algorithms, demonstrating the superiority of SCOMP and linear programming. This theoretical performance suggests that the rate of these algorithms should be at least as high as DD, so in particular they should also be optimal for . Our second main result proves this (Theorems 8 and 9).
The rate bounds implied by our main results are illustrated in Figure 2.
II DD is suboptimal for
The DD algorithm of Aldridge, Baldassini, and Johnson [2] works as follows:
Any item in a negative test is definitely nondefective. The remaining items we call ‘possible defectives’. 2. 2.
If a (necessarily positive) test contains exactly one possible defective, then that item is in fact definitely defective (‘DD’). 3. 3.
The definitely defective items are declared to be defective. All other items – the definitely nondefective items and any remaining possible defectives – are declared to be nondefective.
DD is ‘practical’ in the sense that it requires only work and storage – which is what is required just to store and read the test design. In comparison, the decoder of Scarlett and Cevher [4] involves solving the maximum likelihood problem, which is in NP in general. Further, DD does not require the detector to know exactly or even approximately, while the maximum likelihood decoder requires the exact value of .
It was shown in [2] that, with Bernoulli tests, the DD algorithm can achieve a rate of
[TABLE]
where .
This can be compared with the capacity for Bernoulli testing (1). Since (1) gives an upper bound of for , and (2) gives the same rate for , we see that for we have the equality
[TABLE]
and DD is optimal in this region. (This was first noted in [6].)
Here we give an upper bound for DD, which shows that it is strictly suboptimal for some smaller values of .
Theorem 4
The rate of DD for Bernoulli nonadaptive group testing is bounded above by
[TABLE]
In particular, DD is strictly suboptimal for , where
[TABLE]
Although the bound (4) appears complicated, it is easy to check that for the second minimand dominates and gives the same bound as the standard capacity bound (1). Further, one can check numerically that for the first minimand of (4) dominates and gives a bound of . It is only for that the complicated expression (4) is necessary. Whether or not DD is optimal in the range remains an open question.
The above bounds are illustrated in Figure 2.
Proof:
The key point is that DD can make use of tests with no defectives (to eliminate ‘possible defectives’), and tests with one defective (if there are no other possible defectives in the test, the defective is ‘definitely defective’), but does not make use of any tests containing two or more defectives (they necessarily contain at least two possible defectives).
Suppose we use independent tests (as we do with a Bernoulli design). Write for the probability a test contains exactly defectives and for the probability a test contains at least defectives. Then a standard information argument gives a bound in terms of the entropy
[TABLE]
(Note that when , this gives the standard counting bound .) While it is helpful to think of this in terms of ‘bits learned per tests’, it can be made rigorous using the concept of typical sets – see, for example, [6, Lemma 1] – which tells us that with high probability we see one of ‘typical’ outcomes.
Since DD does make use of those tests containing or more defectives, we get a tighter bound
[TABLE]
This is because we learn at most bits with probability from the negative tests, and from positive tests we learn bits only with probability , since DD gets no information from a (positive) test with probability .
Now recall we are using a Bernoulli test design. Setting the parameter as (as we must to get a nonzero rate [6]), we have
[TABLE]
as . Hence the bound (5) becomes
[TABLE]
Combined with the capacity bound (1), this gives
[TABLE]
Note, however, that the second minimand
[TABLE]
is always dominated by the first minimand, so the second minimand can be deleted.
For the final part of the theorem, we note that second minimand in (4) is maximised at , and this dominates until
[TABLE]
By rearranging, and noting that , we see that this is precisely when . ∎
We remark that a similar proof technique could be used to bound the rate of the COMP algorithm of Chan et al [1]. This algorithm makes no use of of any positive tests, so the equivalent to the bound (5) would be
[TABLE]
Taking as above, and optimising at gives the bound
[TABLE]
Compared to the actual maximum achievable rate of COMP [1, 6], which is
[TABLE]
we see that such a bound is tight for but loose for .
III Algorithms that are optimal for
We have seen that the DD algorithm with Bernoulli testing achieves the rate (2). Together with the capacity bound (1), we see that DD is optimal for Bernoulli testing for .
As we saw in Figure 1, the SCOMP algorithm [2] and approaches based on linear programming [10, 7] outperform DD in simulations. In this section, we prove that the SCOMP algorithm and LP approaches achieve at least as high a rate as DD, so in particular satisfy the bound (2) and are also optimal for .
III-A Property
The crucial step to showing an algorithm outperforms DD is the a property we call ‘Property ’. First recall the following definition.
Definition 5
A set is satisfying if no negative test contains any item from and every positive test contains at least one item from .
So a satisfying set is one that ‘fits’ the observations. In particular, the true defective set is always a satisfying set, although their may be others.
Definition 6
A detection algorithm A has Property if, whenever the DD algorithm outputs a satisfying set, then A also outputs that same satisfying set.
Lemma 7
Suppose the detection algorithm A satisfies Property . Then A succeeds whenever DD succeeds. In particular, with Bernoulli nonadaptive group testing, we have
[TABLE]
and A is optimal for .
Proof:
Note that DD is not guaranteed to output a satisfying set. If the output of DD is not satisfying, then clearly the algorithm is in error. If the output of DD is satisfying, then it is necessarily the unique smallest satisfying set (since each definite defective appears uniquely in some test, so must be in the true defective set), and the algorithm may be successful.
Since algorithm A has Property , in this latter case, A also outputs this unique smallest satisfying set. Hence, whenever DD succeeds, A will succeed also.
The second part of the lemma then follows immediately from (2). ∎
III-B SCOMP
SCOMP is an algorithm due to Aldridge, Baldassini, and Johnson [2] that builds a satisfying set by starting from the DD set of definite defectives and sequentially adding new items until a satisfying set is reached. (The name comes from ‘Sequential COMP’, as it can be viewed as a sequential version of the COMP algorithm of Chan et al [1].)
Although recalling the precise details of how SCOMP extends from DD to a satisfying set is not required for the proof of following result, we define it here for completeness.
Any item in a negative test is definitely nondefective. The remaining items we call ‘possible defectives’. 2. 2.
If a (necessarily positive) test contains exactly one possible defective, then that item is in fact definitely defective (‘DD’). 3. 3.
The definitely defective items are declared to be defective, we call these , and definitely nondefective items are declared to be nondefective. The other possible defectives are not yet declared either way. 4. 4.
Any positive test is called ‘unexplained’ if it does not contain any items from . Add to the possible defective not in that is in the most unexplained tests, marking the corresponding tests as no longer unexplained. (Ties may be broken arbitrarily.) 5. 5.
Repeat step 4 until no tests remain unexplained. The estimate of the defective set is .
Note that a satisfying set leaves no unexplained tests, and that any set containing no definite nondefectives and leaving no unexplained tests is satisfying. Note also that the set of all possible defectives is satisfying, so the SCOMP algorithm does indeed terminate.
Theorem 8
SCOMP with Bernoulli tests can achieve the rate
[TABLE]
In particular, SCOMP is optimal for .
Proof:
Following Lemma 7, we have wo show that SCOMP satisfies Property .
As explained above, the SCOMP algorithm begins with the DD algorithm, then takes further steps to ensure a satisfying set is reached. However, if DD already provides a satisfying set, no further steps are taken, and SCOMP halts and outputs the DD set. Hence Property is satisfied. ∎
III-C Linear programming approaches
LP algorithms are based on the fact choosing the smallest satisfying set as an estimate is known to be optimal [6], but is likely to be impractical, as it requires solving an instance of the NP-hard set cover problem. Specifically, the smallest satisfying set is related to the solution to the [math]– linear program
[TABLE]
(Recall that indicates if item is in test , and is the outcome of test .) The set is the smallest satisfying set.
LP approaches attempt to estimate the defective set via the relaxed version of the [math]– problem, where the s can be any non-negative real numbers. Linear programs like this can be solved quickly using, for example, the simplex algorithm.
There are various heuristics for how to turn the optimal solution to the relaxed program into an estimate of the defective set. For the purposes of Theorem 9, it suffices that in the event that all s are [math] or then the heuristic chooses (as any sensible heuristic must).
For example, one could consider the following very crude method: if there is any with , declare a global error; else estimate to be the defective set. Malioutov and Malyutov [10] suggest using the estimate , and show strong performance on simulated problems. (This was the ‘Malioutov rule’ used for the simulations in Figure 1.) Another suggestion could be to estimate , or to place each item in independently with probability .
Along these lines, the ‘LiPo’ algorithm of Chan et al [7] assumes the detector knows exactly, so the LP relaxation can be instead phrased as a feasibility problem. They show that LiPo achieves a nonzero rate for all , and specifically
[TABLE]
However, by using Property , we can show a higher rate, which is optimal for .
Theorem 9
Any LP approach as described above with Bernoulli tests can achieve the rate
[TABLE]
In particular, LP approaches are optimal for .
Proof:
Following Lemma 7, we must show that Property is satisfied.
Note that any item in a negative test (a definite nondefective) will have to satisfy the second constraint of the linear program. Further, if a test contains only one possible (and thus definite) defective , the LP solution must have to satisfy the corresponding constraint, and will choose to minimise . Finally, if these definite defectives form a satisfying set, all constraints are satisfied, and the LP will set all other s to [math], to minimise . Hence Property holds, and we are done. ∎
Acknowledgments
The author thanks Jonathan Scarlett for useful discussions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. L. Chan, P. H. Che, S. Jaggi, and V. Saligrama, “Non-adaptive probabilistic group testing with noisy measurements: Near-optimal bounds with efficient algorithms,” in Communication, Control, and Computing (Allerton), 2011 49th Annual Allerton Conference on , Sept 2011, pp. 1832–1839.
- 2[2] M. Aldridge, L. Baldassini, and O. Johnson, “Group testing algorithms: Bounds and simulations,” IEEE Transactions on Information Theory , vol. 60, no. 6, pp. 3671–3687, June 2014.
- 3[3] M. Aldridge, L. Baldassini, and K. Gunderson, “Almost separable matrices,” Journal of Combinatorial Optimization , vol. 33, no. 1, pp. 215–236, 2017.
- 4[4] J. Scarlett and V. Cevher, “Limits on support recovery with probabilistic models: An information-theoretic framework,” IEEE Transactions on Information Theory , vol. 63, no. 1, pp. 593–620, Jan 2017.
- 5[5] ——, “Phase transitions in group testing,” in Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms , 2016, pp. 40–53.
- 6[6] M. Aldridge, “The capacity of Bernoulli nonadaptive group testing,” 2015, ar Xiv:1511.05201 [cs.IT].
- 7[7] C. L. Chan, S. Jaggi, V. Saligrama, and S. Agnihotri, “Non-adaptive group testing: Explicit bounds and novel algorithms,” IEEE Transactions on Information Theory , vol. 60, no. 5, pp. 3019–3035, May 2014.
- 8[8] M. B. Malyutov, “The separating property of random matrices,” Mathematical Notes of the Academy of Sciences of the USSR , vol. 23, no. 1, pp. 84–91, 1978.
