# Efficient and error-tolerant schemes for non-adaptive complex group   testing and its application in complex disease genetics

**Authors:** Thach V. Bui, Minoru Kuribayashi, Mahdi Cheraghchi, and Isao Echizen

arXiv: 1904.00349 · 2019-04-02

## TL;DR

This paper introduces efficient, error-tolerant algorithms for complex group testing, significantly improving identification time and test efficiency, with applications in genetics and biology.

## Contribution

The paper proposes novel schemes for classical and generalized complex group testing that are faster and more error-resilient than existing methods.

## Key findings

- Efficient identification of positive complexes in time proportional to $t 	imes 	ext{poly}(d, \\ln n)$.
- Reduced number of tests compared to previous approaches in certain cases.
- Applicability to complex disease genetics and molecular biology.

## Abstract

The goal of combinatorial group testing is to efficiently identify up to $d$ defective items in a large population of $n$ items, where $d \ll n$. Defective items satisfy certain properties while the remaining items in the population do not. To efficiently identify defective items, a subset of items is pooled and then tested. In this work, we consider complex group testing (CmplxGT) in which a set of defective items consists of subsets of positive items (called \textit{positive complexes}). CmplxGT is classified into two categories: classical CmplxGT (CCmplxGT) and generalized CmplxGT (GCmplxGT). In CCmplxGT, the outcome of a test on a subset of items is positive if the subset contains at least one positive complex, and negative otherwise. In GCmplxGT, the outcome of a test on a subset of items is positive if the subset has a certain number of items of some positive complex, and negative otherwise.   For CCmplxGT, we present a scheme that efficiently identifies all positive complexes in time $t \times \mathrm{poly}(d, \ln{n})$ in the presence of erroneous outcomes, where $t$ is a predefined parameter. As $d \ll n$, this is significantly better than the currently best time of $\mathrm{poly}(t) \times O(n \ln{n})$. Moreover, in specific cases, the number of tests in our proposed scheme is smaller than previous work. For GCmplxGT, we present a scheme that efficiently identifies all positive complexes. These schemes are directly applicable in various areas such as complex disease genetics, molecular biology, and learning a hidden graph.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.00349/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1904.00349/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/1904.00349/full.md

---
Source: https://tomesphere.com/paper/1904.00349