Error Exponent Bounds for the Bee-Identification Problem
Anshoo Tandon, Vincent Y. F. Tan, Lav R. Varshney

TL;DR
This paper introduces the bee-identification problem, defines its error exponent, and derives bounds showing joint decoding outperforms separate decoding, with tight bounds at low rates.
Contribution
It formally defines the bee-identification problem, introduces error exponent bounds, and demonstrates the superiority of joint decoding over separate decoding.
Findings
Joint decoding significantly improves error exponents.
Lower bounds using typical random codes outperform random code ensembles.
Bounds converge at zero rate, indicating optimality of joint decoding in that regime.
Abstract
Consider the problem of identifying a massive number of bees, uniquely labeled with barcodes, using noisy measurements. We formally introduce this `bee-identification problem', define its error exponent, and derive efficiently computable upper and lower bounds for this exponent. We show that joint decoding of barcodes provides a significantly better exponent compared to separate decoding followed by permutation inference. For low rates, we prove that the lower bound on the bee-identification exponent obtained using typical random codes (TRC) is strictly better than the corresponding bound obtained using a random code ensemble (RCE). Further, as the rate approaches zero, we prove that the upper bound on the bee-identification exponent meets the lower bound obtained using TRC with joint barcode decoding.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced biosensing and bioanalysis techniques · DNA and Biological Computing · SARS-CoV-2 detection and testing
Error Exponent Bounds for the
Bee-Identification Problem
Anshoo Tandon, , Vincent Y. F. Tan, ,
and Lav R. Varshney A. Tandon is with the Department of Electrical and Computer Engineering, National University of Singapore, Singapore 117583 (email: [email protected]. Y. F. Tan is with the Department of Electrical and Computer Engineering, and with the Department of Mathematics, National University of Singapore, Singapore (email: [email protected]).L. R. Varshney is with the Coordinated Science Laboratory and the Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA (email: [email protected])
Abstract
Consider the problem of identifying a massive number of bees, uniquely labeled with barcodes, using noisy measurements. We formally introduce this “bee-identification problem”, define its error exponent, and derive efficiently computable upper and lower bounds for this exponent. We show that joint decoding of barcodes provides a significantly better exponent compared to separate decoding followed by permutation inference. For low rates, we prove that the lower bound on the bee-identification exponent obtained using typical random codes (TRC) is strictly better than the corresponding bound obtained using a random code ensemble (RCE). Further, as the rate approaches zero, we prove that the upper bound on the bee-identification exponent meets the lower bound obtained using TRC with joint barcode decoding.
I Introduction
Consider a group of different bees, in which each bee is tagged with a unique barcode for identification purposes in order to understand interaction patterns in honeybee social networks [1]. Assume that a camera is employed to picture the beehive to study the interactions among bees. The image output (see Fig. 1) can be considered as a noisy and unordered set of barcodes. We formally pose the problem of bee-identification from a beehive image as an information-theoretic problem (Sec. I-B).
The bee-identification problem has applications in identification of warehouse products (labeled with unique RFID barcodes) using wide-area sensors. Other applications include package-distribution to recipients from a batch of deliveries with noisy address labels, and similar “bipartite matching” settings. It also has potential applications in identification of the mapping between signals and their meaning in “alien communication” with extraterrestrials, and also in learning communication protocols among robots, via the use of pilot signals going through the alphabet.
We consider the scenario where the barcode for each bee is represented as a binary vector of length , and the bee barcodes are collected in a codebook comprising rows and columns, with each row corresponding to a bee barcode. As shown in Fig. 2, the channel first permutes the rows of with a random permutation to produce . The entries of are then subjected to noise (corresponding to a binary symmetric channel (BSC) with crossover probability ), and the channel output is denoted . We assume that the decoder has knowledge of codebook , and its task is to recover the row-permutation introduced by the channel. Note that the permutation directly ascertains the identity of all the bees.
I-A Related Work
In a related work motivated by an Internet of Things (IoT) setting, the identification of users in strongly asynchronous massive access channels was studied [2]. The identification of the underlying distributions of a set of observed sequences (where each sequence is generated i.i.d. by a distinct distribution) was analyzed in [3]. The bee-identification problem, on the other hand, allows codebooks where all barcode sequences are generated using the same underlying distribution.
In another related work [4], the fundamental limits of data storage via unordered DNA molecules was investigated. Here, a DNA molecule corresponds to an -length sequence over an alphabet of size 4, and the information is written onto DNA molecules stored in an unordered way. The storage capacity results in [4] were extended to noisy settings in [5] where the channel adds noise and randomly permutes the DNA molecules used to store information. The capacity results are obtained under the scenario where the length, , of each DNA molecule grows with . Although the effective channel in [5] is closely related to the bee-identification channel in Fig. 2, we note that the fundamental problem in [5] is to quantify the data storage capacity, while the main issue in the bee-identification problem is the identification of the row-permutation induced by the channel.
Data communication over permutation channels with impairments was analyzed in [6]. The authors of [6] presented bounds on the size of optimal codes over a finite input alphabet, when the channel randomly permutes the letters of the input sequence in addition to causing impairments such as insertions, deletions, and substitutions. The effective channel for the bee-identification problem (see Fig. 2) differs from the communication channel in [6] in two aspects: (i) The input to the channel in the bee-identification problem is the entire codebook, not just a codeword belonging to the codebook. (ii) The channel in Fig. 2 only permutes the rows of the codebook, but does not permute the letters within a row.
I-B Bee-Identification Problem Formulation
The channel output is a row-permuted and noisy version of the codebook. If denotes a given permutation of -letters, then the channel first permutes the rows of codebook , based on , to produce (see Fig. 2). Therefore, if and the -th row of codebook is denoted , then the -th row of is equal to . The channel then applies noise on the permuted codebook to produce , where noise is modeled by a BSC with crossover probability , denoted BSC(), with . If , and denotes the -th row of , then
[TABLE]
where denotes the Hamming distance between vectors and . Let , and let the decoder correspond to a function which takes as an input and produces a map where corresponds to the index of the transmitted codeword which produced the received word , for . In effect, the bee-identification problem is that the decoder has to recover the row-permutation introduced by the channel, by using the knowledge of codebook and the channel output .
I-C Bee-Identification Error Exponent
The indicator for the bee-identification error is defined as
[TABLE]
For a given codebook and decoding function , the expected bee-identification error probability over the BSC() is
[TABLE]
where the inner expectation is over the distribution of given and (see (1)), and the outer expectation is over a uniform distribution of over all -letter permutations. Note that (2) can be equivalently expressed as
[TABLE]
For a given , let the number of barcodes scale exponentially with blocklength as . Now, for given values of and , define the minimum expected bee-identification error probability as
[TABLE]
where the minimum is over all codebooks of size , and all decoding functions .
Define, , the exponent corresponding to the minimum expected bee-identification error probability, as
[TABLE]
We introduce some notation that is used in the rest of the paper. We will denote when . Similarly, we write (respectively, ) if (respectively, ).
I-D Our Contributions
The “bee-identification problem” is introduced and the corresponding bee-identification exponent is analyzed in this paper. In particular, we provide the following explicit bounds on this exponent.
- •
A lower bound on using a random code ensemble (RCE) with independent barcode decoding (Sec. II-A) and joint barcode decoding (Sec. II-B).
- •
A lower bound on using typical random codes (TRC) with independent barcode decoding (Sec. III-A) and joint barcode decoding (Sec. III-B).
- •
An upper bound on which is applicable to all possible codebook designs (Sec. IV).
We show that joint decoding of barcodes provides a significantly better exponent compared to separate decoding followed by learning the permutation. For low rates, we prove that the lower bound obtained using TRC is strictly better than the corresponding bound obtained using RCE. Further, as the rate approaches zero, we prove that the upper bound meets the lower bound obtained using TRC with joint barcode decoding.
II Random Code Ensemble
In this section, we present lower bounds on using an RCE [7]. Let denote the set of all binary matrices with rows and columns. Assume that codebook is uniformly distributed over . It is immediate from the definition of (4) that
[TABLE]
where the expression on the right denotes the average performance using RCE. We proceed by quantifying this expression when the decoding function corresponds to: (i) independent barcode decoding (Sec. II-A), and (ii) joint barcode decoding (Sec. II-B). The main results in this section are as follows: we present explicit lower bounds on using independent barcode decoding (Thm. 1) and joint barcode decoding (Thm. 2). It is shown (Prop. 2) that the bee-identification exponent obtained using joint barcode decoding is strictly better than the corresponding exponent obtained with independent barcode decoding.
II-A Independent Decoding for Each Barcode
Here, we analyze a naïve decoding strategy where each barcode is decoded independently. In this case, for , the decoder picks , the -th row of , and then decodes it to . If there is more than one codeword at the same minimum Hamming distance from , then any one of the corresponding codeword indices is chosen at random. From (3) and the union bound, we have
[TABLE]
[TABLE]
Now define
[TABLE]
Note that is independent of index due to the averaging over the ensemble of codebooks uniformly distributed over . For , the expression for corresponds to the probability of error when the -th codeword is transmitted over BSC(). From (8) and (9), we get
[TABLE]
The following theorem uses (10) to present an explicit lower bound on .
Theorem 1**.**
We have
[TABLE]
where , and
[TABLE]
Proof:
It is well known that the random coding exponent over BSC(), defined as , is given by [8, 7]
[TABLE]
where denotes the binary entropy function, is the Gilbert-Varshamov (GV) distance [7] defined as the value of in the interval with , and is the critical rate given by , and
[TABLE]
Using the fact that , and combining (5), (10), and the definition of , we get
[TABLE]
Now, using explicit numerical computation, it can be shown that . The proof is complete by combining (13), (14), and noting that when because is a decreasing function of . ∎
The lower bound on given by (11) was obtained by applying a naïve decoding strategy where each barcode was decoded independently. In the next subsection, we analyze the bee-identification exponent using joint barcode decoding.
II-B Joint Decoding of Barcodes
Let denote the set of permutations of . For joint maximum likelihood (ML) decoding of barcodes, the decoding function takes the noisy row-permuted codebook as input, and produces permutation as output, where , and . We aim to provide bounds on .
For any two permutations , the sets of distances and are equal. Therefore, the performance of the joint ML decoder is independent of the channel permutation , and we assume, without loss of generality, that the permutation induced by the channel is the identity permutation, denoted .
For a given codebook at the transmitter, let denote the received noisy codebook at the output of the effective channel, and for with , we define
[TABLE]
where the event is said to occur if . From (3), we have
[TABLE]
where (15) follows from the union bound. Now define
[TABLE]
which denotes the probability of the event , averaged over the ensemble of random binary codebooks. Using (6), (15), and (16), we get
[TABLE]
Now consider two codewords , at distance from each other. Given that is transmitted over BSC(), the probability that the Hamming distance of the received word from is not more than its distance from is [7]
[TABLE]
where
[TABLE]
Therefore, for a given codebook and permutation with , if , then
[TABLE]
In the following, we quantify for different , via (16) and (19).
II-B1 is a transposition
We first consider the case where is a transposition, i.e. a permutation that interchanges only two indices. For indices , with , the Hamming distance between codewords and in a random codebook satisfies [7]
[TABLE]
When is the permutation that only transposes indices and , then if and only if . Thus, it follows from (20) that . Further, when , we have . Therefore, the probability can be characterized using (16), (19), and (20) as
[TABLE]
If is treated as a continuous variable, then the exponent is a convex function with a unique minimum at where
[TABLE]
Therefore, for , we have
[TABLE]
Now, if we define , then it follows from (21) that
[TABLE]
Further, we have , where
[TABLE]
Hence, it follows from (23) and (24) that
[TABLE]
where is a transposition.
II-B2 is a product (composition) of disjoint transpositions
We now consider the case where , where and are disjoint transpositions with and . As the codewords in a random codebook are independent, then using (20), we have . Further, if and , then , and . Therefore, if is a product of two disjoint transpositions, then
[TABLE]
In general, when is a product of disjoint transpositions, the above argument can be readily extended to show that
[TABLE]
Now, define
[TABLE]
where and are defined in (12) and (24), respectively. As , it follows from (26) that
[TABLE]
We remark that when is just a transposition, then from (25) we have , which is only a special case of (27) with .
II-B3 is a -cycle with
Let be a -cycle where for , and . We will apply the following proposition towards characterizing .
Proposition 1**.**
Let denote the space of all -length binary vectors. Let be random vectors, uniformly distributed over , and let be given non-negative integers. Then the following holds
[TABLE]
Proof:
See Appendix A. ∎
For a given codebook , if for , and , then , and we have
[TABLE]
Further, if codebook is uniformly distributed over ,
[TABLE]
where (30) follows from (28). Combining (29) and (30),
[TABLE]
If is treated as a continuous variable, then the exponent is a convex function with a unique minimum at , where
[TABLE]
We have
[TABLE]
and therefore
[TABLE]
where . Combining (31) and (33),
[TABLE]
As for , we have , and it follows from (34) that
[TABLE]
The above equation has been derived for the case where is a -cycle with . However, a transposition is just a -cycle with , and from the remark following (27), it follows that (35) holds even for .
II-B4 General with
It is well known that any permutation can be written as a product (composition) of disjoint cycles, for [9]. Consider a given which is a product of disjoint cycles of length , respectively, where for . Then, we can extend the result in (35) to obtain
[TABLE]
II-B5 Putting it all together
For , if we define
[TABLE]
then (17) can be equivalently expressed as
[TABLE]
Note that the set is empty, as the Hamming distance between two distinct permutations is at least two. The set consists of all transpositions and . For all , the value of is given by (25), and combining this with (38), we get
[TABLE]
For a given , if , then from (36) it follows that . For , the size of the set satisfies . If we define
[TABLE]
then we have . Now, if , then because , there exists such that for , we have and hence . Therefore, for ,
[TABLE]
As and when , it follows from (41) that
[TABLE]
Combining (39), (40), and (42), for ,
[TABLE]
Comparing (17) with (43), we observe that the error probability is dominated by terms for corresponding to -cycles with and . The next theorem presents an explicit lower bound for when the decoder jointly decodes all the barcodes using a maximum likelihood approach.
Theorem 2**.**
We have
[TABLE]
where .
Proof:
If , then . Therefore, from (43) it follows that if , then is lower bounded by . Further, note that if and only if . ∎
The following proposition shows that the lower bound (44) (obtained using joint decoding of barcodes) is strictly better than the bound given by (11) (obtained with independent decoding of barcodes) in the interval where it is positive.
Proposition 2**.**
When and , then we have the strict inequality
[TABLE]
Proof:
When , we have , and hence . If , then . The proof is complete by combining these observations with the definition of . ∎
Note that for , because in this case . In the following section, we present improved lower bounds on by analyzing typical random codebooks.
III Typical Random Code
TRCs are known, in general, to provide higher error exponents than RCE over a BSC [7, 10]. Roughly speaking, TRCs are characterized by the property that their relative minimum distance is at least . Formally, for , , and indices , the Hamming distance between codewords and in a TRC satisfies [7]
[TABLE]
where , , and .
Let denote the set of all codebooks of size , with the property that the Hamming distance between a pair of codewords and satisfies the relation for all . Note that if codebook is uniformly distributed over , then the Hamming distance between a pair of distinct codewords satisfies (45). It is immediate from (4) that
[TABLE]
where the expression on the right denotes the average performance using TRCs.
In this section we provide lower bounds on the bee-identification exponent using TRCs. The case where each barcode is decoded independently is analyzed in Sec. III-A while joint barcode decoding is analyzed in Sec. III-B. It is shown that these lower bounds on using TRCs outperform the corresponding bounds for RCEs when the rate is smaller than a certain threshold.
III-A Independent Decoding of Barcodes
With independent barcode decoding, the decoder picks , the -th row of , and then assigns , for . From the union bound, we have , and using (46) we get
[TABLE]
Let . Note that is independent of the index due to the symmetry resulting from averaging over codebooks uniformly distributed over . For , the expression for corresponds to the probability of error when the -th codeword is transmitted. From (47), we get
[TABLE]
The following theorem uses (48) to present an explicit lower bound on when the rate is smaller than a certain threshold.
Theorem 3**.**
We have
[TABLE]
where is defined in (18), and
[TABLE]
Proof:
It is known that for , the error exponent using a TRC over BSC(), defined as , is given by [7]
[TABLE]
Using the fact that , and combining (5), (48), with the definition of , we get
[TABLE]
The proof is completed by applying (51) in (52). ∎
It is well known that for [7]. This implies that the lower bound on for TRC given by (49) is strictly better than the corresponding bound for RCE given by (11) when . The next subsection provides a more refined bound on by analyzing joint decoding of barcodes using TRCs.
III-B Joint Decoding of Barcodes
With joint barcode decoding, the decoder takes the noisy row-permuted codebook as input, and produces the permutation as output, where . As in Sec. II-B, we assume, without loss of generality, that the permutation induced by the channel is the identity permutation . For a given codebook , we have . If we define
[TABLE]
where the expectation is over a uniform distribution of codebook over , then we have
[TABLE]
In the following, we quantify for different , in order to bound via (54).
III-B1 is a transposition
If is the permutation that only transposes indices and , and , then , and we have
[TABLE]
When is uniformly distributed and , then
[TABLE]
where (56) follows from (45). Combining (53), (55), and (56), we get
[TABLE]
If is treated as a continuous variable, then the exponent is a convex function of with a unique minimum at defined in (22). If we define
[TABLE]
then for , we have
[TABLE]
The exponent increases monotonically in for . Therefore, if and , the exponent in (57) is minimized for , and we have
[TABLE]
where .
III-B2 is a -cycle
We now consider the case where is a -cycle with . We will apply the following proposition towards characterizing .
Proposition 3**.**
Let be distinct rows in codebook , and let satisfy for . Let denote the probability when is uniformly distributed over . Then, we have
[TABLE]
where
[TABLE]
and denotes the probability when is uniformly distributed over .
Proof:
See Appendix B. ∎
Now, given that and for , and , we have , and therefore
[TABLE]
If , then combining (60) and (62), we get
[TABLE]
where, for , we have
[TABLE]
The function is a convex function of , and has a unique minimum that occurs at defined in (32). From (50) we observe that . Thus, if , then we have . Further, is an increasing function of for , and so if and , the exponent in (64) is minimized when . Thus, we have
[TABLE]
Combining (63), (65), and (66), for ,
[TABLE]
where is a -cycle with . As for , it follows from (67) that
[TABLE]
Recall that and are given by (22) and (58), respectively. As is an increasing function of , and , it follows that , which implies that . Note that a transposition is simply a -cycle with , and comparing (59) with (68) we observe that the relation given by (68) holds even when .
III-B3 is a product (composition) of two disjoint cycles
We now consider the case where , where and are disjoint cycles of length and , respectively. Let and . If for , then a straightforward extension of Prop. 3 shows that the probability \Pr\Big{\{}\bigcap_{l=1}^{k_{1}-1}\{\mathrm{d_{H}}({\boldsymbol{c}}_{i_{l}},{\boldsymbol{c}}_{i_{l+1}})=d_{l}\}\bigcap\left\{\mathrm{d_{H}}({\boldsymbol{c}}_{i_{k_{1}}},{\boldsymbol{c}}_{i_{1}})=d_{k_{1}}\right\}\\ ~{}~{}~{}~{}~{}~{}\bigcap_{l=k_{1}+1}^{k_{1}+k_{2}-1}\left\{\mathrm{d_{H}}({\boldsymbol{c}}_{i_{l}},{\boldsymbol{c}}_{i_{l+1}})=d_{l}\right\}\\ ~{}~{}~{}~{}~{}~{}\bigcap\left\{\mathrm{d_{H}}({\boldsymbol{c}}_{i_{k_{1}+k_{2}}},{\boldsymbol{c}}_{i_{k_{1}+1}})=d_{k_{1}+k_{2}}\right\}\Big{\}} is upper bounded by
[TABLE]
Further, for a given codebook , with , , , , we have , and therefore
[TABLE]
Combining (69) and (70), we can upper bound by
[TABLE]
The above expression can be equivalently written as
[TABLE]
where and are given by (64) and (65), respectively. Now, applying (65), (66) in (72) for , we get
[TABLE]
where . As and , we have , and therefore for , we have
[TABLE]
III-B4 General with
If permutation is a product of disjoint cycles of length , respectively, then similar to (68), (74), we have for ,
[TABLE]
III-B5 Putting it all together
For , if we define , where is given by (37), then (54) can be equivalently expressed as
[TABLE]
If is a product of disjoint cycles of length , respectively, and , then belongs to the set , and is given by (75). Equivalently, for a given , if belongs to the set , then for ,
[TABLE]
The size of satisfies . Therefore, for , we have
[TABLE]
Now, if we define , then (78) can be equivalently expressed as . As , there exists such that for , we have and hence . Therefore, for and , we have
[TABLE]
where (79) follows because as [7], (80) follows because , and (81) follows because . Note that , and so and . As can be made arbitrarily small, it follows from (81) that for , we have
[TABLE]
The following theorem encapsulates the main result of this subsection on bounding the bee-identification exponent, , using joint decoding for TRC.
Theorem 4**.**
We have
[TABLE]
Proof:
We note that the above lower bound for using TRCs with joint barcode decoding is twice the corresponding bound obtained using independent barcode decoding (see (49)). The following proposition shows that the lower bound given by Thm. 4 using TRC is strictly better than corresponding bound using RCE (see Thm. 2) for .
Proposition 4**.**
The lower bound on in (83) obtained for TRC is strictly better than the corresponding bound in (44) obtained for RCE when .
Proof:
It is known that when [7]. Further, using explicit numerical computation, it can be shown that . Therefore, it follows that for , we have
[TABLE]
∎
The next section presents an explicit upper bound for which applies to all possible codebook designs.
IV Upper Bound on the Bee-Identification Exponent
This section presents an upper bound on the bee-identification exponent . Towards this, we define the following optimum minimum distance metrics
[TABLE]
For any given codebook , we show that there exists a set consisting of pairs of codeword indices , , with the following properties:
- (i)
If , then . 2. (ii)
If and , then and . 3. (iii)
Size of set is at least .
A set satisfying the above properties can be constructed iteratively as follows.
- •
Step 1: For a given codebook , initialize to be the empty set and let .
- •
Step 2: As contains at least codewords, there exists , with , satisfying . Include the pair to , and let .
- •
Step 3: If , then go to Step 2, else stop.
Let the receiver employ ML decoding, and interpret each pair as a transposition that interchanges indices and . Let denote the error event that the receiver incorrectly decodes the channel induced permutation to transposition (instead of the identity permutation ), i.e. . Then, the bee-identification error probability can be lower bounded as
[TABLE]
Using de Caen’s lower bound on the probability of a union [11], the expression on the right side in (84) can itself be lower bounded by
[TABLE]
where follows because events and are independent when . Now
[TABLE]
where follows from the fact that for , and follows because . If , then combining (84), (85), (86), and noting that increases with , we have
[TABLE]
As (87) is true for all , we have
[TABLE]
The value can be upper bounded as [12, 13]
[TABLE]
The following theorem provide an upper bound on the bee-identification exponent .
Theorem 5**.**
We have
[TABLE]
Proof:
Follows immediately from (88) and (89). ∎
The following corollary shows that can be explicitly characterized with a rather simple expression when rate tends to zero.
Corollary 1**.**
We have
[TABLE]
Proof:
As , we have from (90) that
[TABLE]
On the other hand, we have and so it follows from (83) that
[TABLE]
The proof is completed by using (92) and (93). ∎
The above corollary shows that the lower bound on given by (83), and the upper bound on given by (90) become tight as .
V A Numerical Example
Fig. 3 plots different bounds for the bee-identification exponent . The explicit lower bound for RCE with independent decoding (ID) (respectively, joint decoding (JD)) is given by (11) (respectively, (44)). The performance with JD is seen to be much better than with ID. When , the explicit lower bound for TRC with ID (respectively, JD) is given by (49) (respectively, (83)). As shown in Prop. 4, the lower bound obtained using TRC with joint decoding is better than the corresponding bound using RCE. The upper bound is given by (90) and holds for all possible codebook designs. Further, as shown in Cor. 1, it is observed from Fig. 3 that for .
VI Discussion
We introduced the information-theoretic “bee-identification problem” which arises naturally in different massive identification settings. We derived explicit upper and lower bounds on the bee-identification exponent, and showed that joint decoding of barcodes provides a significantly better exponent than separate decoding followed by permutation inference. For low rates, we showed that the lower bound on the bee-identification exponent obtained using TRC is strictly better than the corresponding bound obtained using RCE. Moreover, when the rate approaches zero, we showed that the upper bound on the bee-identification exponent coincides with the lower bound obtained using TRC with joint barcode decoding.
Relative to the independent decoding of barcodes, the performance improvement with joint decoding comes at a cost of increased computational complexity. For joint decoding, an exhaustive search entails comparing the received noisy & permuted version of the codebook with row-permutations of the codebook. This may be computationally prohibitive even for moderate values of blocklength when scales exponentially with . In practice, intermediate performance between the extremes of independent decoding and joint decoding may be achieved with manageable complexity using ideas from generalized minimum distance decoding [14]. In particular, the decoding process may proceed in two steps: The first step involves independent decoding of each barcode where an erasure is declared if the distance between the received noisy barcode to the nearest barcode in the codebook exceeds a threshold. The second step fixes the codebook row-indices corresponding to the un-erased barcodes, and then decodes the erased barcodes by jointly comparing their received noisy version to different row-permutations of the codebook corresponding to the non-fixed indices. This results in significant reduction in complexity in case only a few barcodes are declared as erasure in the first step. Therefore, we have a tradeoff between performance and complexity via an appropriate choice of the distance threshold parameter for declaring an erasure.
The work in this paper may be extended by considering different variants of the bee-identification error metric, for instance, where error is flagged only when the fraction of incorrectly decoded barcodes exceeds a threshold. Another interesting scenario for future analysis is the problem formulation where some of the rows in codebook are deleted, due to some bees being outside the hive when taking the picture.
Appendix A Proof of Prop. 1
Proof:
Let , and , where denotes modulo-2 addition. Then, , where follows from the fact that for a given , the distribution of is same as the distribution of . This implies that . Then can be expressed as
[TABLE]
where denotes the indicator function, and follows from . Recursively applying (94), we get
[TABLE]
Now, (28) follows from the fact that when and are uniformly distributed over [7]. ∎
Appendix B Proof of Prop. 3
Proof:
For , let denote the -th row of codebook . Let denote the space of all -length binary vectors, and let for . Let denote the probability when is uniformly distributed over . Then, we have
[TABLE]
where denotes the indicator function. Further, let denote the probability when codebook is uniformly distributed over . Then,
[TABLE]
where follows from (95), and follows from Prop. 1.
∎
Acknowledgement
The authors acknowledge discussions with Ting-Yi Wu and Tim Gernat on the bee-identification problem formulation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. Gernat et al. , “Automated monitoring of behavior reveals bursty interaction patterns and rapid spreading dynamics in honeybee social networks,” Proc. Nat. Acad. Sci. U.S.A. , vol. 115, no. 7, pp. 1433–1438, Feb. 2018.
- 2[2] S. Shahi, D. Tuninetti, and N. Devroye, “The strongly asynchronous massive access channel,” Jul. 2018, ar Xiv:1807.09934 [cs.IT].
- 3[3] S. Shahi, D. Tuninetti, and N. Devroye, “On identifying a massive number of distributions,” in Proc. 2018 IEEE Int. Symp. Inf. Theory , Jun. 2018, pp. 331–335.
- 4[4] R. Heckel, I. Shomorony, K. Ramchandran, and D. N. C. Tse, “Fundamental limits of DNA storage systems,” in Proc. 2017 IEEE Int. Symp. Inf. Theory , Jun. 2017, pp. 3130–3134.
- 5[5] I. Shomorony and R. Heckel, “Capacity results for the noisy shuffling channel,” Feb. 2019, ar Xiv:1902.10832 [cs.IT].
- 6[6] M. Kovačević and V. Y. F. Tan, “Codes in the space of multisets – coding for permutation channels with impairments,” IEEE Trans. Inform. Theory , vol. 64, no. 7, pp. 5156–5169, Jul. 2018.
- 7[7] A. Barg and G. D. Forney, “Random codes: minimum distances and error exponents,” IEEE Trans. Inform. Theory , vol. 48, no. 9, pp. 2568–2573, Sep. 2002.
- 8[8] R. G. Gallager, Information Theory and Reliable Communication . New York: John Wiley and Sons, 1968.
