Lossy Asymptotic Equipartition Property For Geometric Networked Data Structures
Kwabena Doku-Amponsah

TL;DR
This paper extends the Asymptotic Equipartition Property to geometric networked data, specifically wireless sensor networks modeled as colored geometric random graphs, using large deviation principles.
Contribution
It introduces a generalized AEP for CGRGs and applies it to real sensor network data for water quality monitoring.
Findings
Extended AEP to CGRGs using large deviation techniques
Validated theoretical results with real sensor network data
Provided insights into data compression for geometric network models
Abstract
This article extends the Generalized Asypmtotic Equipartition Property of Networked Data Structures to cover the Wireless Sensor Network modelled as coloured geometric random graph (CGRG). The main techniques used to prove this result remains large deviation principles for properly defined empirical measures on CGRGs. As a motivation for this article, we apply our results to some data from Wireless Sensor Network for Monitoring Water Quality from a Lake..
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Lossy Asymptotic Equipartition Property for Geometric Networked Data Structures
By Kwabena Doku-Amponsah
††Mathematics Subject Classification : 94A15, 94A24, 60F10, 05C80††*Keywords: * Information theory, rate-distortion theory,exponential equivalent measures, relative entropy, geometric networked data structures, wireless sensor networks.††Address: Statistics Department, University of Ghana, Box LG 115, Legon,Ghana. E-mail: [email protected].
University of Ghana
Abstract. This article extends the Generalized Asypmtotic Equipartition Property of Networked Data Structures to cover the Wireless Sensor Network modelled as coloured geometric random graph (CGRG). The main techniques used to prove this result remains large deviation principles for properly defined empirical measures on CGRGs. As a motivation for this article, we apply our results to some data from Wireless Sensor Network for Monitoring Water Quality from a Lake.
1. Introduction
Field data we often encounter from the study of the environment are usually structured according to geometry and the connectivity between the locations that make up the environment. Example,data from (i) monitoring air quality at key industrial sites, (ii) looking for key contaminating agents from the exhausts of public buses, (iii) monitoring the cleanliness in lakes and many more, are all structured according to the geometry of the area of study and the connectivity of the location that make up the environment. To design and implement simplex (Linear programming) algorithm for the solution of generalized network flow problems of the geometric structured network data ,see example [1], or to find an efficient coding scheme or an approximate pattern matching algorithms, see example [2], we need an information theory for such data structures, and the lossy Asymptotic Equipartition Property (AEP) for the geometric networked data structures is key to finding an information theory for the data structure. See [6] and [7] for similar results for other types of data structures.
The aim of this article is to extend the Lossy AEP for Networked Data Structures modelled as Coloured Random Graph (CRG), see [6, Theorem 2.1], to cover the WSN. To be specific we model the Geometric Networked Data Structures ( WSN) as a CGRG and use some of the large deviation techniques developed in [7] to prove a strong law of large numbers (SLLN), see Lemma 3.3, for the random network. Using the SLLN and the techniques deployed in [9] we extend the Lossy AEP to cover the WSN.
The remaining part of the paper is organized as follows: Section 2 contains the main result of the paper and an application to some data from environmental science. See, Theorem 2.1 in Subsection 2.1 and the application in Subsection 2.2. Section 3 gives the proof of the main result; starting with the LDPs (Lemmas 3.1 and 3.2) in Subsection 3.1, followed by statement and proof of a strong law of large numbers, see Lemma 3.3 and ending with derivation of the main results from the SLLN in subsection 3.2.
2. Generalized AEP for CGRG Process
2.1. Main Result
We consider two CGRG processes X^{[z]}=\big{\{}(X(z_{1}),X(z_{2})):\,z_{i}\,z_{j}\in E,\,i,j=1,2,3,...,n,i\not=j\big{\}} and Y^{[z]}=\big{\{}(Y(z_{i}),Y(z_{j})):\,z_{i}\,z_{j}\in E,\,i,j=1,2,3,...,n,i\not=j\big{\}} which take values in and resp., the spaces of finite graphs on and We equip , with their Borel fields and Let and denote the probability measures of the entire processes and By and we denote the coloured geometric random graphs and conditioned to have empirical colour measure and empirical pair measure See, example [3]. We always assume that and are independent of each other.
By we denote a finite alphabet and denote by the space of counting measure on equipped with the discrete topology. By we denote the space of probability measures on equipped with the weak topology and the space of finite measures on equipped with the weak topology.
We define the process-level empirical measure induced by and on by
[TABLE]
[TABLE]
for
Throughout the rest of the article we will assume that and are CGRG processes, See [11]. For , let denote the marginal distribution of on taking with respect to and denote the marginal distribution on with respect to
Let be an arbitrary non-negative function and define a sequence of single-letter distortion measures by
[TABLE]
where and Given and , we denote the distortion-ball of radius by
[TABLE]
We shall call the measure consistent if , are both consistent marginals of Refer to [7, Equation 2.1] for the concept of consistent measures.
For we write
[TABLE]
and define the rate function by
[TABLE]
where
[TABLE]
By we mean has distribution For we write
[TABLE]
Assume
[TABLE]
For we write
[TABLE]
and
[TABLE]
Theorem 2.1 (ii) below provides a Lossy AEP for WSN data structures.
Theorem 2.1**.**
Suppose and are CGRG process. Assume are bounded function. Then,
- (i)
with probability conditional on the event \big{\{}\,\Psi({\mathcal{L}}_{n,[z],1})=\Psi({\mathcal{L}}_{n,[z],2})=(\pi,\omega)\big{\}} the random variables \Big{\{}\sigma^{(n)}(x,Y^{[z]})\Big{\}} satisfy an LDP with deterministic, convex rate-function
[TABLE]
- (ii)
for all \alpha\in\Big{(}\alpha_{min}(\pi,\omega),\,\alpha_{av}(\pi,\omega)\Big{)}, except possibly at
[TABLE]
where
2.2. Application:
Wireless Sensor Network for Monitoring Water Quality from a Lake. Let consider a WSN (to monitor the cleanliness in lakes, particularly those used as sources of drinking water) consisting of sensors capable of carrying out some processing, gathering sensory information and communicating with other connected nodes in the network modelled as coloured geometric random graph on location, say By we denote sensors capable of carrying out some processing, gathering sensory information while communicating with other sensors and sensors gathering sensory information while communicating with other sensors. Suppose the locations are partition into block of and block of and number of communication links divided into different interactions, respectively, for a function which depends on the connectivity radius of the WSN. Assume converges and converges If we take then, by Theorem 2.1 we have the rate-distortion of
[TABLE]
where \omega^{\Delta(d)}(a,b)=\mbox{\frac{\pi^{d/2}}{\big{[}d/2\big{]}!}}\lambda_{[d]}(a,b))\pi(a)\pi(b). See, [6] for the relationship between the connectivity radius and We refer to [13] for more on modelling of the physical environment using the Wireless Sensor Network.
3. Proof of Theorem 2.1.
3.1. LDPs.
Recall from [7] that X=\big{\{}(X(u),X(v)):\,uv\in E\big{\}} and Y=\big{\{}(Y(u),Y(v)):\,uv\in E\big{\}} are CRG processes with values from and resp., the spaces of finite graphs on
We define the process-level empirical measure induced by and on by
[TABLE]
Lemma 3.1** (Exponential Equivalence).**
Suppose are CGRG on the dimensional Torus and are CRG. Then, conditional on the event \big{\{}\,\Psi({\mathcal{L}}_{n,^{[}z],1})=\Psi({\mathcal{L}}_{n,[z],2})=\Psi({\mathcal{L}}_{n,1})=\Psi({\mathcal{L}}_{n,2})=(\pi,\omega)\big{\}} the law of is exponentially equivalent to the law of
Proof.
We denote by the random allocation process and notice from [6, Lemma 3.1] and [5, Lemma 0.4] that conditional on \big{\{}\,\Psi({\mathcal{L}}_{n,[z],1})=\Psi({\mathcal{L}}_{n,[z],2})=\Psi({\mathcal{L}}_{n,1})=\Psi({\mathcal{L}}_{n,2})=(\pi,\omega)\big{\}} the law of is exponentially equivalent to the law of and the law of is exponentially equivalent to the law of Therefore, conditional on \big{\{}\,\Psi({\mathcal{L}}_{n,[z],1})=\Psi({\mathcal{L}}_{n,[z],2})=\Psi({\mathcal{L}}_{n,1})=\Psi({\mathcal{L}}_{n,2})=(\pi,\omega)\big{\}} we have exponentially equivalent to
∎
Lemma 3.2** (LDP).**
Suppose are coloured geometric random graph on the dimensional Torus. Then, conditional on the event \big{\{}\,\Psi({\mathcal{L}}_{n,[z],1})=\Psi({\mathcal{L}}_{n,[z],2})=(\pi,\omega)\big{\}} the law of obeys a process level LDP with good rate function
The proof of this Lemma 3.2 which follows from 3.1 [7, Theorem ] and [10, Theorem 4.2.13], is omitted from the paper.
3.2. Derivation of the AEP.
We write and define the set by
[TABLE]
Lemma 3.3** (SLLN).**
Suppose the sequence of measures converges to the pair of measures For any we have \lim_{n\to\infty}\mathbb{P}_{(\pi_{n},\omega_{n})}\big{(}{\mathcal{C}}_{\pi\omega}^{\varepsilon}\big{)}=0.
Observe that defined above is a closed subset of and so by Lemma 3.2 we have that
[TABLE]
We use proof by contradiction to show that the right hand side of (3.1) is negative.Suppose that there exists sequence in such that Then, there is a limit point with Note is a good rate function and its level sets are compact, and the mapping ) lower semi-continuity. Now implies for all which contradicts .
(i) Notice and if is open (closed) subset of then
[TABLE]
is also open (closed) set since is bounded function.
[TABLE]
(ii) Observe that are bounded, therefore by Varadhan’s Lemma and convex duality, we have
[TABLE]
where
[TABLE]
exits for almost everywhere Using bounded convergence, we can show that
[TABLE]
Using Lemma 3.3, by boundedness of we have that
[TABLE]
Also let
[TABLE]
so that for , while for Observe that for we have \alpha_{min}^{(n)}(\pi,\omega)=\mathbb{E}_{P_{x}^{(n)}}\big{[}{\rm essinf}\,_{Y\,\approx\,Q_{y}^{(n)}}\sigma^{(n)}(X^{[z]},Y^{[z]})\big{]}, which converges to Applying similar arguments as [9, Proposition 2] we obtain
[TABLE]
Now we observe from [9, Page 41] that the converge of is uniform on compact subsets of Moreover, is convex, continuous functions converging informally to and hence we can invoke [12, Theorem 5] to obtain
[TABLE]
Applying similar arguments as [9, Page 41] in the lines after equation (64) we have (2.3) which completes the proof.
Conflict of Interest
The author declares that he has no conflict of interest.
Acknowledgement
This extension has been mentioned in the author’s PhD Thesis at University of Bath.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] I. Ali , A. Charnes and T. Song (1986). Design and Implementation of Data Structures for Generalized Networks. Journal of Information and Optimization Sciences, 7(), 81-104.
- 2[2] T.M. Cover and J.A. Thomas (1991). Elements of Information Theory. Wiley Series in Telecommunications, (1991).
- 3[3] K. Doku-Amponsah. (2006). Large deviations and basic information theory for hierarchical and networked data structures. Ph D Thesis, Bath (2006).
- 4[4] K. Doku-Amponsah (2012). Asymptotic equipartition propeties for hierarchical and networked structures. ESAIM: PS 16 (2012): 114-138.DOI: 10.1051/ps/2010016.
- 5[5] K. Doku-Amponsah (2014). Exponential Approximation, Method of types for Empirical Neighbourhood Measures of Random graphs by Random Allocation. Int. J. Stat. & Prob., 3(2),110-120 (2014).
- 6[6] K. Doku-Amponsah. (2017). Lossy Asymptotic Equipartition Property for Networked Data Structures. J. Math. & Stat. 13(2):152-158 •
- 7[7] K. Doku-Amponsah (2015). Joint large deviation result for empirical measures of the coloured random geometric graphs. Springer Plus.2016, 5:1140; Vol. 4, No. 1 (2015),pp. 87-93
- 8[8] K. Doku-Amponsah (2017). Lossy Asymptotic Equipartition Property For Hierarchical Data Structures. Far East J. Math. Sc. 101(5):1013-1024.
