Harmonic Analysis of Symmetric Random Graphs
Steffen Lauritzen

TL;DR
This paper explores the harmonic analysis of symmetric random graphs, representing their probability distributions as mixtures of characters on a semigroup, offering an alternative view of de Finetti's theorem for exchangeable graphs.
Contribution
It introduces a harmonic analysis framework on semigroups to understand graph limits and provides a new derivation of de Finetti's theorem for exchangeable graphs.
Findings
Representation of random graph distributions as mixtures of characters
Alternative derivation of de Finetti's theorem for graphs
Insight into graph limits via harmonic analysis
Abstract
This note attempts to understand graph limits as defined by Lovasz and Szegedy (2006)} in terms of harmonic analysis on semigroups. This is done by representing probability distributions of random exchangeable graphs as mixtures of characters on the semigroup of unlabeled graphs with node-disjoint union, thereby providing an alternative derivation of de Finetti's theorem for random exchangeable graphs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Harmonic Analysis of Symmetric Random Graphs
Dedicated to the memory of František Matúš
Steffen Lauritzen
University of Copenhagen
Abstract
This note attempts to understand graph limits as defined by Lovász and Szegedy, (2006) in terms of harmonic analysis on semigroups. This is done by representing probability distributions of random exchangeable graphs as mixtures of characters on the semigroup of unlabeled graphs with node-disjoint union, thereby providing an alternative derivation of de Finetti’s theorem for random exchangeable graphs.
Key words: characters, deFinetti’s theorem, exchangeability, extreme point models, graph limits, graphons, positive definite functions, semigroups.
1 Introduction
Random exchangeable graphs are fundamental for the analysis of network data, see for example Orbantz and Roy, (2015) and Lauritzen et al., (2018). Diaconis and Janson, (2008) showed that the modern theory of graph limits (Lovász and Szegedy,, 2006; Lovász,, 2012) gave a natural way of understanding such graphs.
Some of the arguments associated with establishing the connection between exchangeable graphs and graph limits can appear complicated. However, as earlier demonstrated by Ressel, (1985, 2008), the theory of positive definite functions on Abelian semigroups provides a simple, generic structure for understanding exchangeability, and this is what this note attempts to exploit and explain in somewhat larger detail. Also, this analysis establishes that the graphon based models for random exchangeable graphs can be understood as natural ‘exponential families’ for random graphs, in contrast to the so-called exponential random graph models (ERGMs) (Holland and Leinhardt,, 1981; Snijders et al.,, 2006).
An ultra short summary of the main points in the developments below has appeared as part of Lauritzen et al., (2019).
2 Harmonic analysis on semigroups
We shall use the following concepts from Berg et al., (1976), see also Berg et al., (1984). We consider an Abelian semigroup with neutral element [math] and identity involution. A function is a character if and only if
[TABLE]
that is, a character behaves like an exponential function. A function is positive definite if and only if
[TABLE]
in other words, a function is positive definite if and only if all matrices of the form are positive semidefinite.
We let denote the set of positive definite functions on , the set of all bounded positive definite functions, and the set of bounded positive definite functions with . When equipped with the topology of pointwise convergence, is a compact Hausdorff space.
We let denote the set of all bounded characters on and note that these form an Abelian semigroup under multiplication; is the dual semigroup to . We shall be using the following result from Berg et al., (1976).
Theorem 1** (Berg, Christensen, Ressel).**
The set is a Bauer simplex with the bounded characters as extreme points. In particular, any has a unique representation as barycentre of a probability measure on :
[TABLE]
and the set of bounded characters form a closed subset of .
3 Generalized exponential families
Generalized exponential families were introduced in Lauritzen, (1975) and studied further in Lauritzen, (1988) where Theorem 1 was exploited to identify these as so-called extreme point models for i.i.d. observations.
More precisely, a generalized exponential family of distributions on a discrete state space is determined as a set of probability mass functions of the form
[TABLE]
where is a canonical sufficient statistic with values in an Abelian semigroup , defines a base measure, are the (not necessarily bounded if is not uniform) characters, and is the canonical parameter space
[TABLE]
Note that is the Laplace transform of the lifted base measure on :
[TABLE]
These exponential families share many (but not all) properties of more standard exponential families which have the same form but for the special semigroup , usually considered as a vector space.
4 Graphs
For any we let and denote the set of simple labeled graphs and simple unlabeled graphs with node set ; further, we let be the empty graph, the set of infinite graphs with node set , , and .
For we let denote the equivalence class of all labeled graphs isomorphic to . We will also without ambiguity think of as an unlabeled graph.
If and are in , we write if is a subgraph of . If is a subset of the node set in a labeled graph , the subgraph induced by has as node set and edge set equal to all edges in wiht both endpoints in . For and , is the subgraph of induced by . An infinite graph in can be identified with its sequence of induced subgraphs ; such a sequence is consistent in the sense that for all we have .
For , , and , the permutation group on , we will let be the graph obtained from by relabeling its nodes according to . Thus in if and only if in .
5 Symmetric random graphs
We consider a probability distribution on and say this is symmetric if and only if
[TABLE]
We then also say the random graph or its distribution is exchangeable (Aldous,, 1981, 1985; Diaconis and Freedman,, 1981; Matúš,, 1995; Diaconis and Janson,, 2008; Lauritzen,, 2008).
It is practical to represent the distribution through its Möbius parameters (Drton and Richardson,, 2008; Lauritzen et al.,, 2018, 2019) where is the set of labeled graphs with no isolated nodes and
[TABLE]
The quantities and are related by the Möbius transform; if
[TABLE]
where is the complete graph on nodes and denotes the cardinality of the differencs of the corresponding edge set. A non-negative function is a valid Möbius parameter if and only if the left-most expression in (1) is non-negative for all . We note that the positivity condition is not so easy to verify for a given function , so it is of interest to derive alternative representations.
Clearly, is symmetric if and only if is symmetric, or, equivalently, if there is a function such that
[TABLE]
We also note that — where is (node disjoint) graph union — is an Abelian semigroup with the empty graph as its neutral element. So the map which maps any finite graph into its equivalence class: , appears as the canonical sufficient statistic in the generalized exponential family of exchangeable random graphs. Moreover we have
Lemma 1**.**
Let be a random exchangeable graph with Möbius parameter given as above. Then the function is bounded and positive definite on ; in other words, .
Proof.
Clearly and is bounded. Introduce the binary random variables for where if in and otherwise; is the (random) adjacency matrix of . Then, clearly
[TABLE]
So elementary calculations will verify that for and being node-disjoint we have
[TABLE]
where is a representative of which is node-disjoint from for all . This completes the proof. ∎
We note that the property in Lemma 1 is referred to as reflection positivity in Lovász and Szegedy, (2006). A de Finetti type representation of random symmetric graphs can now be obtained as a Corollary to Theorem 1:
Corollary 1** (deFinetti’s theorem for exchangeable random graphs).**
Let be the distribution of an random graph with Möbius parameter . Then is exchangeable if and only if there is a unique probability measure on such that for all
[TABLE]
We note in particular that the extreme points of the convex set of exchangeable measures — corresponding to the pure characters — are dissociated (Silverman,, 1976), i.e if and and are node disjoint subgraphs of it holds that
[TABLE]
or, in other words, we have for node-disjoint :
[TABLE]
6 Characters on the semigroup of unlabeled graphs
To get a more detailed version of de Finetti’s theorem for exchangeable graphs we need to identify the characters on that enter into the representation above since not all bounded characters will be valid in the sense that their Möbius transform would be non-negative. We shall say that such characters are fully positive and denote the set of these characters by . We consider first the homomorphism densities
[TABLE]
where and is the number of graph homomorphisms (edge preserving maps) from to and, as before and is the cardinality of the edge sets of and . These are multiplicative in the sense that for node disjoint subgraphs
[TABLE]
Noting that in fact only depends on through their isomorphism classes we can for define a character as
[TABLE]
In addition we shall need the injective homomorphism densities
[TABLE]
where and is the number of injective graph homomorphisms (edge preserving maps) from to , and
[TABLE]
is the falling factorial.
The homomorphism densities can be understood as Möbius parameters of a probability distributions of a random graph , where vertices in are sampled from with or without replacement respectively. If we let and denote the corresponding probability distributions, we have therefore the inequalities
[TABLE]
and therefore in particular
[TABLE]
see Freedman, (1977) and Lemma 3.3 in Lauritzen et al., (2019) or Lemma 2.1 in Lovász and Szegedy, (2006) who give the slightly weaker bound to the right.
In particular we note that for any fixed , the right-hand side in (5) tends to zero as .
Note now that for any exchangeable random graph with Möbius parameter , the law of total probability yields
[TABLE]
where the induced measure on , We then have
Theorem 2**.**
For every , the function is a bounded and fully positive character in . Further, these characters are dense in : if we let denote the closure of within (pointwise convergence), we have .
Proof.
For any , is clearly a bounded and fully positive character by (3). Since the set of bounded characters is closed by Theorem 1 and therefore also the set of fully positive characters, we clearly have . We need to establish the reverse inequality.
Thus let and let be the corresponding probability measure on with the induced measure on . By (5) and (7) it then holds for any and any that
[TABLE]
where
[TABLE]
Using (4) we can rewrite (8) as
[TABLE]
which now holds for all .
Since is compact, so is the set of probability measures on and hence the sequence must have an accumulation point which then satisfies for all :
[TABLE]
Now, as the integral representation is unique by Theorem 1 and Corollary 1, we must have concentrated on and thus which completes the proof. ∎
Note that, in effect, (9) yields a finite deFinetti type representation for an exchangeable random graph, see also Diaconis and Freedman, (1980) and Lauritzen et al., (2019). We state this results as its own corollary and note that this is in fact Theorem of Matúš, (1995):
Corollary 2** (deFinetti’s theorem for finitely exchangeable random graphs).**
Let be the induced distribution of where is a finitely exchangeable random graph in . Then there is an exchangeable random graph such that the distribution of satisfies
[TABLE]
where is the total variation norm.
Proof.
We define by its Möbius parameter as
[TABLE]
or, equivalently
[TABLE]
and now (5) yields
[TABLE]
which was required. ∎
Elements of can naturally be interpreted as limits of unlabeled graphs (Lovász and Szegedy,, 2006; Borgs et al.,, 2008; Lovász,, 2012) by the embedding and
[TABLE]
so we can write .
In addition, the characters can be represented by (equivalence classes of) functions also known as a graphons, see references above, corresponding to adjacency matrices of infinitely exhangeable random arrays (Aldous,, 1981; Hoover,, 1979; Diaconis and Freedman,, 1981). This is contained in the following result:
Theorem 3**.**
The fully positive and bounded characters on are exactly the functions satisfying for
[TABLE]
for some measurable, symmetric function The function is unique up to measure-preserving transformations of the unit interval.
Proof.
It is straightforward to see that these are fully positive and bounded characters. Also for every graph we can construct a specific by partitioning the unit interval into subintervals
[TABLE]
and letting for being a representative of
[TABLE]
and then show that these converge in a suitable metric exactly when . We refrain from giving the details of this and refer to Lovász and Szegedy, (2006) or Lovász, (2012). ∎
The graphon representation of a graph limit has its advantages, but also its disadvantages in that it can be difficult to identify exactly when two graphons and are representing the same character since there are many measure-preserving transformations of the unit interval. In that sense, the representation as a character is more direct and there is a one-to-one correspondence between and the corresponding random graph. However, in general, it is not so easy to decide whether a given function specifies a valid probability distribution, i.e. satisfies the positivity restriction in (1).
Acknowledgement
I am grateful to an anonymous referee for sharp and constructive comments, correcting substantial shortcomings in the exposition.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aldous, (1981) Aldous, D. (1981). Representations for partially exchangeable random variables. Journal of Multivariate Analysis , 11:581–598.
- 2Aldous, (1985) Aldous, D. (1985). Exchangeability and related topics. In Hennequin, P., editor, École d’Été de Probabilités de Saint–Flour XIII — 1983 , pages 1–198. Springer-Verlag, Heidelberg. Lecture Notes in Mathematics 1117.
- 3Berg et al., (1976) Berg, C., Christensen, J. P. R., and Ressel, P. (1976). Positive definite functions on Abelian semigroups. Mathematische Annalen , 259:253–274.
- 4Berg et al., (1984) Berg, C., Christensen, J. P. R., and Ressel, P. (1984). Harmonic Analysis on Semigroups . Springer-Verlag, New York.
- 5Borgs et al., (2008) Borgs, C., Chayes, J., Lovász, L., Sós, V., and Vesztergombi, K. (2008). Convergent sequences of dense graphs I: Subgraph frequencies, metric properties and testing. Advances in Mathematics , 219:1801–1851.
- 6Diaconis and Freedman, (1980) Diaconis, P. and Freedman, D. (1980). Finite exchangeable sequences. Annals of Probability , 8:745–764.
- 7Diaconis and Freedman, (1981) Diaconis, P. and Freedman, D. (1981). On the statistics of vision: the Julesz conjecture. Journal of Mathematical Psychology , 24:112–138.
- 8Diaconis and Janson, (2008) Diaconis, P. and Janson, S. (2008). Graph limits and exchangeable random graphs. Rendiconti di Matematica, Serie VII , 28:33–61.
