Constructing Adjacency Arrays from Incidence Arrays

Hayden Jananthan; Karia Dibert; Jeremy Kepner

arXiv:1702.07832·cs.DS·September 19, 2017

Constructing Adjacency Arrays from Incidence Arrays

Hayden Jananthan, Karia Dibert, Jeremy Kepner

PDF

TL;DR

This paper establishes mathematical criteria for constructing adjacency arrays from incidence arrays in graph processing, detailing how different algebraic operations influence the resulting structure, with practical illustrations.

Contribution

It provides the necessary mathematical conditions for accurately deriving adjacency arrays from incidence arrays using various algebraic operations.

Findings

01

Criteria for adjacency array construction established

02

Impact of different algebraic operations analyzed

03

Practical examples using music metadata provided

Abstract

Graph construction, a fundamental operation in a data processing pipeline, is typically done by multiplying the incidence array representations of a graph, $E_{in}$ and $E_{out}$ , to produce an adjacency array of the graph, $A$ , that can be processed with a variety of algorithms. This paper provides the mathematical criteria to determine if the product $A = E_{out}^{T} E_{in}$ will have the required structure of the adjacency array of the graph. The values in the resulting adjacency array are determined by the corresponding addition $\oplus$ and multiplication $\otimes$ operations used to perform the array multiplication. Illustrations of the various results possible from different $\oplus$ and $\otimes$ operations are provided using a small collection of popular music metadata.

Equations79

\oplus

\oplus

\otimes

A = E_{out}^{T} E_{in}

A = E_{out}^{T} E_{in}

v \oplus 0

v \oplus 0

v \otimes 1

A^{T} (k_{2}, k_{1}) = A (k_{1}, k_{2})

A^{T} (k_{2}, k_{1}) = A (k_{1}, k_{2})

C = A \oplus . \otimes B = AB

C = A \oplus . \otimes B = AB

C (k_{1}, k_{2}) = k_{3} ⨁ A (k_{1}, k_{3}) \otimes B (k_{3}, k_{2})

C (k_{1}, k_{2}) = k_{3} ⨁ A (k_{1}, k_{3}) \otimes B (k_{3}, k_{2})

A : K_{1} \times K_{3} \to V

A : K_{1} \times K_{3} \to V

B : K_{3} \times K_{1} \to V

C : K_{1} \times K_{2} \to V

E_{out} (k, a) E_{in} (k, a) = E_{out}^{T} (a, k) E_{in} (k, a)

E_{out} (k, a) E_{in} (k, a) = E_{out}^{T} (a, k) E_{in} (k, a)

(E_{out}^{T} E_{in}) (a, b) = k \in K ⨁ E_{out}^{T} (a, k) E_{in} (k, b)

(E_{out}^{T} E_{in}) (a, b) = k \in K ⨁ E_{out}^{T} (a, k) E_{in} (k, b)

E_{out}^{T} (k_{out}, k) \neq = 0

E_{out}^{T} (k_{out}, k) \neq = 0

E_{in} (k, k_{in}) \neq = 0

k \in K ⨁ E_{out}^{T} (k_{out}, k) \otimes E_{in} (k, k_{in}) \neq = 0 ⟺ \exists k \in K so that E_{out}^{T} (k_{out}, k) \neq = 0 and E_{in} (k, k_{in}) \neq = 0

k \in K ⨁ E_{out}^{T} (k_{out}, k) \otimes E_{in} (k, k_{in}) \neq = 0 ⟺ \exists k \in K so that E_{out}^{T} (k_{out}, k) \neq = 0 and E_{in} (k, k_{in}) \neq = 0

k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0 ⟺ ∄ k \in K \mbox so t ha t E_{out} (k, x) \neq = 0 and E_{in} (k, y) \neq = 0

k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0 ⟺ ∄ k \in K \mbox so t ha t E_{out} (k, x) \neq = 0 and E_{in} (k, y) \neq = 0

k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0 ⟺ \forall k \in K, E_{out} (k, x) = 0 or E_{in} (k, y) = 0

k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0 ⟺ \forall k \in K, E_{out} (k, x) = 0 or E_{in} (k, y) = 0

k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0 \Rightarrow \forall k \in K, E_{out} (k, x) = 0 or E_{in} (k, y) = 0

k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0 \Rightarrow \forall k \in K, E_{out} (k, x) = 0 or E_{in} (k, y) = 0

\forall k \in K, E_{out} (k, x) = 0 \mbox or E_{in} (k, y) = 0 \Rightarrow k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0

\forall k \in K, E_{out} (k, x) = 0 \mbox or E_{in} (k, y) = 0 \Rightarrow k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0

E_{out} (k_{1}, a) = v

E_{out} (k_{1}, a) = v

E_{out} (k_{2}, a) = w

E_{in} (k_{i}, b) = 1

E_{out}^{T} E_{in} (b, a) = (v \otimes 1) \oplus (w \otimes 1) = v \oplus w = 0

E_{out}^{T} E_{in} (b, a) = (v \otimes 1) \oplus (w \otimes 1) = v \oplus w = 0

E_{out} (k, a) = v

E_{out} (k, a) = v

E_{in} (k, a) = w

E_{out}^{T} E_{in} (a, a) = E_{out} (k, a) \otimes E_{in} (k, a) = v \otimes w = 0

E_{out}^{T} E_{in} (a, a) = E_{out} (k, a) \otimes E_{in} (k, a) = v \otimes w = 0

E_{out} (k_{1}, a) = v = E_{in} (k_{1}, a)

E_{out} (k_{1}, a) = v = E_{in} (k_{1}, a)

E_{out} (k_{2}, b) = v = E_{in} (k_{2}, b)

E_{out} (k_{2}, b) = v = E_{in} (k_{2}, b)

0

0

\exists k \in K so that E_{out} (k, x) \neq = 0 and E_{in} (k, y) \neq = 0 \Rightarrow k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) \neq = 0

\exists k \in K so that E_{out} (k, x) \neq = 0 and E_{in} (k, y) \neq = 0 \Rightarrow k \in K ⨁ E_{out} (k, x) \otimes E_{in} (k, y) \neq = 0

\forall k \in K, E_{out} (e, x) = 0 or E_{in} (e, y) = 0 \Rightarrow k \in ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0

\forall k \in K, E_{out} (e, x) = 0 or E_{in} (e, y) = 0 \Rightarrow k \in ⨁ E_{out} (k, x) \otimes E_{in} (k, y) = 0

\overset{ˉ}{E}_{out}^{T} \overset{ˉ}{E}_{in} = E_{in}^{T} E_{out}

\overset{ˉ}{E}_{out}^{T} \overset{ˉ}{E}_{in} = E_{in}^{T} E_{out}

0 \otimes 1 = 1 \otimes 0 = 0

0 \otimes 1 = 1 \otimes 0 = 0

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Constructing Adjacency Arrays from Incidence Arrays

Hayden Jananthan1,2 Karia Dibert2,3 Jeremy Kepner2,3,4

1Vanderbilt University Mathematics Department, 2MIT Lincoln Laboratory Supercomputing Center,

3MIT Mathematics Department, 4MIT Computer Science & AI Laboratory

Abstract

Graph construction, a fundamental operation in a data processing pipeline, is typically done by multiplying the incidence array representations of a graph, $\mathbf{E}_{\mathrm{in}}$ and $\mathbf{E}_{\mathrm{out}}$ , to produce an adjacency array of the graph, $\mathbf{A}$ , that can be processed with a variety of algorithms. This paper provides the mathematical criteria to determine if the product $\mathbf{A}=\mathbf{E}^{\sf T}_{\mathrm{out}}\mathbf{E}_{\mathrm{in}}$ will have the required structure of the adjacency array of the graph. The values in the resulting adjacency array are determined by the corresponding addition $\oplus$ and multiplication $\otimes$ operations used to perform the array multiplication. Illustrations of the various results possible from different $\oplus$ and $\otimes$ operations are provided using a small collection of popular music metadata.

Index Terms:

graph; incidence array; adjacency array; semiring

I Introduction

††footnotetext: This material is based in part upon work supported by the NSF under grant number DMS-1312831. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

The duality between the canonical representation of graphs as abstract collections of vertices and edges and a matrix representation has been a part of graph theory since its inception [Konig 1931, Konig 1936]. Matrix algebra has been recognized as a useful tool in graph theory for nearly as long [Harary 1969, Sabadusi 1960, Weischel 1962, McAndrew 1963, Teh & Yap 1964, McAndrew 1965, Harary & Tauth 1964, Brualdi 1967]. The modern description of the duality between graph algorithms and matrix mathematics (or sparse linear algebra) has been extensively covered in the recent literature [Kepner & Gilbert 2011] and has further spawned the development of the GraphBLAS math library standard (GraphBLAS.org)[Mattson et al 2013] that has been developed in a series of proceedings [Mattson 2014a, Mattson 2014b, Mattson 2015, Buluç 2015, Mattson 2016] and implementations [Buluç & Gilbert 2011, Kepner et al 2012, Ekanadham et al 2014, Hutchison et al 2015, Anderson et al 2016, Zhang et al 2016].

Adjacency arrays, typically denoted $\mathbf{A}$ , have much in common with adjacency matrices. Likewise, incidence arrays or edge arrays, typically denoted $\mathbf{E}$ , have much in common with incidence matrices [Bruck & Ryser 1949, Ford & Fulkerson 1962, Fulkerson & Gross 1965, Fisher & Wing 1965], edge matrices [Dobrjanskyj & Freudenstein 1967], adjacency lists [Bodin & Kursh 1979], and adjacency structures [Tarjan 1972]. The powerful link between adjacency arrays and incidence arrays via array multiplication is the focus of the first part of this paper.

Incidence arrays are often readily obtained from raw data. In many cases, an associative array representing a spreadsheet or database table is already in the form of an incidence array. However, to analyze a graph, it is often convenient to represent the graph as an adjacency array. Constructing an adjacency array from data stored in an incidence array via array multiplication is one of the most common and important steps in a data processing system.

Given a graph $G$ with vertex set $K_{\mathrm{out}}\cup K_{\mathrm{in}}$ and edge set $K$ , the construction of adjacency arrays for $G$ relies on the assumption that $\mathbf{E}^{\sf T}_{\mathrm{out}}\mathbf{E}_{\mathrm{in}}$ is an adjacency array of $G$ . This assumption is certainly true in the most common case where the value set is composed of non-negative reals and the operations $\oplus$ and $\otimes$ are arithmetic plus ( $+$ ) and arithmetic times ( ${\times}$ ) respectively. However, one hallmark of associative arrays is their ability to contain as values nontraditional data. For these value sets, $\oplus$ and $\otimes$ may be redefined to operate on non-numerical values. For example, for the value of all alphanumeric strings, with

[TABLE]

it is not immediately apparent in this case whether $\mathbf{E}^{\sf T}_{\mathrm{out}}\mathbf{E}_{\mathrm{in}}$ is an adjacency array of the graph whose set of vertices is $K_{\mathrm{out}}\cup K_{\mathrm{in}}$ . In the subsequent sections, the criteria on the value set $V$ and the operations $\oplus$ and $\otimes$ are presented so that

[TABLE]

always produces an adjacency array [Dibert et al 2015].

I-A Definitions

For a directed graph (from here onwards, just ‘graph’) $G$ , $K_{\mathrm{out}}$ will denote the set of vertices which are the sources of edges, $K_{\mathrm{in}}$ will denote the set of vertices which are the targets of edges, and $K$ will denote the set of edges. The vertex set of $G$ will be assumed to be $K_{\mathrm{out}}\cup K_{\mathrm{in}}$ . $K_{\mathrm{out}}$ , $K_{\mathrm{in}}$ , and $K$ are assumed to be finite and totally-ordered.

$V$ will denote the set of values that the data can take on, such as non-negative real numbers or the elements of an ordered set. $\oplus$ and $\otimes$ are binary operations on $V$ (in particular, $V$ is closed under the operations $\oplus$ and $\otimes$ ), such as $\oplus=+$ and $\otimes=\times$ or $\oplus=\max$ and $\otimes=+$ . $\oplus$ and $\otimes$ each have identity elements [math] and $1$ , respectively, i.e.

[TABLE]

for all $v\in V$ .

For the purposes of understanding what algebraic properties are required for $\mathbf{E}^{\sf T}_{\mathrm{out}}\mathbf{E}_{\mathrm{in}}$ to be an adjacency array of a graph, $\oplus$ and $\otimes$ will not be assumed to be associative or commutative, and $\otimes$ does not necessarily distribute over $\oplus$ , nor is [math] assumed to be an annihilator of $\otimes$ .

Definition I.1 (Associative Array).

An associative array is a map $\mathbf{A}:K_{1}{\times}K_{2}\to V$ , where $K_{1}$ and $K_{2}$ are finite totally-ordered sets, referred to as key sets and whose elements are called keys, and $V$ is the value set.

Definition I.2 (Transpose).

If $\mathbf{A}:K_{1}{\times}K_{2}\to V$ is an associative array, then $\mathbf{A}^{\sf T}:K_{2}{\times}K_{1}\to V$ is the associative array defined as

[TABLE]

where $k_{1}\in K_{1}$ and $k_{2}\in K_{2}$ .

Definition I.3 (Array Multiplication).

Multiplication of associative arrays is defined as

[TABLE]

or more specifically

[TABLE]

where $\mathbf{A}$ , $\mathbf{B}$ , and $\mathbf{C}$ are associative arrays

[TABLE]

and $k_{1}\in K_{1}$ , $k_{2}\in K_{2}$ , $k_{3}\in K_{3}$ .

Definition I.4 (Incidence Arrays).

If $G$ is a graph with vertex set $K_{\mathrm{out}}\cup K_{\mathrm{in}}$ and edge set $K$ , then

$\mathbf{E}_{\mathrm{out}}$

: $K\times K_{\mathrm{out}}\to V$ is a source incidence array if $\mathbf{E}_{\mathrm{out}}(k,a)\neq 0$ if and only if the edge $k\in K$ is directed outward from the vertex $a\in K_{\mathrm{out}}$

$\mathbf{E}_{\mathrm{in}}$

: $K\times K_{\mathrm{in}}\to V$ is a target incidence array if $\mathbf{E}_{\mathrm{in}}(k,a)\neq 0$ if and only if the edge $k\in K$ is directed into the vertex $a\in K_{\mathrm{in}}$ .

Definition I.5 (Adjacency Array).

If $G$ is a graph with vertex set $K_{\mathrm{out}}\cup K_{\mathrm{in}}$ and edge set $K$ , then $\mathbf{A}:K_{\mathrm{out}}{\times}K_{\mathrm{in}}\to V$ is a adjacency array if $\mathbf{A}(a,b)\neq 0$ if and only if there is an edge with source $a$ and target $b$ .

II Adjacency Array Construction

If $\mathbf{A}$ is an adjacency array for a graph $G=(K_{\mathrm{out}}\cup K_{\mathrm{in}},K)$ , then $\mathbf{A}(a,b)\neq 0$ if and only if there is an edge $k$ with source $a$ and target $b$ , i.e. so that $\mathbf{E}_{\mathrm{out}}(k,a)\neq 0$ and $\mathbf{E}_{\mathrm{in}}(k,a)\neq 0$ . In the case where the product of two non-zero values is non-zero, this can be subsumed to say that $\mathbf{A}(a,b)\neq 0$ if and only if $\mathbf{E}_{\mathrm{out}}(k,a)\mathbf{E}_{\mathrm{in}}(k,a)$ . Writing this as

[TABLE]

This latter expression looks like a term in the evaluation

[TABLE]

but the introduction of more terms means that more assumptions need to be made about the relationships between $\oplus,\otimes$ , and [math].

Theorem II.1.

Let $V$ be a set with closed binary operations $\oplus,\otimes$ with identities $0,1\in V$ . Then the following are equivalent:

$\oplus$ * and $\otimes$ satisfy the properties*

(a)

Zero-Sum-Free: $a\oplus b=0$ if and only if $a=b=0$ , 2. (b)

No Zero Divisors: $a\otimes b=0$ if and only if $a=0$ or $b=0$ , and 3. (c)

[math]* is Annihilator for $\otimes$ : $a\otimes 0=0\otimes a=0$ .* 2. 2.

If $G$ is a graph with out-vertex and in-vertex incidence arrays $\mathbf{E}_{\mathrm{out}}:K{\times}K_{\mathrm{out}}\rightarrow V$ and $\mathbf{E}_{\mathrm{in}}:K{\times}K_{\mathrm{out}}\rightarrow V$ , then $\mathbf{E}_{\mathrm{out}}^{\sf T}\mathbf{E}_{\mathrm{in}}$ is an adjacency array for $G$ .

Proof.

Let $\mathbf{A}=\mathbf{E}_{\mathrm{out}}^{\sf T}\mathbf{E}_{\mathrm{in}}$ .

As above, for $\mathbf{A}$ to be the adjacency array of $G$ , the entry $\mathbf{A}(k_{\mathrm{out}},k_{\mathrm{in}})$ must be nonzero if and only if there is an edge from $k_{\mathrm{out}}$ to $k_{\mathrm{in}}$ , which is equivalent to saying that the entry must be nonzero if and only if there is a $k\in K$ such that

[TABLE]

Taken altogether, the above pair of equations imply

[TABLE]

First, the above condition can be restated in a form that more easily provides the zero-sum-freeness of $\oplus$ , lack of zero-divisors for $\otimes$ , and the fact that [math] annihilates under $\otimes$ . Equation II is equivalent to

[TABLE]

which in turn is equivalent to

[TABLE]

This expression may be split up into two conditional statements

[TABLE]

and

[TABLE]

Lemma II.2.

Equation 3 implies that $V$ is zero-sum-free.

Proof.

Suppose there exist nonzero $v,w\in V$ such that $v\oplus w=0$ , or that nontrivial additive inverses exist. Then it is possible to choose a graph $G$ to have edge set $\{k_{1},k_{2}\}$ and vertex set $\{a,b\}$ , where both $k_{1},k_{2}$ start from $a$ and end at $b$ . Then defining

[TABLE]

provides proper out-vertex and in-vertex incidence arrays for $G$ . Moreover, it is the case that

[TABLE]

which contradicts Equation 3. Therefore, no such nonzero $v$ and $w$ may be present in $V$ , meaning it is necessary that $V$ be zero-sum-free. ∎

Lemma II.3.

Equation 3 implies that $V$ has no zero-divisors.

Proof.

Suppose $v\otimes w=0$ . Define the graph $G$ to have edge set $\{k\}$ and vertex set $\{a\}$ with a single self-loop given by $k$ . Then define

[TABLE]

to obtain out-vertex and in-vertex incidence arrays for $G$ . Then

[TABLE]

Thus, Equation 3 implies that $v=w=0$ , and hence $V$ has no zero-divisors. ∎

Lemma II.4.

Equation 3 implies that [math] annihilates $V$ under $\otimes$ .

Proof.

Suppose $v\in V$ . Define the graph $G$ to have edge set $\{k_{1},k_{2}\}$ and vertex set $\{a,b\}$ , with self-loops at $a$ and $b$ given by $k_{1}$ and $k_{2}$ , respectively. Define

[TABLE]

and

[TABLE]

(and all other entries in $\mathbf{E}_{\mathrm{out}}$ and $\mathbf{E}_{\mathrm{in}}$ equal to [math]) results in out-vertex and in-vertex incidence arrays of $G$ . Moreover, it is true that

[TABLE]

By Lemma II.2, $V$ is zero-sum-free so it follows that $v\otimes 0=0\otimes v=0$ . Thus, [math] is an annihilator for $\otimes$ . ∎

Now Theorem II.1(i) is shown to be sufficient for Theorem II.1(ii) to hold. Assume that zero is an annihilator, $V$ is zero-sum-free, and $V$ has no zero-divisors. Zero-sum-freeness and the nonexistence of zero divisors give

[TABLE]

which is the contrapositive of Equation 3. And, that zero is an annihilator gives

[TABLE]

which is (4). As Equation 3 and Equation 4 combine to form Equation II, it is established that the conditions are sufficient for Equation II. ∎

III Adjacency Array of Reverse Graph

The remaining product of the incidence arrays that is defined is $\mathbf{E}^{\sf T}_{\mathrm{in}}\mathbf{E}_{\mathrm{out}}$ . The above requirements will now be shown to be necessary and sufficient for the remaining product to be the adjacency array of the reverse of the graph. Recall that the reverse of $G$ is the graph $\bar{G}$ in which all the arrows in $G$ have been reversed. Let $G$ be a graph with incidence matrices $\mathbf{E}_{\mathrm{out}}$ and $\mathbf{E}_{\mathrm{in}}$ .

Corollary III.1.

Condition (i) in Theorem II.1 are necessary and sufficient so that $\mathbf{E}^{\sf T}_{\mathrm{in}}\mathbf{E}_{\mathrm{out}}$ is an adjacency matrix of the reverse of $G$ .

Proof.

Let $\bar{G}$ denote the reverse of $G$ , and let $\bar{\mathbf{E}}_{\mathrm{out}}$ and $\bar{\mathbf{E}}_{\mathrm{in}}$ be out-vertex and in-vertex incidence arrays for $\bar{G}$ , respectively. Recall that $\bar{G}$ is defined to have the same edge and vertex sets as $G$ but changes the directions of the edges, in other words, if an edge $k$ leaves a vertex $a$ in $G$ , then it enters $a$ in $\bar{G}$ , and vice versa. As such, $\mathbf{E}_{\mathrm{out}}(k,a)\neq 0$ if and only if $\bar{\mathbf{E}}_{\mathrm{in}}(k,a)\neq 0$ , and likewise $\mathbf{E}_{\mathrm{in}}(k,a)\neq 0$ if and only if $\bar{\mathbf{E}}_{\mathrm{out}}(k,a)\neq 0$ . As such, choosing $\mathbf{E}_{\mathrm{out}}=\bar{\mathbf{E}}_{\mathrm{in}}$ and $\mathbf{E}_{\mathrm{in}}=\bar{\mathbf{E}}_{\mathrm{out}}$ gives valid in-vertex and out-vertex incidence matrices for $\bar{G}$ , respectively. Then by Theorem II.1 it can be shown that

[TABLE]

∎

It is now straightforward to identify algebraic structures that comply with the established criteria. Notably, all zero-sum-free semirings with no zero-divisors comply, such as $\mathbb{N}$ or $\mathbb{R}_{\geq 0}$ with the standard addition and multiplication. In addition, any linearly ordered set with $\oplus$ and $\otimes$ given by $\max$ and $\min$ , respectively. Some non-examples, however, include the max-plus algebra or non-trivial Boolean algebras, which do not satisfy the zero-product property, or rings, which except for the zero ring are not zero-sum-free. Furthermore, the value sets of associative arrays need not be defined exclusively as semirings, as several semiring-like structures satisfy the criteria. These structures may lack the properties of additive or multiplicative commutativity, additive or multiplicative associativity, or distributivity of multiplication over addition, which are not necessary to ensure that the product of incidence arrays yields an adjacency array.

The criteria guarantee an accurate adjacency array for any dataset that satisfies them, regardless of value distribution in the incidence arrays. However, if the incidence arrays are known to possess a certain structure, it is possible to circumvent some of the conditions and still always produce adjacency arrays. For example, if each key set of an undirected incidence array $\mathbf{E}$ is a list of documents and the array entries are sets of words shared by documents, then it is necessary that a word in $\mathbf{E}(i,j)$ and $\mathbf{E}(m,n)$ has to be in $\mathbf{E}(i,n)$ and $\mathbf{E}(m,j)$ . This structure means that when multiplying $\mathbf{E}^{\sf T}\mathbf{E}$ using $\oplus=\cup$ and $\otimes=\cap$ , a nonempty set will never be “multiplied” by (intersected with) a disjoint nonempty set. This eliminates the need for the zero-product property to be satisfied, as every multiplication of nonempty sets is already guaranteed to produce a nonempty set. The array produced will contain as entries a list of words shared by those two documents.

Though the criteria ensure that the product of incidence arrays will be an adjacency array, they do not ensure that certain matrix properties hold. For example, the property $(\mathbf{AB})^{\sf T}=\mathbf{B}^{\sf T}\mathbf{A}^{\sf T}$ may be violated under these criteria, as $(\mathbf{E}^{\sf T}_{\mathrm{out}}\mathbf{E}^{\sf T}_{\mathrm{in}})$ is not necessarily equal to $\mathbf{E}^{\sf T}_{\mathrm{in}}\mathbf{E}_{\mathrm{out}}$ . (For this matrix transpose property to always hold, the operation $\otimes$ would have to be commutative.)

IV Graph Construction with Different Semirings

The ability to change $\oplus$ and $\otimes$ operations allows different graph adjacency arrays to be constructed using the same element-wise addition, element-wise multiplication, and array multiplication syntax. Specific pairs of operations are best suited for constructing certain types of adjacency arrays. The pattern of edges resulting from array multiplication of incidence arrays is generally preserved for various semirings. However, the non-zero values assigned to the edges can be very different and enable the construction different graphs.

For example, constructing an adjacency array of the graph of music writers connected to music genres from Figure 1 begins with selecting the incidence sub-arrays $\mathbf{E}_{1}$ and $\mathbf{E}_{2}$ as shown in Figure 2. Array multiplication of $\mathbf{E}_{1}^{\sf T}$ with $\mathbf{E}_{2}$ produces the desired adjacency array of the graph. Figure 3 illustrates this array multiplication for different operator pairs $\oplus$ and $\otimes$ .

The pattern of edges among vertices in the adjacency arrays shown Figure 3 are the same for the different operator pairs, but the edge weights differ. All the non-zero values in $\mathbf{E}_{1}$ and $\mathbf{E}_{2}$ are 1. All the $\otimes$ operators in Figure 3 have the property

[TABLE]

for their respective values of zero be it 0, $\text{-}\infty$ , or $\infty$ . Likewise, all the $\otimes$ operators in Figure 3 also have the property

[TABLE]

except where $\otimes=+$ , in which case

[TABLE]

The differences in the adjacency array weights are less pronounced then if the values of $\mathbf{E}_{1}$ and $\mathbf{E}_{2}$ were more diverse. The most apparent difference is between the ${+}.{\times}$ semiring and the other semirings in Figure 3. In the case of ${+}.{\times}$ semiring, the $\oplus$ operation $+$ aggregates values from all the edges between two vertices. Additional positive edges will increase the overall weight in the adjacency array. In the other pairs of operations, the $\oplus$ operator is either $\max$ or $\min$ , which effectively selects only one edge weight to use for assigning the overall weight. Additional edges will only impact the edge weight in the adjacency array if the new edge is an appropriate maximum or minimum value. Thus, ${+}.{\times}$ constructs adjacency arrays that aggregate all the edges. The sother emirings construct adjacency arrays that select extremal edges. Each can be useful for construction graph adjacency arrays in appropriate context.

The impact of different semirings on the graph adjacency array weights are more pronounced if the values of $\mathbf{E}_{1}$ and $\mathbf{E}_{2}$ are more diverse. Figure 4 modifies $\mathbf{E}_{1}$ so that a value of 2 is given to the non-zero values in the column Genre $|$ Pop and a values of 3 is given to the non-zero values in the column Genre $|$ Rock.

Figure 5 shows the results of constructing adjacency arrays with $\mathbf{E}_{1}$ and $\mathbf{E}_{2}$ using different semirings. The impact of changing the values in $\mathbf{E}_{1}$ can be seen by comparing Figure 3 with Figure 5. For the ${+}.{\times}$ semiring, the values in the adjacency array rows Genre $|$ Pop and Genre $|$ Rock are multiplied by 2 and 3. The increased adjacency array values for these rows are a result of the $\otimes$ operator being arithmetic multiplication $\times$ so that

[TABLE]

For the ${\max}.{+}$ and ${\min}.{+}$ semirings, the values in the adjacency array rows Genre $|$ Pop and Genre $|$ Rock are larger by and 1 and 2. The larger values in the adjacency array of these rows is due to the $\otimes$ operator being arithmetic addition $+$ resulting in

[TABLE]

For the ${\max}.{\min}$ semiring, Figure 3 and Figure 5 have the same adjacency array because $\mathbf{E}_{2}$ is unchanged. The $\otimes$ operator corresponding to the minimum value function continues to select the smaller non-zero values from $\mathbf{E}_{2}$

[TABLE]

In contrast, for the ${\min}.{\max}$ semiring, the values in the adjacency array rows Genre $|$ Pop and Genre $|$ Rock are larger by and 1 and 2. The increase in adjacency array values for these rows are a result of the $\otimes$ operator selecting the larger non-zero values from $\mathbf{E}_{1}$

[TABLE]

Finally, for the ${\max}.{\times}$ and ${\min}.{\times}$ semirings, the values in the adjacency array rows Genre $|$ Pop and Genre $|$ Rock are increased by and 1 and 2. Similar to the ${+}.{\times}$ semiring, the larger adjacency array values for these rows are a result of the $\otimes$ operator being arithmetic multiplication $\times$ resulting in

[TABLE]

Figures 3 and 5 show that a wide range of graph adjacency arrays can be constructed via array multiplication of incidence arrays over different semirings. A synopsis of the graph constructions illustrated in Figures 3 and 5 is as follows

${+}.{\times}$

sum of products of edge weights connecting two vertices; computes the strength of all connections between two connected vertices.

${\max}.{\times}$

maximum of products edge weights connecting two vertices; selects the edge with largest weighted product of all the edges connecting two vertices.

${\min}.{\times}$

minimum of products edge weights connecting two vertices; selects the edge with smallest weighted product of all the edges connecting two vertices.

${\max}.{+}$

maximum of sum of edge weights connecting two vertices; selects the edge with largest weighted sum of all the edges connecting two vertices.

${\min}.{+}$

minimum of sum of edge weights connecting two vertices; selects the edge with smallest weighted sum of all the edges connecting two vertices.

${\max}.{\min}$

maximum of the minimum of weights connecting two vertices; selects the largest of all the shortest connections between two vertices.

${\min}.{\max}$

minimum of the maximum of weights connecting two vertices; selects the smallest of all the largest connections between two vertices.

V Conclusion

Graph construction, a fundamental operation in a data processing pipeline, is typically done by multiplying the incidence array representations of a graph, $\mathbf{E}_{\mathrm{in}}$ and $\mathbf{E}_{\mathrm{out}}$ , to produce an adjacency array of the graph, $\mathbf{A}$ . The mathematical criteria to determine if $\mathbf{A}$ will have the required structure of the adjacency array of the graph over are as follows. Let $V$ be a set with closed binary operations $\oplus,\otimes$ with identities $0,1\in V$ . Then the following are equivalent:

$\oplus$ and $\otimes$ satisfy the properties

(a)

Zero-Sum-Free: $a\oplus b=0$ if and only if $a=b=0$ , 2. (b)

No Zero Divisors: $a\otimes b=0$ if and only if $a=0$ or $b=0$ , and 3. (c)

[math] is Annihilator for $\otimes$ : $a\otimes 0=0\otimes a=0$ . 2. 2.

If $G$ is a graph with out-vertex and in-vertex incidence arrays $\mathbf{E}_{\mathrm{out}}:K{\times}K_{\mathrm{out}}\rightarrow V$ and $\mathbf{E}_{\mathrm{in}}:K{\times}K_{\mathrm{out}}\rightarrow V$ , then $\mathbf{E}_{\mathrm{out}}^{\sf T}\mathbf{E}_{\mathrm{in}}$ is an adjacency array for $G$ .

The values in the resulting adjacency array are determined by the corresponding addition $\oplus$ and multiplication $\otimes$ operations used to perform the array multiplication.

Acknowledgment

The authors would like to thank Paul Burkhardt, Alan Edelman, Sterling Foster, Vijay Gadepally, Sam Madden, Dave Martinez, Tom Mattson, Albert Reuther, Victor Roytburd, and Michael Stonebraker.

Bibliography31

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[Anderson et al 2016] M. Anderson, N. Sundaram, N. Satish, M. Patwary, T. L. Willke, & P. Dubey, Graph Pad: Optimized Graph Primitives for Parallel and Distributed Platforms, submitted
2[Bodin & Kursh 1979] L. Bodin & S. Kursh, A detailed description of a computer system for the routing and scheduling of street sweepers , Computers & Operations Research, 6(4), 181-198, 1979
3[Brualdi 1967] R.A. Brualdi, Kronecker products of fully indecomposable matrices and of ultrastrong digraphs , Journal of Combinatorial Theory, 2:135-139, 1967
4[Bruck & Ryser 1949] R. Bruck & H. Ryser, The nonexistence of certain finite projective planes , Canadian Journal of Mathematics, 1, 88-93, 1949
5[Buluç & Gilbert 2011] A. Buluç & J. Gilbert, The Combinatorial BLAS: Design, implementation, and applications . International Journal of High Performance Computing Applications (IJHPCA), 2011
6[Buluç 2015] A. Buluç, Graph BLAS Special Session, IEEE HPEC 2015, Waltham, MA
7[Dibert et al 2015] K. Dibert, H. Jansen & J. Kepner, Algebraic Conditions for Generating Accurate Adjacency Arrays , IEEE MIT Undergraduate Research Technology Conference, 2015
8[Dobrjanskyj & Freudenstein 1967] L. Dobrjanskyj & F. Freudenstein, Some applications of graph theory to the structural analysis of mechanisms , Journal of Engineering for Industry, 89(1), 153-158, 1967

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Constructing Adjacency Arrays from Incidence Arrays

Abstract

Index Terms:

I Introduction

I-A Definitions

Definition I.1** (Associative Array).**

Definition I.2** (Transpose).**

Definition I.3** (Array Multiplication).**

Definition I.4** (Incidence Arrays).**

Definition I.5** (Adjacency Array).**

II Adjacency Array Construction

Theorem II.1**.**

Proof.

Lemma II.2**.**

Proof.

Lemma II.3**.**

Proof.

Lemma II.4**.**

Proof.

III Adjacency Array of Reverse Graph

Corollary III.1**.**

Proof.

IV Graph Construction with Different Semirings

V Conclusion

Acknowledgment

Definition I.1 (Associative Array).

Definition I.2 (Transpose).

Definition I.3 (Array Multiplication).

Definition I.4 (Incidence Arrays).

Definition I.5 (Adjacency Array).

Theorem II.1.

Lemma II.2.

Lemma II.3.

Lemma II.4.

Corollary III.1.