Visualizations for Understanding SoC Behaviour

Dave McEwan; Marcin Hlond; Jose Nunez-Yanez

arXiv:1905.06386·eess.SP·May 17, 2019·PRIME

Visualizations for Understanding SoC Behaviour

Dave McEwan, Marcin Hlond, Jose Nunez-Yanez

PDF

TL;DR

This paper presents a new visualization method for analyzing System-on-Chip (SoC) behavior, aiding understanding, optimization, and validation through automated visual representations of component interactions.

Contribution

It introduces a mathematically grounded visualization technique that automates the analysis of component interactions within SoCs, enhancing traditional investigation tools.

Findings

01

Effective visualization of component interactions demonstrated in case studies

02

Method aids in system optimization, monitoring, and validation

03

Automates analysis process, reducing manual effort

Abstract

This paper introduces a novel method of analysis for SoC development building upon commonly used tools and techniques to approximate and automate the human process of investigation. Knowledge of the interactions between components within a SoC is essential for understanding how a system works so the presented method provides a way of visualizing these interactions. The mathematical basis for the method is explained and justified, then the method is demonstrated using two representative case studies. Visualizations from the case studies are used to exhibit the usefulness of the method for system optimization, monitoring, and validation.

Equations20

w (t)

w (t)

E [f_{i}]

Pr (X ∣ Y)

Pr (X ∣ Y)

E [f_{x} ∣ f_{y}]

X ⊥ ⊥ Y

let φ

\dot{D} ep (f_{x}, f_{y})

cov (X, Y)

cov (X, Y)

X, Y \in [0, 1]

let φ

\dot{C} ov (f_{x}, f_{y})

θ

θ

ϕ

= ⌊ 255 \times θ ⌋

= ⌊ 255 \times θ^{m a x (0, \frac{π}{4} - ϕ) + 1} ⌋

= ⌊ 255 \times θ^{m a x (0, ϕ - \frac{π}{4}) + 1} ⌋

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\fancyrefaddcaptions

english \frefformatplainlst\freflstname\fancyrefdefaultspacing#1 \Frefformatplainlst\Freflstname\fancyrefdefaultspacing#1 \frefformatvariolst\freflstname\fancyrefdefaultspacing#1#3 \Frefformatvariolst\Freflstname\fancyrefdefaultspacing#1#3

Visualizations for Understanding SoC Behaviour

††thanks: This project is supported by the Engineering and Physical Sciences Research Council (EP/I028153/ and EP/L016656/1); the University of Bristol and UltraSoC Technologies Ltd. Supervised by Dr Jose Nunez-Yanez and Professor Kerstin Eder.

Dave McEwan

Centre for Doctoral Training in Communications

*University of Bristol

*Bristol, UK

[email protected]

Marcin Hlond

Director of System Engineering

*UltraSoC Technologies Ltd

*Bristol, UK

[email protected]

Dr Jose Nunez-Yanez

Dept. of Microelectronics

*University of Bristol

*Bristol, UK

[email protected]

Abstract

This paper introduces a novel method of analysis for System-on-Chip (SoC) development building upon commonly used tools and techniques to approximate and automate the human process of investigation. Knowledge of the interactions between components within a SoC is essential for understanding how a system works so the presented method provides a way of visualizing these interactions. The mathematical basis for the method is explained and justified, then the method is demonstrated using two representative case studies. Visualizations from the case studies are used to exhibit the usefulness of the method for system optimization, monitoring, and validation.

I Introduction

The SoC s comprising modern silicon products are often built with the result of lifetime’s of work by hundreds of engineers which makes it all but impossible for a single systems architect to have a complete understanding of every part in a design. This means that while at first glance a system may appear to be functioning, unforeseen behaviours may appear within the interactions between system components, possibly leading to undesirable behaviour such as reduced performance, increased energy usage, information leakage, or unexpected susceptiblity to faults. UltraSoC Technologies Ltd (UltraSoC) is a silicon IP supplier specializing in embedded analytics which uses highly configurable monitoring components to address these issues. Analysing so much data using traditional methods such as assertions is difficult and time consuming due to systemic complexity and dynamic behaviour. In this research we propose a visual tool and mathematical framework that can help to understand these behaviours and build upon the instrumentation technology developed by project partner UltraSoC. Two cases studies are used to illustrate the methodology: A synthesizable model of a simple SoC, and a complex SoC with lightweight software instrumentation enabled by the use of UltraSoC tools.

The main contributions of this paper are:

A mathematical framework for approximating the human process of investigation for binarized time series SoC data.
A novel visualization technique for behavioural relationships.

II Previous Work

An examination of currently available hardware and low-level software profiling methods is given by Lagraa[1] which covers well known techniques such as using counters to generate statistics about both hardware and software events – effectively a low cost data compression. Lagraa’s thesis is based on profiling SoC s created specifically on Xilinx MP-SoC devices, which although powerful, ensures it may not be applied to data from other sources, such as in post-silicon. Lo et al[2] described a system for describing behaviour with a series of statements using a search space exploration process based on boolean set theory. While this work has a similar goal of finding temporal dependencies it is acknowledged that the mining method does not perform adequately for the very long traces often found in real-world SoC data. Another limitaton here is our receptiveness to information in the form of long lists of statements versus a visual representation. Ivanovic et al[3] review time series analysis models and methods where characteristic features of economic time series are described such as high auto-dependence and inter-dependence, high correlation, non-stationarity, and drawn from noisy sources. SoC data is expected to have these same features, together with full binarization and much greater length. Explainability is a key requirement to understanding so related approaches such as the use of Neural Network (NN) s has been avoided at this stage although these may be useful for higher level analysis.

III Methodology

This methodology has been designed to approximate and mimic the process of an experienced SoC engineer trying to understand how waveforms are related to each other. It is assumed that the measurements are taken at discrete times $t$ , often referred to as a number of clock cycles, and that all values are binary, $f_{i}(t)\in\{0,1\}$ . Additionally it is assumed that data for every time is able to be recorded, or accurately inferred, which depends on the changes in measurement values to be sparse for storing data at a physically feasible rate.

The first approximation utilized is the application of a bell-shaped windowing function $w$ which is similar to how we focus attention on the centre of a time period $t\in[u,v)$ . A power-of-sine window windowing function $w$ is used to create a weighted average of each measurement giving the expected value.

[TABLE]

Bayes theorem in \Frefeq:bayes and the definition of independence in \Frefeq:independence to allow the conditional expectation and a measure of dependency, $\operatorname{\dot{D}ep}$ , to be calculated. In order to reduce the amount of information stored, a threshold is introduced which approximates the process of putting relationships in natural language form.

[TABLE]

$\operatorname{\dot{C}ov}$ is a measure of covariance as shown in \Frefeq:def_Cov. Here the use of a weighted average and a threshold function approximate the human processes of focusing attention and discarding pairwise correlations which are insignificantly small or negative.

[TABLE]

The measures $\operatorname{\dot{D}ep}$ , and $\operatorname{\dot{C}ov}$ are symmetric, i.e. $\operatorname{\dot{D}ep}(X,Y)=\operatorname{\dot{D}ep}(Y,X)$ , share the same codomain $[0,1]$ and operate in the same domain $[0,1]$ which allows their output to form new measurements for a meta-analysis.

Each measurement lends itself to implied relationships, e.g. “X high leads or …” or “X rising leads to …”. This methodology is focused on binary measurements so four implied measurements are considered:

Measurement $f(t)\in[0,1]$ . 2. 2.

Reflection, ${\neg}(t):=(1-f)$ . 3. 3.

Rising edge, ${\uparrow}(t):={\max}\big{(}0,f(t)-f(t-1)\big{)}$ . 4. 4.

Falling edge, ${\downarrow}(t):={\max}\big{(}0,\neg(t)-\neg(t-1)\big{)}$ .

Using these four implied measurements for each real one means that the use of thresholds to effectively discard negative dependencies and covariances does not miss significant relationships, but puts them into the more natural form e.g. “X low …” vs “X not high …”. The technique used to visualize some information about each measurement in a single time window combines these into a quad, as shown in \Freffig:own_net. This allows some important information to be gleaned from just the colour of the quad sections, allowing some usage even in a blurred, low-resolution or faraway view where the text is unclear.

A novel visualization has been developed in order to represent data points in the space $[0,1]^{2}$ which allows the viewer to quickly determine the rough location of a point from its colour. \Frefeq:cs0_theta thru \Frefeq:cs0_B show the mapping from $[0,1]^{2}$ to 8-bit Red/Green/Blue (RGB) values as depicted in \Freffig:cs0. This colourspace displays equally well on screen and printed paper and looks similar to people with protanopia, the most common form of colourblindness.

When looking for a pairwise relationship it is necessary to look slightly forward or backward in time to find a result such as “X is likely 5 cycles after Y”. The notation $f_{i_{\langle\delta\rangle}}(t):=f_{i}(t+\delta)$ has been used to represent the notion of measurement $i$ being shifted by $\delta$ cycles. A link between two measurements $f_{x}$ and $f_{y_{\langle\delta\rangle}}$ is said to be significant when both $\operatorname{\dot{D}ep}(f_{x},f_{y_{\langle\delta\rangle}})$ and $\operatorname{\dot{C}ov}(f_{x},f_{y_{\langle\delta\rangle}})$ are greater than zero. For a high level understanding, knowing that a significant link exists is more important than knowing the exact values of dependency and covariance, and a rough estimate may often be good enough. By arranging all measurements in a circle and drawing edges, again using the colourspace described above, a network of behavioural relationships is formed, as shown in the diagrams generated from case studies \Freffig:eg_probsys_net and \Freffig:eg_tinn_net. This arrangement shows behaviour as connections of a graph with node attributes providing a summary of the measured behaviour in an easily digestible manner, ultimately saving engineering time by automating a large part of the human analysis process.

[TABLE]

IV Case Study 1

The first experiment named probsys is based on a system consisting of a single Advanced/ARM eXtensible Interface (AXI) [4] master communicating with a single AXI slave. To give some familiar context three additional binary states are measured on the slave component (busy, stall, and idle). A simulation is run to produce a Value Change Dump (VCD) file containing measurment data. The rates at which transactions are made on the five AXI channels (AW, W, B, AR, R) are controlled with a probability distributon via fixed inputs. On the source side stall this means dropping *READY and on the destination side this means dropping *VALID. Observing the measurements in a waveform viewer in order to validate system behaviour is not trivial due to the density and format of information. Some expected behaviours may be expressed as formal properties and proven. E.g. Assertions may be used to check that busy and idle are never high at the same time. However, listing and forming all of these properties is time consuming and gives no hints to what behaviours a user might have forgotten to specify.

This visualization can be inspected interactively and intuitively to find more information at deeper levels such as the precise conditional probability, dependency, and covariance of each edge. Even at the static and low resolution of printed paper, much useful information is immediately available with its significance denoted by the darkness of the ink. The edges correctly identify the measurements which are expected to be related to each other, e.g. AXI read replies strongly related to AXI read requests. It can be quickly seen that the read channels are not interacting with the write channels, validating (or invalidating) expectations. This gives the system designer some confidence that the read and write parts of the system have not been accidentally linked through some unexpected mechanism. This example is analogous to the many use-cases where it is desired to visualize the cooperation between channels, non-cooperation between channels, or iteraction with a side-channel like busy.

V Case Study 2

A system which is more representative of the complex SoC s used today is required to demonstrate the method in a real-world application. The application chosen for this experiment is a software-based NN performing handwritten-digit recognition, running on a standard prototyping FPGA SoC [5] with two RISC-V processors and communicating with the outside world over USB. The C software of 17 functions running on 2 processors, is based on Tinn[6][7] and modified such that the smaller processor ACPU collects batches and passes them to the processor which has the full floating point unit SCPU.

Ease of instrumentation is essential for real world systems as non-standard modifications can alter observed behaviour is subtle but important ways. The only modifications required are an additional GCC flag in the compile stage (-finstrument-functions), and to optionally disable instrumentation on uninteresting functions[8]. Network diagrams were generated using the methodology described above, two of which are shown in \Freffig:eg_tinn_net. Since each of these diagrams corresponds to a single time window it is natural to view them in sequence like a movie. Using the Scalable Vector Graphics (SVG) image format also allows the diagrams to be browsed and examined interactively to extract more detail and exact values as desired. Nodes with darker colours in the centre bands indicate more time spent in those functions. E.g. SCPU spends most of its time it the train function and ACPU spends most of its time waiting during the training phase in \Freffig:eg_tinn_net_train.

VI Conclusion

Although the experiment in \Frefsec:probsys may be a proof of concept example the case of monitoring transactions on multiple communication channels to ensure they are (or are not) cooperating is not uncommon where traditional tools are often unable to provide useful visualizations. Similarly, the experiment in \Frefsec:tinn represents a realistic use case where an engineer is able to quickly, and with minimal manual effort, gain useful knowledge about the workings of the software, without looking at the source code. These case studies and the example results shown in the figures demonstrate that the methodology may be applied to a wide variety of situations in order to aid the understanding and validation of complex systems. Further work is being done with additional case studies which combine hardware probes such as those in the probsys experiment with software instrumentation such as that in the tinn experiment.

Bibliography8

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Lagraa, New MP-So C profiling tools based on data mining techniques . Ph D thesis, L’Université de Grenoble, 2014.
2[2] D. Lo, S.-C. Khoo, and C. Liu, “Mining past-time temporal rules from execution traces,” ACM Workshop On Dynamic Analysis , pp. 50–56, Jul 2008.
3[3] M. Ivanovic and V. Kurbalija, “Time series analysis and possible applications,” 39th International Convention on Information and Communication Technology, Electronics and Microelectronics , pp. 473–479, 2016.
4[4] ARM Limited, AMBA AXI and ACE Protocol Specification , 2011.
5[5] Ultra So C Technologies Ltd, UL-00231-TC-F-Taygete Prototype , Dec 2018.
6[6] G. Louw, “Tinn the tiny neural network library.” https://github.com/glouw/tinn.
7[7] Semeion Research Center of Sciences of Communication and Tattile, “Semeion handwritten digit data set.” https://archive.ics.uci.edu/ml/datasets/semeion+handwritten+digit.
8[8] Ultra So C Technologies Ltd, UL-001174-TR-3B-Static Instrumentation User Guide , Dec 2018.