# Net2Vis -- A Visual Grammar for Automatically Generating   Publication-Tailored CNN Architecture Visualizations

**Authors:** Alex B\"auerle, Christian van Onzenoodt, and Timo Ropinski

arXiv: 1902.04394 · 2021-02-11

## TL;DR

This paper introduces Net2Vis, a visual grammar and automated tool for generating publication-ready CNN architecture visualizations from Keras models, reducing time and improving clarity in scientific publications.

## Contribution

The paper presents a novel visual grammar for CNNs derived from real publication figures and an automated system to generate standardized visualizations from Keras models.

## Key findings

- Reduces time to create CNN visualizations for publications.
- Provides a unified, unambiguous visualization style.
- Validated through expert feedback and quantitative study.

## Abstract

To convey neural network architectures in publications, appropriate visualizations are of great importance. While most current deep learning papers contain such visualizations, these are usually handcrafted just before publication, which results in a lack of a common visual grammar, significant time investment, errors, and ambiguities. Current automatic network visualization tools focus on debugging the network itself and are not ideal for generating publication visualizations. Therefore, we present an approach to automate this process by translating network architectures specified in Keras into visualizations that can directly be embedded into any publication. To do so, we propose a visual grammar for convolutional neural networks (CNNs), which has been derived from an analysis of such figures extracted from all ICCV and CVPR papers published between 2013 and 2019. The proposed grammar incorporates visual encoding, network layout, layer aggregation, and legend generation. We have further realized our approach in an online system available to the community, which we have evaluated through expert feedback, and a quantitative study. It not only reduces the time needed to generate network visualizations for publications, but also enables a unified and unambiguous visualization design.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.04394/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1902.04394/full.md

## References

65 references — full list in the complete paper: https://tomesphere.com/paper/1902.04394/full.md

---
Source: https://tomesphere.com/paper/1902.04394