# AlphaTwirl: A Python library for summarizing event data into   multivariate categorical data

**Authors:** Tai Sakuma

arXiv: 1905.06609 · 2019-09-20

## TL;DR

AlphaTwirl is a Python library that efficiently summarizes large event data into multivariate categorical data, facilitating analysis and visualization with flexible, customizable, and concurrent processing capabilities.

## Contribution

It introduces a novel Python library that generalizes histograms into multivariate categorical data, supporting flexible analysis, visualization, and parallel processing.

## Key findings

- Supports concurrent processing with multiple cores.
- Enables easy import into R and pandas for analysis.
- Provides customizable and extensible data summarization.

## Abstract

AlphaTwirl is a Python library that summarizes large event data into multivariate categorical data, which can be regarded as generalizations of histograms. The output can be imported as data frames in R and pandas. With their rich set of data wrangling tools, users can develop flexible and configurable analysis code. The multivariate categorical data loaded as data frames are readily visualized by graphic tools available in R and Python. AlphaTwirl can process event data concurrently with multiple cores or batch systems. Users can extend and customize nearly any functionality of AlphaTwirl with reusable code. AlphaTwirl is released under the BSD license.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.06609/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1905.06609/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/1905.06609/full.md

---
Source: https://tomesphere.com/paper/1905.06609