# Speeding HEP Analysis with ROOT Bulk I/O

**Authors:** Brian Bockelman (1), Zhe Zhang (2), Oksana Shadura (2) ((1) Morgridge, Institute for Research, (2) University Nebraska-Lincoln)

arXiv: 1906.08169 · 2020-08-26

## TL;DR

This paper introduces a bulk I/O interface for ROOT that significantly boosts data reading performance in HEP analysis workflows by enabling multiple events to be processed simultaneously, integrated into RDataFrame for automatic optimization.

## Contribution

The paper presents a new bulk I/O interface for ROOT, integrated into RDataFrame, that automatically switches to high-performance data reading modes based on data type, improving event rates.

## Key findings

- Order-of-magnitude speedups in microbenchmarks
- Effective automatic interface switching in RDataFrame
- Enhanced event processing for analysis formats like NanoAOD

## Abstract

Distinct HEP workflows have distinct I/O needs; while ROOT I/O excels at serializing complex C++ objects common to reconstruction, analysis workflows typically have simpler objects and can sustain higher event rates. To meet these workflows, we have developed a "bulk I/O" interface, allowing multiple events data to be returned per library call. This reduces ROOT-related overheads and increases event rates - orders-of-magnitude improvements are shown in microbenchmarks. Unfortunately, this bulk interface is difficult to use as it requires users to identify when it is applicable and they still "think" in terms of events, not arrays of data. We have integrated the bulk I/O interface into the new RDataFrame analysis framework inside ROOT. As RDataFrame's interface can provide improved type information, the framework itself can determine what data is readable via the bulk IO and automatically switch between interfaces. We demonstrate how this can improve event rates when reading analysis data formats, such as CMS's NanoAOD.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.08169/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1906.08169/full.md

## References

4 references — full list in the complete paper: https://tomesphere.com/paper/1906.08169/full.md

---
Source: https://tomesphere.com/paper/1906.08169