# SpliDT: Partitioned Decision Trees for Scalable Stateful Inference at Line Rate

**Authors:** Murayyiam Parvez, Annus Zulfiqar, Roman Beltiukov, Shir Landau Feibish, Walter Willinger, Arpit Gupta, Muhammad Shahbaz

arXiv: 2509.00397 · 2025-09-03

## TL;DR

SPLIDT introduces a partitioned decision tree system for programmable data planes that enables scalable, real-time inference by efficiently reusing resources and optimizing feature allocation, significantly improving accuracy and scalability.

## Contribution

It proposes a novel partitioned decision tree architecture with in-band control and a custom training framework, enabling scalable in-network inference with higher accuracy.

## Key findings

- Supports up to 5x more features than prior systems
- Maintains low time-to-detection comparable to existing methods
- Scales to millions of flows with minimal recirculation overhead

## Abstract

Machine learning (ML) is increasingly being deployed in programmable data planes (switches and SmartNICs) to enable real-time traffic analysis, security monitoring, and in-network decision-making. Decision trees (DTs) are particularly well-suited for these tasks due to their interpretability and compatibility with data-plane architectures, i.e., match-action tables (MATs). However, existing in-network DT implementations are constrained by the need to compute all input features upfront, forcing models to rely on a small, fixed set of features per flow. This significantly limits model accuracy and scalability under stringent hardware resource constraints.   We present SPLIDT, a system that rethinks DT deployment in the data plane by enabling partitioned inference over sliding windows of packets. SPLIDT introduces two key innovations: (1) it assigns distinct, variable feature sets to individual sub-trees of a DT, grouped into partitions, and (2) it leverages an in-band control channel (via recirculation) to reuse data-plane resources (both stateful registers and match keys) across partitions at line rate. These insights allow SPLIDT to scale the number of stateful features a model can use without exceeding hardware limits. To support this architecture, SPLIDT incorporates a custom training and design-space exploration (DSE) framework that jointly optimizes feature allocation, tree partitioning, and DT model depth. Evaluation across multiple real-world datasets shows that SPLIDT achieves higher accuracy while supporting up to 5x more stateful features than prior approaches (e.g., NetBeacon and Leo). It maintains the same low time-to-detection (TTD) as these systems, while scaling to millions of flows with minimal recirculation overhead (<0.05%).

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2509.00397/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/2509.00397/full.md

## References

85 references — full list in the complete paper: https://tomesphere.com/paper/2509.00397/full.md

---
Source: https://tomesphere.com/paper/2509.00397