RETENTION: Resource-Efficient Tree-Based Ensemble Model Acceleration with Content-Addressable Memory

Yi-Chun Liao; Chieh-Lin Tsai; Yuan-Hao Chang; Cam\'elia Slimani; Jalil Boukhobza; and Tei-Wei Kuo

arXiv:2506.05994·cs.LG·February 5, 2026

RETENTION: Resource-Efficient Tree-Based Ensemble Model Acceleration with Content-Addressable Memory

Yi-Chun Liao, Chieh-Lin Tsai, Yuan-Hao Chang, Cam\'elia Slimani, Jalil Boukhobza, and Tei-Wei Kuo

PDF

Open Access

TL;DR

RETENTION introduces a resource-efficient framework that significantly reduces memory requirements for accelerating tree-based ensemble models using content-addressable memory, maintaining high accuracy.

Contribution

It proposes an iterative pruning algorithm and innovative data placement strategies to minimize CAM capacity for tree-based models, especially bagging-based ones.

Findings

01

CAM capacity reduced by up to 207x with less than 3% accuracy loss

02

Tree mapping scheme alone reduces CAM capacity by up to 21x

03

RETENTION achieves significant resource savings for tree model acceleration

Abstract

Although deep learning has demonstrated remarkable capability in learning from unstructured data, modern tree-based ensemble models remain superior in extracting relevant information and learning from structured datasets. While several efforts have been made to accelerate tree-based models, the inherent characteristics of the models pose significant challenges for conventional accelerators. Recent research leveraging content-addressable memory (CAM) offers a promising solution for accelerating tree-based models, yet existing designs suffer from excessive memory consumption and low utilization. This work addresses these challenges by introducing RETENTION, an end-to-end framework that significantly reduces CAM capacity requirement for tree-based model inference. We propose an iterative pruning algorithm with a novel pruning criterion tailored for bagging-based models (e.g., Random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Packet Processing and Optimization · Advanced Graph Neural Networks · Advanced Neural Network Applications

MethodsClass-activation map · Pruning