TL;DR
TreeLUT introduces an FPGA-based implementation of gradient boosted decision trees that achieves high efficiency, low latency, and improved hardware utilization, serving as a compelling alternative to deep neural networks for inference acceleration.
Contribution
It presents TreeLUT, an open-source FPGA tool that efficiently implements GBDTs with a novel quantization scheme and pipelined architecture, outperforming existing DNN and GBDT methods.
Findings
Significantly improves hardware utilization, latency, and throughput.
Achieves competitive accuracy with existing methods.
Demonstrates effectiveness on multiple classification datasets.
Abstract
Accelerating machine learning inference has been an active research area in recent years. In this context, field-programmable gate arrays (FPGAs) have demonstrated compelling performance by providing massive parallelism in deep neural networks (DNNs). Neural networks (NNs) are computationally intensive during inference, as they require massive amounts of multiplication and addition, which makes their implementations costly. Numerous studies have recently addressed this challenge to some extent using a combination of sparsity induction, quantization, and transformation of neurons or sub-networks into lookup tables (LUTs) on FPGAs. Gradient boosted decision trees (GBDTs) are a high-accuracy alternative to DNNs in a wide range of regression and classification tasks, particularly for tabular datasets. The basic building block of GBDTs is a decision tree, which resembles the structure of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
