A Fast General-Purpose Clustering Algorithm Based on FPGAs for High-Throughput Data Processing
A. Annovi, M. Beretta

TL;DR
This paper introduces a fast, FPGA-based clustering algorithm designed for high-throughput data processing, capable of real-time operation in high-density environments like high-energy physics detectors.
Contribution
It presents a novel linear-scaling clustering algorithm that can be implemented in FPGAs, enabling real-time, high-throughput clustering across various scientific fields.
Findings
Linear processing time scales with data size
Suitable for high-density, high-luminosity data environments
Applicable to multiple scientific and medical imaging fields
Abstract
We present a fast general-purpose algorithm for high-throughput clustering of data "with a two dimensional organization". The algorithm is designed to be implemented with FPGAs or custom electronics. The key feature is a processing time that scales linearly with the amount of data to be processed. This means that clustering can be performed in pipeline with the readout, without suffering from combinatorial delays due to looping multiple times through all the data. This feature makes this algorithm especially well suited for problems where the data has high density, e.g. in the case of tracking devices working under high-luminosity condition such as those of LHC or Super-LHC. The algorithm is organized in two steps: the first step (core) clusters the data; the second step analyzes each cluster of data to extract the desired information. The current algorithm is developed as a clustering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
