A class of kernel-based scalable algorithms for data science
Philippe G. LeFloch, Jean-Marc Mercier, and Shohruh Miryusupov

TL;DR
This paper introduces scalable kernel-based algorithms using a divide-and-conquer approach for large datasets, enhancing efficiency in data science tasks like simulation, distribution generation, and AI applications.
Contribution
It presents a simple, robust methodology that scales kernel algorithms for large datasets, applicable to various data science and AI tasks, with performance-based algorithm selection.
Findings
Algorithms efficiently handle large-scale data
Applicable to supervised, unsupervised, and reinforcement learning
Improves scalability of kernel methods in industrial applications
Abstract
We present several generative and predictive algorithms based on the RKHS (reproducing kernel Hilbert spaces) methodology, which, most importantly, are scale up efficiently with large datasets or high-dimensional data. It is well recognized that the RKHS methodology leads one to efficient and robust algorithms for numerous tasks in data science, statistics, and scientific computation. However, the implementations existing the literature are often difficult to scale up for encompassing large datasets. In this paper, we introduce a simple and robust, divide-and-conquer methodology. It applies to large scale datasets and relies on several kernel-based algorithms, which distinguish between various extrapolation, interpolation, and optimal transport steps. We argue how to select the suitable algorithm in specific applications thanks to a feedback of performance criteria. Our primary focus is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Data Classification · Advanced Clustering Algorithms Research
