# On Linear Learning with Manycore Processors

**Authors:** Eliza Wszola, Celestine Mendler-D\"unner, Martin Jaggi, Markus, P\"uschel

arXiv: 1905.00626 · 2021-10-29

## TL;DR

This paper introduces Heterogeneous Tasks on Homogeneous Cores (HTHC), a novel parallelization approach for efficiently training generalized linear models on manycore processors, achieving significant performance improvements.

## Contribution

It proposes a new parallelism scheme tailored for manycore architectures and provides an architecture-aware implementation that outperforms existing methods.

## Key findings

- Achieves up to 10x faster Lasso training on dense data
- Supports dense, sparse, and quantized datasets efficiently
- Outperforms prior software and baseline implementations

## Abstract

A new generation of manycore processors is on the rise that offers dozens and more cores on a chip and, in a sense, fuses host processor and accelerator. In this paper we target the efficient training of generalized linear models on these machines. We propose a novel approach for achieving parallelism which we call Heterogeneous Tasks on Homogeneous Cores (HTHC). It divides the problem into multiple fundamentally different tasks, which themselves are parallelized. For evaluation, we design a detailed, architecture-cognizant implementation of our scheme on a recent 72-core Knights Landing processor that is adaptive to the cache, memory, and core structure. Our library efficiently supports dense and sparse datasets as well as 4-bit quantized data for further possible gains in performance. We show benchmarks for Lasso and SVM with different data sets against straightforward parallel implementations and prior software. In particular, for Lasso on dense data, we improve the state-of-the-art by an order of magnitude.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.00626/full.md

## Figures

38 figures with captions in the complete paper: https://tomesphere.com/paper/1905.00626/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1905.00626/full.md

---
Source: https://tomesphere.com/paper/1905.00626