Incentivizing Permissionless Distributed Learning of LLMs

Joel Lidin; Amir Sarfi; Evangelos Pappas; Samuel Dare; Eugene Belilovsky; Jacob Steeves

arXiv:2505.21684·cs.LG·May 29, 2025

Incentivizing Permissionless Distributed Learning of LLMs

Joel Lidin, Amir Sarfi, Evangelos Pappas, Samuel Dare, Eugene Belilovsky, Jacob Steeves

PDF

Open Access 1 Models

TL;DR

This paper presents Gauntlet, an incentive system deployed on the bittensor blockchain that enables permissionless distributed training of large language models, rewarding peer contributions and ensuring fair participation.

Contribution

The paper introduces Gauntlet, a novel incentive mechanism for permissionless distributed deep learning of LLMs, including a live deployment training a 1.2B model.

Findings

01

Successfully trained a 1.2B parameter LLM using permissionless pseudo-gradient contributions.

02

Implemented a real-world incentive system that rewards participants with tokens based on contribution value.

03

Demonstrated the effectiveness of the filtering and evaluation mechanisms in a live blockchain environment.

Abstract

We describe an incentive system for distributed deep learning of foundational models where peers are rewarded for contributions. The incentive system, \textit{Gauntlet}, has been deployed on the bittensor blockchain and used to train a 1.2B LLM with completely permissionless contributions of pseudo-gradients: no control over the users that can register or their hardware. \textit{Gauntlet} can be applied to any synchronous distributed training scheme that relies on aggregating updates or pseudo-gradients. We rely on a two-stage mechanism for fast filtering of peer uptime, reliability, and synchronization, combined with the core component that estimates the loss before and after individual pseudo-gradient contributions. We utilized an OpenSkill rating system to track competitiveness of pseudo-gradient scores across time. Finally, we introduce a novel mechanism to ensure peers on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
1Covenant/TEMPLAR-I
model· 4 dl· ♡ 4
4 dl♡ 4

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Advanced Graph Neural Networks · Adversarial Robustness in Machine Learning