Reinforcement Learning in BitTorrent Systems
Rafit Izhak-Ratzin, Hyunggon Park, Mihaela van der Schaar

TL;DR
This paper introduces a reinforcement learning-based peer selection mechanism for BitTorrent that enhances fairness, discourages free-riding, and improves system stability through strategic, long-term decision-making.
Contribution
It proposes a novel RL-based peer selection protocol integrated into BitTorrent, modeling peer interactions as a repeated game to optimize long-term utility and fairness.
Findings
Peers using RL improve download times by up to 33%
System stability increases with 57% reduction in peer selection fluctuations
Free-riding decreases by 64% with the new protocol
Abstract
Recent research efforts have shown that the popular BitTorrent protocol does not provide fair resource reciprocation and may allow free-riding. In this paper, we propose a BitTorrent-like protocol that replaces the peer selection mechanisms in the regular BitTorrent protocol with a novel reinforcement learning (RL) based mechanism. Due to the inherent opration of P2P systems, which involves repeated interactions among peers over a long period of time, the peers can efficiently identify free-riders as well as desirable collaborators by learning the behavior of their associated peers. Thus, it can help peers improve their download rates and discourage free-riding, while improving fairness in the system. We model the peers' interactions in the BitTorrent-like network as a repeated interaction game, where we explicitly consider the strategic behavior of the peers. A peer, which applies the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPeer-to-Peer Network Technologies · Caching and Content Delivery · Access Control and Trust
