The MeLa BitChute Dataset
Milo Trujillo, Maur\'icio Gruppi, Cody Buntain, Benjamin D. Horne

TL;DR
This paper introduces the MeLa-BitChute dataset, a comprehensive collection of over 3 million videos and metadata from BitChute, enabling research on alternative social video platforms.
Contribution
The paper provides a large-scale, near-complete dataset of BitChute videos and metadata, facilitating analysis of content and user behavior on alternative video hosting sites.
Findings
Dataset includes over 3 million videos from 61,000 channels.
Metadata encompasses comments, descriptions, and view counts.
Dataset available for public research at Harvard Dataverse.
Abstract
In this paper we present a near-complete dataset of over 3M videos from 61K channels over 2.5 years (June 2019 to December 2021) from the social video hosting platform BitChute, a commonly used alternative to YouTube. Additionally, we include a variety of video-level metadata, including comments, channel descriptions, and views for each video. The MeLa-BitChute dataset can be found at: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/KRD1VS.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
