# Loklak - A Distributed Crawler and Data Harvester for Overcoming Rate   Limits

**Authors:** Sudheesh Singanamalla, Michael Peter Christen

arXiv: 1704.03624 · 2017-04-13

## TL;DR

Loklak is an open source distributed crawler designed to collect social media data from platforms like Twitter and Weibo, overcoming rate limits and authentication barriers to support research.

## Contribution

It introduces a peer-to-peer distributed crawling system that overcomes social network rate limits and authentication barriers for continuous data collection.

## Key findings

- Enables continuous data collection from social networks.
- Overcomes rate limits and authentication barriers.
- Provides an open data repository for research.

## Abstract

Modern social networks have become sources for vast quantities of data. Having access to such big data can be very useful for various researchers and data scientists. In this paper we describe Loklak, an open source distributed peer to peer crawler and scraper for supporting such research on platforms like Twitter, Weibo and other social networks. Social networks such as Twitter and Weibo pose various limitations to the user on the rate at which one could freely collect such data for research. Our crawler enables researchers to continuously collect data while overcoming the barriers of authentication and rate limits imposed to provide a repository of open data as a service.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.03624/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1704.03624/full.md

---
Source: https://tomesphere.com/paper/1704.03624