Distributed data analytics
Richard Mortier, Hamed Haddadi, Sandra Servia, Liang Wang

TL;DR
Distributed data analytics enables privacy-preserving, energy-efficient, and low-latency machine learning by shifting computation from centralized cloud servers to edge devices, leveraging increased device capabilities.
Contribution
This paper discusses recent developments, benefits, and challenges of distributed data analytics in contrast to traditional centralized approaches.
Findings
Enhanced privacy through local data processing
Reduced data transfer improves energy efficiency
Lower latency in service interactions
Abstract
Machine Learning (ML) techniques have begun to dominate data analytics applications and services. Recommendation systems are a key component of online service providers. The financial industry has adopted ML to harness large volumes of data in areas such as fraud detection, risk-management, and compliance. Deep Learning is the technology behind voice-based personal assistants, etc. Deployment of ML technologies onto cloud computing infrastructures has benefited numerous aspects of our daily life. The advertising and associated online industries in particular have fuelled a rapid rise the in deployment of personal data collection and analytics tools. Traditionally, behavioural analytics relies on collecting vast amounts of data in centralised cloud infrastructure before using it to train machine learning models that allow user behaviour and preferences to be inferred. A contrasting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · IoT and Edge/Fog Computing · Mobile Crowdsensing and Crowdsourcing
Methodstravel james
