Medusa: An Efficient Cloud Fault-Tolerant MapReduce
Pedro A.R.S. Costa, Xiao Bai, Fernando M.V. Ramos, Miguel Correia

TL;DR
Medusa is a platform that extends MapReduce to multi-cloud environments, providing robust fault tolerance against various failures, including malicious attacks, without modifying existing frameworks or applications.
Contribution
It introduces a fault-tolerant multi-cloud MapReduce system that is transparent, compatible with Hadoop, and capable of handling arbitrary and malicious faults efficiently.
Findings
Significantly reduces execution time compared to traditional fault-tolerance methods.
Successfully tolerates arbitrary faults, cloud outages, and malicious insider attacks.
Operates transparently without requiring modifications to user applications or Hadoop.
Abstract
Applications such as web search and social networking have been moving from centralized to decentralized cloud architectures to improve their scalability. MapReduce, a programming framework for processing large amounts of data using thousands of machines in a single cloud, also needs to be scaled out to multiple clouds to adapt to this evolution. The challenge of building a multi-cloud distributed architecture is substantial. Notwithstanding, the ability to deal with the new types of faults introduced by such setting, such as the outage of a whole datacenter or an arbitrary fault caused by a malicious cloud insider, increases the endeavor considerably. In this paper we propose Medusa, a platform that allows MapReduce computations to scale out to multiple clouds and tolerate several types of faults. Our solution fulfills four objectives. First, it is transparent to the user, who writes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
