Astronomy in the Cloud: Using MapReduce for Image Coaddition
Keith Wiley, Andrew Connolly, Jeff Gardner, Simon Krughof, Magdalena, Balazinska, Bill Howe, YongChul Kwon, YingYi Bu

TL;DR
This paper explores using Hadoop's MapReduce framework to efficiently perform image coaddition on large-scale astronomical data, enabling scalable processing for upcoming sky surveys.
Contribution
It demonstrates how to adapt image coaddition to MapReduce, optimizing performance for processing multi-terabyte astronomical datasets in cloud environments.
Findings
Achieved scalable image coaddition using Hadoop on SDSS data
Optimized MapReduce implementation for better performance
Validated approach with experimental results on large datasets
Abstract
In the coming decade, astronomical surveys of the sky will generate tens of terabytes of images and detect hundreds of millions of sources every night. The study of these sources will involve computation challenges such as anomaly detection and classification, and moving object tracking. Since such studies benefit from the highest quality data, methods such as image coaddition (stacking) will be a critical preprocessing step prior to scientific investigation. With a requirement that these images be analyzed on a nightly basis to identify moving sources or transient objects, these data streams present many computational challenges. Given the quantity of data involved, the computational load of these problems can only be addressed by distributing the workload over a large number of nodes. However, the high data throughput demanded by these applications may present scalability challenges…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
