Data Jamboree: A Party of Open-Source Software Solving Real-World Data Science Problems
Lucy D'Agostino McGowan, Shannon Tass, Sam Tyner, HaiYing Wang, Jun, Yan

TL;DR
The paper introduces the Data Jamboree, an event that combines open-source tools and collaborative learning to enhance practical data science skills across diverse participants.
Contribution
It presents a novel format for data science education that integrates computational methods with traditional techniques through live, hands-on workshops.
Findings
Julia outperforms in computational efficiency
Python offers high versatility for various tasks
R excels in statistical analysis and visualization
Abstract
The evolving focus in statistics and data science education highlights the growing importance of computing. This paper presents the Data Jamboree, a live event that combines computational methods with traditional statistical techniques to address real-world data science problems. Participants, ranging from novices to experienced users, followed workshop leaders in using open-source tools like Julia, Python, and R to perform tasks such as data cleaning, manipulation, and predictive modeling. The Jamboree showcased the educational benefits of working with open data, providing participants with practical, hands-on experience. We compared the tools in terms of efficiency, flexibility, and statistical power, with Julia excelling in performance, Python in versatility, and R in statistical analysis and visualization. The paper concludes with recommendations for designing similar events to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
