Database Operations in D4M.jl
Lauren Milechin, Vijay Gadepally, Jeremy Kepner

TL;DR
This paper introduces D4M.jl, a Julia implementation of the D4M framework, extending database capabilities for data analytics and demonstrating comparable or improved performance over the MATLAB version.
Contribution
The paper develops and presents a Julia version of D4M with enhanced database functionalities, matching or surpassing MATLAB performance.
Findings
D4M.jl supports extensive database operations.
Performance benchmarks show D4M.jl is comparable or better than MATLAB D4M.
The implementation facilitates data analytics in Julia with efficient database access.
Abstract
Each step in the data analytics pipeline is important, including database ingest and query. The D4M-Accumulo database connector has allowed analysts to quickly and easily ingest to and query from Apache Accumulo using MATLAB(R)/GNU Octave syntax. D4M.jl, a Julia implementation of D4M, provides much of the functionality of the original D4M implementation to the Julia community. In this work, we extend D4M.jl to include many of the same database capabilities that the MATLAB(R)/GNU Octave implementation provides. Here we will describe the D4M.jl database connector, demonstrate how it can be used, and show that it has comparable or better performance to the original implementation in MATLAB(R)/GNU Octave.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
