EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training

Hasibul Jamil; MD S Q Zulkar Nine; Tevfik Kosar

arXiv:2508.11035·cs.DC·August 18, 2025

EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training

Hasibul Jamil, MD S Q Zulkar Nine, Tevfik Kosar

PDF

TL;DR

EMLIO is a scalable I/O system that significantly reduces latency and energy consumption for large-scale AI training by optimizing data loading over various network environments.

Contribution

We introduce EMLIO, a novel I/O service that minimizes both latency and energy use in distributed AI workloads, addressing a critical gap in current data-loading systems.

Findings

01

Up to 8.6X faster I/O performance

02

10.9X reduction in energy consumption

03

Consistent performance across local, LAN, and WAN environments

Abstract

Large-scale deep learning workloads increasingly suffer from I/O bottlenecks as datasets grow beyond local storage capacities and GPU compute outpaces network and disk latencies. While recent systems optimize data-loading time, they overlook the energy cost of I/O - a critical factor at large scale. We introduce EMLIO, an Efficient Machine Learning I/O service that jointly minimizes end-to-end data-loading latency T and I/O energy consumption E across variable-latency networked storage. EMLIO deploys a lightweight data-serving daemon on storage nodes that serializes and batches raw samples, streams them over TCP with out-of-order prefetching, and integrates seamlessly with GPU-accelerated (NVIDIA DALI) preprocessing on the client side. In exhaustive evaluations over local disk, LAN (0.05 ms & 10 ms RTT), and WAN (30 ms RTT) environments, EMLIO delivers up to 8.6X faster I/O and 10.9X…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.