Improving Slow Transfer Predictions: Generative Methods Compared

Jacob Taegon Kim; Alex Sim; Kesheng Wu; Jinoh Kim

arXiv:2512.14522·cs.LG·December 17, 2025

Improving Slow Transfer Predictions: Generative Methods Compared

Jacob Taegon Kim, Alex Sim, Kesheng Wu, Jinoh Kim

PDF

TL;DR

This paper compares different data augmentation methods, including generative models, to improve machine learning predictions of slow data transfers in scientific networks, finding limited gains from advanced techniques.

Contribution

It provides a comparative analysis of augmentation strategies, highlighting that complex generative models like CTGAN do not significantly outperform simple sampling in addressing class imbalance.

Findings

01

Augmentation can improve prediction performance.

02

Increasing imbalance ratio yields limited gains.

03

CTGAN does not outperform stratified sampling.

Abstract

Monitoring data transfer performance is a crucial task in scientific computing networks. By predicting performance early in the communication phase, potentially sluggish transfers can be identified and selectively monitored, optimizing network usage and overall performance. A key bottleneck to improving the predictive power of machine learning (ML) models in this context is the issue of class imbalance. This project focuses on addressing the class imbalance problem to enhance the accuracy of performance predictions. In this study, we analyze and compare various augmentation strategies, including traditional oversampling methods and generative techniques. Additionally, we adjust the class imbalance ratios in training datasets to evaluate their impact on model performance. While augmentation may improve performance, as the imbalance ratio increases, the performance does not significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.