PreNeT: Leveraging Computational Features to Predict Deep Neural Network   Training Time

Alireza Pourali; Arian Boukani; Hamzeh Khazaei

arXiv:2412.15519·cs.LG·December 30, 2024

PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time

Alireza Pourali, Arian Boukani, Hamzeh Khazaei

PDF

Open Access 1 Repo

TL;DR

PreNeT is a predictive framework that accurately estimates deep neural network training times across different hardware setups by analyzing computational features, helping optimize training configurations and reduce costs.

Contribution

It introduces PreNeT, a novel method integrating detailed computational metrics to improve training time prediction accuracy for diverse hardware architectures.

Findings

01

Achieves up to 72% improvement in prediction accuracy

02

Effectively predicts training duration on new hardware architectures

03

Enhances existing prediction methods with detailed layer analysis

Abstract

Training deep learning models, particularly Transformer-based architectures such as Large Language Models (LLMs), demands substantial computational resources and extended training periods. While optimal configuration and infrastructure selection can significantly reduce associated costs, this optimization requires preliminary analysis tools. This paper introduces PreNeT, a novel predictive framework designed to address this optimization challenge. PreNeT facilitates training optimization by integrating comprehensive computational metrics, including layer-specific parameters, arithmetic operations and memory utilization. A key feature of PreNeT is its capacity to accurately predict training duration on previously unexamined hardware infrastructures, including novel accelerator architectures. This framework employs a sophisticated approach to capture and analyze the distinct…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pacslab/prenet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Neural Networks and Applications