Data Complexity-aware Deep Model Performance Forecasting

Yen-Chia Chen; Hsing-Kuo Pao; Hanjuan Huang

arXiv:2601.01383·cs.LG·January 6, 2026

Open Access

TL;DR

This paper introduces a lightweight, two-stage framework for predicting deep model performance before training, leveraging dataset properties and model details to improve model selection and data quality assessment.

Contribution

The proposed method offers a generalizable, efficient approach to forecast model performance pre-training, aiding in architecture selection and dataset evaluation.

Findings

01

Dataset variance can guide model choice.

02

The framework predicts performance accurately across datasets.

03

It helps identify data issues before training begins.

Abstract

Deep learning models are widely used in computer vision and other domains. During model induction, selecting the right architecture for a given dataset often relies on repeated trial and error, which is time-consuming, resource-intensive, and difficult to automate. While previous work has explored performance prediction using partial training or complex simulations, these methods often incur significant computational overhead or lack generalizability. In this work, we propose an alternative approach: a lightweight, two-stage framework that estimates model performance before training, given an understanding of the dataset and of the deep model structures under consideration. The first stage predicts a baseline from measurable properties of the dataset, while the second stage adjusts the estimate with additional information on the…
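The two-stage idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract is truncated before it describes the second stage in full, so everything here is an assumption made for illustration. That includes the use of plain least-squares regressors, the choice to fit the second stage on the residual of the first, the `TwoStageForecaster` name, and the synthetic feature arrays.

```python
import numpy as np


def fit_linear(X, y):
    """Least-squares fit with an intercept; returns the coefficient vector."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict_linear(coef, X):
    """Apply coefficients from fit_linear to new rows."""
    A = np.column_stack([np.ones(len(X)), X])
    return A @ coef


class TwoStageForecaster:
    """Hypothetical two-stage estimator (illustrative, not from the paper).

    Stage 1 predicts a baseline score from dataset-level features only.
    Stage 2 predicts a correction from dataset and model features together.
    """

    def fit(self, dataset_feats, model_feats, score):
        # Stage 1: baseline from measurable dataset properties.
        self.stage1 = fit_linear(dataset_feats, score)
        baseline = predict_linear(self.stage1, dataset_feats)
        # Stage 2: fit the residual left over by the baseline.
        both = np.column_stack([dataset_feats, model_feats])
        self.stage2 = fit_linear(both, score - baseline)
        return self

    def baseline(self, dataset_feats):
        return predict_linear(self.stage1, dataset_feats)

    def predict(self, dataset_feats, model_feats):
        both = np.column_stack([dataset_feats, model_feats])
        return self.baseline(dataset_feats) + predict_linear(self.stage2, both)


# Synthetic data, made up purely to exercise the code.
rng = np.random.default_rng(0)
dataset_feats = rng.normal(size=(40, 2))   # e.g. variance-like statistics
model_feats = rng.normal(size=(40, 1))     # e.g. a depth or width descriptor
score = 0.7 - 0.1 * dataset_feats[:, 0] + 0.05 * model_feats[:, 0]

forecaster = TwoStageForecaster().fit(dataset_feats, model_feats, score)
baseline_err = np.abs(forecaster.baseline(dataset_feats) - score).mean()
final_err = np.abs(forecaster.predict(dataset_feats, model_feats) - score).mean()
```

On this noise-free synthetic target, the second stage removes the error that the dataset-only baseline leaves behind. Real performance data would be noisy and would call for held-out evaluation rather than the in-sample check used here.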

Taxonomy

Topics: Advanced Neural Network Applications · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning