Multi-Resolution Model Fusion for Accelerating the Convolutional Neural Network Training

Kewei Wang; Claire Songhyun Lee; Sunwoo Lee; Vishu Gupta; Jan Balewski; Alex Sim; Peter Nugent; Ankit Agrawal; Alok Choudhary; Kesheng Wu; Wei-keng Liao

arXiv:2510.25170·cs.DC·October 30, 2025

Multi-Resolution Model Fusion for Accelerating the Convolutional Neural Network Training

Kewei Wang, Claire Songhyun Lee, Sunwoo Lee, Vishu Gupta, Jan Balewski, Alex Sim, Peter Nugent, Ankit Agrawal, Alok Choudhary, Kesheng Wu, Wei-keng Liao

PDF

TL;DR

This paper introduces a multi-resolution model fusion method that accelerates CNN training by combining models trained on lower resolutions with high-resolution finetuning, significantly reducing training time without sacrificing accuracy.

Contribution

The paper proposes a novel multi-resolution fusion approach that speeds up CNN training by leveraging quick-to-generate lower-resolution models before final high-resolution refinement.

Findings

01

Training time reduced by up to 47% and 44% in two scientific applications.

02

Model accuracy remains unchanged despite reduced training time.

03

Method is effective for large high-dimensional data in scientific research.

Abstract

Neural networks are rapidly gaining popularity in scientific research, but training the models is often very time-consuming. Particularly when the training data samples are large high-dimensional arrays, efficient training methodologies that can reduce the computational costs are crucial. To reduce the training cost, we propose a Multi-Resolution Model Fusion (MRMF) method that combines models trained on reduced-resolution data and then refined with data in the original resolution. We demonstrate that these reduced-resolution models and datasets could be generated quickly. More importantly, the proposed approach reduces the training time by speeding up the model convergence in each fusion stage before switching to the final stage of finetuning with data in its original resolution. This strategy ensures the final model retains high-resolution insights while benefiting from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.