SCPL: Enhancing Neural Network Training Throughput with Decoupled Local Losses and Model Parallelism

Ming-Yao Ho; Cheng-Kai Wang; You-Teng Lin; Hung-Hsuan Chen

arXiv:2602.00062·cs.LG·February 4, 2026

SCPL: Enhancing Neural Network Training Throughput with Decoupled Local Losses and Model Parallelism

Ming-Yao Ho, Cheng-Kai Wang, You-Teng Lin, Hung-Hsuan Chen

PDF

Open Access

TL;DR

SCPL introduces a novel training approach that decouples backpropagation, enabling parallel gradient computation across layers, significantly improving training throughput for large neural networks.

Contribution

The paper proposes Supervised Contrastive Parallel Learning (SCPL), a new method that decouples backpropagation to enhance model parallelism and training efficiency.

Findings

01

SCPL outperforms BP, Early Exit, GPipe, and AL in training efficiency.

02

SCPL reduces training time for large models.

03

Experimental results validate SCPL's effectiveness.

Abstract

Adopting large-scale AI models in enterprise information systems is often hindered by high training costs and long development cycles, posing a significant managerial challenge. The standard end-to-end backpropagation (BP) algorithm is a primary driver of modern AI, but it is also the source of inefficiency in training deep networks. This paper introduces a new training methodology, Supervised Contrastive Parallel Learning (SCPL), that addresses this issue by decoupling BP and transforming a long gradient flow into multiple short ones. This design enables the simultaneous computation of parameter gradients in different layers, achieving superior model parallelism and enhancing training throughput. Detailed experiments are presented to demonstrate the efficiency and effectiveness of our model compared to BP, Early Exit, GPipe, and Associated Learning (AL), a state-of-the-art method for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · IoT and Edge/Fog Computing · Software-Defined Networks and 5G