A Domain Decomposition-Based CNN-DNN Architecture for Model Parallel Training Applied to Image Recognition Problems
Axel Klawonn, Martin Lanser, and Janine Weber

TL;DR
This paper introduces a novel CNN-DNN architecture inspired by domain decomposition methods, enabling parallel training of subnetworks on image data, which accelerates training and can improve classification accuracy.
Contribution
The paper proposes a new domain decomposition-based CNN-DNN architecture that supports model parallel training and enhances efficiency and accuracy in image recognition tasks.
Findings
Significantly reduces training time compared to traditional models.
Improves classification accuracy in various image recognition tasks.
Demonstrates effectiveness on 2D and 3D image datasets.
Abstract
Deep neural networks (DNNs) and, in particular, convolutional neural networks (CNNs) have brought significant advances in a wide range of modern computer application problems. However, the increasing availability of large amounts of datasets as well as the increasing available computational power of modern computers lead to a steady growth in the complexity and size of DNN and CNN models, respectively, and thus, to longer training times. Hence, various methods and attempts have been developed to accelerate and parallelize the training of complex network architectures. In this work, a novel CNN-DNN architecture is proposed that naturally supports a model parallel training strategy and that is loosely inspired by two-level domain decomposition methods (DDM). First, local CNN models, that is, subnetworks, are defined that operate on overlapping or nonoverlapping parts of the input data,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
