A Domain Decomposition-Based CNN-DNN Architecture for Model Parallel   Training Applied to Image Recognition Problems

Axel Klawonn; Martin Lanser; and Janine Weber

arXiv:2302.06564·cs.LG·July 3, 2024

A Domain Decomposition-Based CNN-DNN Architecture for Model Parallel Training Applied to Image Recognition Problems

Axel Klawonn, Martin Lanser, and Janine Weber

PDF

Open Access

TL;DR

This paper introduces a novel CNN-DNN architecture inspired by domain decomposition methods, enabling parallel training of subnetworks on image data, which accelerates training and can improve classification accuracy.

Contribution

The paper proposes a new domain decomposition-based CNN-DNN architecture that supports model parallel training and enhances efficiency and accuracy in image recognition tasks.

Findings

01

Significantly reduces training time compared to traditional models.

02

Improves classification accuracy in various image recognition tasks.

03

Demonstrates effectiveness on 2D and 3D image datasets.

Abstract

Deep neural networks (DNNs) and, in particular, convolutional neural networks (CNNs) have brought significant advances in a wide range of modern computer application problems. However, the increasing availability of large amounts of datasets as well as the increasing available computational power of modern computers lead to a steady growth in the complexity and size of DNN and CNN models, respectively, and thus, to longer training times. Hence, various methods and attempts have been developed to accelerate and parallelize the training of complex network architectures. In this work, a novel CNN-DNN architecture is proposed that naturally supports a model parallel training strategy and that is loosely inspired by two-level domain decomposition methods (DDM). First, local CNN models, that is, subnetworks, are defined that operate on overlapping or nonoverlapping parts of the input data,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications