# MallaNet residual branch merge convolutional neural network with homogeneous filter capsules for Devanagari character recognition

**Authors:** Sahaj Raj Malla

PMC · DOI: 10.1038/s41598-025-30871-z · Scientific Reports · 2025-12-03

## TL;DR

This paper introduces MallaNet, a deep learning model that improves Devanagari character recognition with high accuracy and fewer resources.

## Contribution

The novel MallaNet model combines residual blocks and homogeneous filter capsules for efficient and accurate Devanagari character recognition.

## Key findings

- MallaNet achieves 99.71% test accuracy on the DHCD dataset with 92,000 images.
- The model uses 56.41% fewer parameters than previous models while maintaining high performance.
- It outperforms existing benchmarks in Devanagari character recognition.

## Abstract

The Devanagari script’s complex character set and handwriting variability pose significant challenges for handwritten character recognition (HCR). This study aims to develop a robust deep learning model, MallaNet, to achieve high accuracy in recognizing Devanagari characters by leveraging multiscale feature extraction and preserving spatial hierarchies. We introduce the Residual Enhanced Branching and Merging Convolutional Neural Network with Homogeneous Filter Capsules (MallaNet), an optimized deep learning model designed to address these complexities. Extending the Branching and Merging Convolutional Network with Homogeneous Vector Capsules (BMCNNwHVCs), our model integrates optimized residual blocks, refined Homogeneous Filter Capsule (HFC) layers, and a merging layer to capture multiscale features and preserve spatial hierarchies, critical for distinguishing visually similar characters. Trained in the Devanagari Handwritten Character Dataset (DHCD), comprising 92,000 images across 46 classes, MallaNet achieves a test accuracy of 99.71%, macro-average F1-score of 99.71%, closely approaching the highest reported accuracy of 99.72% while utilizing 56.41% fewer parameters (17M vs. 39M) and surpassing previous benchmarks of 98.47% and 99.16%, enhancing optical character recognition (OCR) for regional scripts and supporting document digitization and cultural preservation with improved efficiency.

## Full-text entities

- **Diseases:** stroke (MESH:D020521), HCR (MESH:D020238)
- **Chemicals:** DHCD (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12789527/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12789527/full.md

## References

13 references — full list in the complete paper: https://tomesphere.com/paper/PMC12789527/full.md

---
Source: https://tomesphere.com/paper/PMC12789527