# RepACNet: A Lightweight Reparameterized Asymmetric Convolution Network for Monocular Depth Estimation

**Authors:** Wanting Jiang, Jun Li, Yaoqian Niu, Hao Chen, Shuang Peng

PMC · DOI: 10.3390/s26041199 · 2026-02-12

## TL;DR

RepACNet is a lightweight neural network for estimating depth from a single image, designed to work efficiently on mobile devices without sacrificing accuracy.

## Contribution

RepACNet introduces reparameterized asymmetric convolution designs and integrates MLP-Mixer components for efficient monocular depth estimation.

## Key findings

- RepACNet achieves competitive performance on NYU Depth v2 and KITTI Eigen benchmarks.
- The model maintains significantly fewer parameters than state-of-the-art methods.
- RepTMAC and SECDC components enable efficient global feature interaction and multi-scale depth feature capture.

## Abstract

Monocular depth estimation (MDE) is a cornerstone task in 2D/3D scene reconstruction and recognition with widespread applications in autonomous driving, robotics, and augmented reality. However, existing state-of-the-art methods face a fundamental trade-off between computational efficiency and estimation accuracy, limiting their deployment in resource-constrained real-world scenarios. It is of high interest to design lightweight but effective models to enable potential deployment on resource-constrained mobile devices. To address this problem, we present RepACNet, a novel lightweight network that addresses this challenge through reparameterized asymmetric convolution designs and CNN-based architecture that integrates MLP-Mixer components. First, we propose Reparameterized Token Mixer with Asymmetric Convolution (RepTMAC), an efficient block that captures long-range dependencies while maintaining linear computational complexity. Unlike Transformer-based methods, our approach achieves global feature interaction with tiny overhead. Second, we introduce Squeeze-and-Excitation Consecutive Dilated Convolutions (SECDCs), which integrates adaptive channel attention with dilated convolutions to capture depth-specific features across multiple scales. We validate the effectiveness of our approach through extensive experiments on two widely recognized benchmarks, NYU Depth v2 and KITTI Eigen. The experimental results demonstrate that our model achieves competitive performance while maintaining significantly fewer parameters compared to state-of-the-art models.

## Full-text entities

- **Diseases:** SECDC block (MESH:D002311), injury to (MESH:D014947), MDE (MESH:D007222)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944362/full.md

---
Source: https://tomesphere.com/paper/PMC12944362