# ResNet-18 based multi-task visual inference and adaptive control for an edge-deployed autonomous robot

**Authors:** Sufola Das Chagas Silva Araujo, Goh Kah Ong Michael, Uttam U. Deshpande, Sudhindra Deshpande, Manjunath G. Avalappa, Yash Amasi, Sumit Patil, Swathi Bhat, Sudarshan Karigoudar

PMC · DOI: 10.3389/frobt.2025.1680285 · Frontiers in Robotics and AI · 2025-11-04

## TL;DR

A low-cost autonomous robot using edge computing and a modified ResNet-18 model performs multiple tasks like object recognition and path tracking for logistics in SMEs.

## Contribution

A novel autonomous logistics robot integrating multi-task visual inference and adaptive control on an edge platform for SME automation.

## Key findings

- The robot achieved 92% path tracking accuracy and 88% obstacle avoidance success in warehouse settings.
- It successfully handled objects with 90% success rate and maintained operation for 3 hours on a single charge.
- The system operates with a maximum perception-to-action latency of 150 ms.

## Abstract

Current industrial robots deployed in small and medium-sized businesses (SMEs) are too complex, expensive, or dependent on external computing resources. In order to bridge this gap, we introduce an autonomous logistics robot that combines adaptive control and visual perception on a small edge computing platform. The NVIDIA Jetson Nano was equipped with a modified ResNet-18 model that allowed it to concurrently execute three tasks: object-handling zone recognition, obstacle detection, and path tracking. A lightweight rack-and-pinion mechanism enables payload lifting of up to 2 kg without external assistance. Experimental evaluation in semi-structured warehouse settings demonstrated a path tracking accuracy of 92%, obstacle avoidance success of 88%, and object handling success of 90%, with a maximum perception-to-action latency of 150 m. The system maintains stable operation for up to 3 hours on a single charge. Unlike other approaches that focus on single functions or require cloud support, our design integrates navigation, perception, and mechanical handling into a low-power, standalone solution. This highlights its potential as a practical and cost-effective automation platform for SMEs.

## Full-text entities

- **Diseases:** AMRs (MESH:D014086)
- **Chemicals:** acrylic (-), lithium (MESH:D008094)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** L298N

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12624282/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12624282/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC12624282/full.md

---
Source: https://tomesphere.com/paper/PMC12624282