# Port terminal mobile recognition based on combined YOLOv5s-DeepSort

**Authors:** Chengzhi Wang, Donghong Chen, Zhen Liu, Yuanhao Li, Yifei Wang, Sanglan Zhao

PMC · DOI: 10.1371/journal.pone.0326376 · PLOS One · 2025-07-10

## TL;DR

This paper introduces an improved model for object detection and tracking in port environments using YOLOv5s and DeepSORT, achieving high accuracy in challenging conditions.

## Contribution

The novel integration of multi-scale convolution, EPSA network, and distributed sorting loss enhances detection and tracking in port video images.

## Key findings

- Multi-scale convolution improved detection robustness with a 0.4% increase in mAP.
- EPSA network boosted mAP@0.5:0.95 by 1.2% through better feature fusion.
- Distributed sorting loss improved tracking accuracy by 3.1% (MOTA).

## Abstract

To solve the problem of reduced positioning accuracy caused by changes in scale, background and occlusion in port and dock video images, this research proposes an enhanced model combining YOLOv5s-DeepSORT, integrating target load recognition and trajectory tracking to improve adaptability to dock environments. The findings indicate that incorporating multi-scale convolution into YOLOv5s improved the robustness of multi-scale object detection, resulting in a 0.4% increase in mean Average Precision (mAP). Furthermore, the integration of an efficient pyramid segmentation attention (EPSA) network enhanced the accuracy of multi-scale feature fusion representation. The model’s mAP@0.5:0.95 increased by 1.2% following the introduction of EPSA. Finally, the original classification loss function was enhanced using a distributed sorting loss approach to mitigate the imbalance among loaded objects and the influence of background variations in the dock image sequence. This optimization led to a 3.1% improvement in multi-target tracking accuracy (MOTA). Experimental results on self-constructed datasets demonstrated an average accuracy of 90.9% and a detection accuracy of 92.2%, offering a valuable reference for target recognition and tracking in port and dock environments.

## Full-text entities

- **Genes:** IDH1 (isocitrate dehydrogenase (NADP(+)) 1) [NCBI Gene 3417] {aka HEL-216, HEL-S-26, IDCD, IDH, IDP, IDPC}
- **Diseases:** EPSA (MESH:C537538), occlusion (MESH:D001157), ID (MESH:C537985)
- **Chemicals:** Cranes (-)
- **Cell lines:** MOT16 — Homo sapiens (Human), Hairy cell leukemia, Cancer cell line (CVCL_1439)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12244586/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12244586/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12244586/full.md

---
Source: https://tomesphere.com/paper/PMC12244586