# Adaptive Non-Singular Fast Terminal Sliding Mode Trajectory Tracking Control for Robotic Manipulator with Novel Configuration Based on TD3 Deep Reinforcement Learning and Nonlinear Disturbance Observer

**Authors:** Huaqiang You, Yanjun Liu, Zhenjie Shi, Zekai Wang, Lin Wang, Gang Xue

PMC · DOI: 10.3390/s26010297 · Sensors (Basel, Switzerland) · 2026-01-02

## TL;DR

This paper introduces a new control strategy for robotic manipulators that combines deep reinforcement learning and disturbance observers to improve tracking accuracy and robustness.

## Contribution

The novel integration of TD3 deep reinforcement learning with NFTSMC and NDO for enhanced trajectory tracking in robotic manipulators.

## Key findings

- The proposed TD3NDONFT algorithm reduces position tracking errors by up to 19.94% in robotic manipulator joints.
- Velocity tracking errors are also reduced, with the highest improvement of 9.10% in one joint.
- The algorithm demonstrates strong robustness against sudden disturbances and unknown time-varying disturbances.

## Abstract

This work proposes a non-singular fast terminal sliding mode control (NFTSMC) strategy based on the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm and a nonlinear disturbance observer (NDO) to address the issues of modeling errors, motion disturbances, and transmission friction in robotic manipulators. Firstly, a novel modular serial 5-DOF robotic manipulator configuration is designed, and its kinematic and dynamic models are established. Secondly, a nonlinear disturbance observer is employed to estimate the total disturbance of the system and apply feedforward compensation. Based on boundary layer technology, an improved NFTSMC method is proposed to accelerate the convergence of tracking errors, reduce chattering, and avoid singularity issues inherent in traditional terminal sliding mode control. The stability of the designed control system is proved using Lyapunov stability theory. Subsequently, a deep reinforcement learning (DRL) agent based on the TD3 algorithm is trained to adaptively adjust the control gains of the non-singular fast terminal sliding mode controller. The dynamic information of the robotic manipulator is used as the input to the TD3 agent, which searches for optimal controller parameters within a continuous action space. A composite reward function is designed to ensure the stable and efficient learning of the TD3 agent. Finally, the motion characteristics of three joints for the designed 5-DOF robotic manipulator are analyzed. The results show that compared to the non-singular fast terminal sliding mode control algorithm based on a nonlinear disturbance observer (NDONFT), the non-singular fast terminal sliding mode control algorithm integrating a nonlinear disturbance observer and the Twin Delayed Deep Deterministic Policy Gradient algorithm (TD3NDONFT) reduces the mean absolute error of position tracking for the three joints by 7.14%, 19.94%, and 6.14%, respectively, and reduces the mean absolute error of velocity tracking by 1.78%, 9.10%, and 2.11%, respectively. These results verify the effectiveness of the proposed algorithm in enhancing the trajectory tracking accuracy of the robotic manipulator under unknown time-varying disturbances and demonstrate its strong robustness against sudden disturbances.

## Full-text entities

- **Diseases:** joint (MESH:D007592), injury to (MESH:D014947)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12788352/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12788352/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12788352/full.md

---
Source: https://tomesphere.com/paper/PMC12788352