# Exploiting the Kumaraswamy distribution in a reinforcement learning context

**Authors:** Davide Picchi, Sigrid Brell-Çokcan

PMC · DOI: 10.3389/frobt.2025.1589025 · Frontiers in Robotics and AI · 2025-10-30

## TL;DR

This paper explores using the Kumaraswamy distribution in reinforcement learning for mini-crane automation, showing it offers computational benefits without sacrificing performance.

## Contribution

The novel use of the Kumaraswamy distribution in RL for continuous control tasks is proposed and validated.

## Key findings

- The Kumaraswamy distribution provides computational advantages over Gaussian distributions in RL.
- It maintains robust performance in continuous control tasks like mini-crane automation.
- This distribution is a viable alternative for action selection in reinforcement learning.

## Abstract

Mini cranes play a pivotal role in construction due to their versatility across numerous scenarios. Recent advancements in Reinforcement Learning (RL) have enabled agents to operate cranes in virtual environments for predetermined tasks, paving the way for future real-world deployment. Traditionally, most RL agents use a squashed Gaussian distribution to select actions. In this study, we investigate a mini-crane scenario that could potentially be fully automated by AI and explore replacing the Gaussian distribution with the Kumaraswamy distribution, a close relative of the Beta distribution, for action stochastic selection. Our results indicate that the Kumaraswamy distribution offers computational advantages while maintaining robust performance, making it an attractive alternative for RL applications in continuous control applications.

## Full-text entities

- **Genes:** KL (klotho) [NCBI Gene 9365] {aka HFTC3, KLA}
- **Chemicals:** PPO (-), steel (MESH:D013232)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** A2C

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12611641/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12611641/full.md

## References

15 references — full list in the complete paper: https://tomesphere.com/paper/PMC12611641/full.md

---
Source: https://tomesphere.com/paper/PMC12611641