Accelerated Multi-objective Task Learning using Modified Q-learning   Algorithm

Varun Prakash Rajamohan; Senthil Kumar Jagatheesaperumal

arXiv:2409.01046·cs.RO·September 4, 2024

Accelerated Multi-objective Task Learning using Modified Q-learning Algorithm

Varun Prakash Rajamohan, Senthil Kumar Jagatheesaperumal

PDF

TL;DR

This paper introduces a modified Q-learning algorithm called Q-SD that improves task learning efficiency and effectiveness in robotic table cleaning by minimizing movement distance and increasing success rates.

Contribution

It proposes the Q-SD algorithm, a novel modification to Q-learning that enhances task learning and movement efficiency in robotic applications.

Findings

01

Success rate of 86% and 59% in two grid environments.

02

Reduction in movement distance by 8.61% and 6.7%.

03

Improved task completion efficiency over traditional Q-learning.

Abstract

Robots find extensive applications in industry. In recent years, the influence of robots has also increased rapidly in domestic scenarios. The Q-learning algorithm aims to maximise the reward for reaching the goal. This paper proposes a modified version of the Q-learning algorithm, known as Q-learning with scaled distance metric (Q-SD). This algorithm enhances task learning and makes task completion more meaningful. A robotic manipulator (agent) applies the Q-SD algorithm to the task of table cleaning. Using Q-SD, the agent acquires the sequence of steps necessary to accomplish the task while minimising the manipulator's movement distance. We partition the table into grids of different dimensions. The first has a grid count of 3 times 3, and the second has a grid count of 4 times 4. Using the Q-SD algorithm, the maximum success obtained in these two environments was 86% and 59%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning