Learning with Value-Ramp

Tom J. Ameloot; Jan Van den Bussche

arXiv:1608.03647·cs.LG·April 25, 2017

Learning with Value-Ramp

Tom J. Ameloot, Jan Van den Bussche

PDF

Open Access

TL;DR

This paper introduces the Value-Ramp learning principle where agents follow increasing value sequences to reach reward peaks, offering a simple, natural, and robust approach for reinforcement learning.

Contribution

It proposes the novel Value-Ramp algorithm based on ramp-following intuition, emphasizing simplicity and robustness with natural number implementation.

Findings

01

The algorithm effectively guides agents towards reward peaks.

02

It is easy to configure and implement.

03

The approach is robust across different scenarios.

Abstract

We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Computability, Logic, AI Algorithms · Logic, Reasoning, and Knowledge