# Generalised Discount Functions applied to a Monte-Carlo AImu   Implementation

**Authors:** Sean Lamont, John Aslanides, Jan Leike, Marcus Hutter

arXiv: 1703.01358 · 2017-03-07

## TL;DR

This paper extends a reinforcement learning simulation platform to include generalized discount functions, demonstrating their effects on agent behavior in a simple MDP and experimentally validating theoretical predictions.

## Contribution

It introduces the implementation of arbitrary discount functions in a Monte-Carlo RL platform and empirically investigates their impact on agent policies.

## Key findings

- Agent behavior aligns with theoretical expectations when parameters are properly set.
- Different discount functions influence agent decision-making as predicted by theory.
- The platform enables concrete experimentation with generalized discounting in RL.

## Abstract

In recent years, work has been done to develop the theory of General Reinforcement Learning (GRL). However, there are few examples demonstrating these results in a concrete way. In particular, there are no examples demonstrating the known results regarding gener- alised discounting. We have added to the GRL simulation platform AIXIjs the functionality to assign an agent arbitrary discount functions, and an environment which can be used to determine the effect of discounting on an agent's policy. Using this, we investigate how geometric, hyperbolic and power discounting affect an informed agent in a simple MDP. We experimentally reproduce a number of theoretical results, and discuss some related subtleties. It was found that the agent's behaviour followed what is expected theoretically, assuming appropriate parameters were chosen for the Monte-Carlo Tree Search (MCTS) planning algorithm.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.01358/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1703.01358/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/1703.01358/full.md

---
Source: https://tomesphere.com/paper/1703.01358