Experience enrichment based task independent reward model

Min Xu

arXiv:1705.07460·cs.AI·May 23, 2017

Experience enrichment based task independent reward model

Min Xu

PDF

Open Access

TL;DR

This paper introduces a task-independent implicit reward model for reinforcement learning that derives rewards from deviations in agents' experiences, aiming to better reflect real-world complex interactions.

Contribution

The paper proposes a novel implicit reward model that is task-independent and based on experience deviations, differing from traditional manually defined rewards.

Findings

01

The implicit reward model effectively captures complex interactions.

02

It improves the generality of reinforcement learning approaches.

03

The model is adaptable to various environments.

Abstract

For most reinforcement learning approaches, the learning is performed by maximizing an accumulative reward that is expectedly and manually defined for specific tasks. However, in real world, rewards are emergent phenomena from the complex interactions between agents and environments. In this paper, we propose an implicit generic reward model for reinforcement learning. Unlike those rewards that are manually defined for specific tasks, such implicit reward is task independent. It only comes from the deviation from the agents' previous experiences.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Neural dynamics and brain function