Combining Model-Free Q-Ensembles and Model-Based Approaches for Informed Exploration

Sreecharan Sankaranarayanan; Raghuram Mandyam Annasamy; Katia Sycara; Carolyn Penstein Rosé

arXiv:1806.04552·cs.LG·June 13, 2018

TL;DR

This paper proposes integrating model-free Q-ensembles with model-based trajectory memory to enhance exploration in reinforcement learning, demonstrating improved performance over using Q-ensembles alone.

Contribution

It introduces a novel combination of Q-ensembles and model-based trajectory memory for better exploration in RL tasks.

Findings

01 Combined approach outperforms standalone Q-ensembles.

02 Model-based trajectory memory enhances exploration efficiency.

03 Reported gains are measured against a baseline that uses only Q-ensembles.

Abstract

Q-Ensembles are a model-free approach where input images are fed into different Q-networks and exploration is driven by the assumption that uncertainty is proportional to the variance of the output Q-values obtained. They have been shown to perform relatively well compared to other exploration strategies. Further, model-based approaches, such as encoder-decoder models, have been used successfully for next-frame prediction given previous frames. This paper proposes to integrate the model-free Q-ensembles and model-based approaches with the hope of compounding the benefits of both and achieving superior exploration as a result. Results show that a model-based trajectory memory approach, when combined with Q-ensembles, produces superior performance compared to using Q-ensembles alone.
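
To make the variance-as-uncertainty idea from the abstract concrete, the sketch below scores each action by the ensemble's mean Q-value plus a bonus based on the spread across ensemble members. It is an illustration under stated assumptions, not the paper's implementation: the function names `ensemble_uncertainty` and `select_action`, the `bonus_weight` parameter, and the choice of a mean-plus-standard-deviation rule are all hypothetical, and the Q-values are passed in as plain lists rather than produced by Q-networks.

```python
from statistics import mean, pvariance
from typing import List, Sequence


def ensemble_uncertainty(q_values: Sequence[Sequence[float]]) -> List[float]:
    """Return the per-action variance across ensemble members.

    q_values[k][a] is the Q-value that ensemble member k assigns to action a.
    """
    num_actions = len(q_values[0])
    return [
        pvariance([member[a] for member in q_values])
        for a in range(num_actions)
    ]


def select_action(q_values: Sequence[Sequence[float]],
                  bonus_weight: float = 1.0) -> int:
    """Pick the action with the highest mean Q-value plus uncertainty bonus.

    Assumption: the bonus is bonus_weight times the standard deviation of
    the ensemble's Q-values; the paper may use a different rule.
    """
    num_actions = len(q_values[0])
    variances = ensemble_uncertainty(q_values)
    scores = [
        mean([member[a] for member in q_values])
        + bonus_weight * variances[a] ** 0.5
        for a in range(num_actions)
    ]
    return max(range(num_actions), key=lambda a: scores[a])


# Three hypothetical ensemble members, two actions.
# Members agree on action 0 but disagree on action 1.
example_q = [[1.0, 0.5], [1.0, 1.5], [1.0, -0.5]]
greedy_choice = select_action(example_q, bonus_weight=0.0)
exploratory_choice = select_action(example_q, bonus_weight=1.0)
```

With `bonus_weight=0.0` the rule reduces to greedy selection on the ensemble mean; raising it shifts preference toward actions on which the ensemble members disagree, which is the sense in which disagreement drives exploration here.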

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Advanced Vision and Imaging