An Exploration of Deep Learning Methods in Hungry Geese

Nikzad Khani; Matthew Kluska

arXiv:2109.01954·cs.AI·September 7, 2021

An Exploration of Deep Learning Methods in Hungry Geese

Nikzad Khani, Matthew Kluska

PDF

Open Access 1 Repo

TL;DR

This paper evaluates various deep reinforcement learning value methods in the stochastic environment of Hungry Geese, highlighting the limitations of Deep Q Networks and suggesting potential improvements and alternative models.

Contribution

It systematically compares Deep Q, Double Q, and Dueling Q-Networks in Hungry Geese, revealing the strengths and weaknesses of each in a complex environment.

Findings

01

Vanilla Deep Q Network performed best among tested models.

02

Convergence to optimal policy was challenging due to environment stochasticity.

03

Deep Q Networks may not be ideal for highly stochastic environments.

Abstract

Hungry Geese is a n-player variation of the popular game snake. This paper looks at state of the art Deep Reinforcement Learning Value Methods. The goal of the paper is to aggregate research of value based methods and apply it as an exercise to other environments. A vanilla Deep Q Network, a Double Q-network and a Dueling Q-Network were all examined and tested with the Hungry Geese environment. The best performing model was the vanilla Deep Q Network due to its simple state representation and smaller network structure. Converging towards an optimal policy was found to be difficult due to random geese initialization and food generation. Therefore we show that Deep Q Networks may not be the appropriate model for such a stochastic environment and lastly we present improvements that can be made along with more suitable models for the environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nikzadkhani/famished-geese
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Metaheuristic Optimization Algorithms Research