RAMario: Experimental Approach to Reptile Algorithm -- Reinforcement Learning for Mario
Sanyam Jain

TL;DR
This paper explores using the Reptile meta-learning algorithm to train a neural network for playing Super Mario Bros, comparing its performance with PPO and DQN, and demonstrating promising results in few-shot learning scenarios.
Contribution
It introduces an experimental application of Reptile to reinforcement learning in video games, showing competitive performance against established algorithms.
Findings
Reptile achieved a total distance of ~2300 in the game environment.
Reptile outperformed PPO and DQN in certain metrics.
The approach demonstrates potential for few-shot learning in game AI.
Abstract
This research paper presents an experimental approach to using the Reptile algorithm for reinforcement learning to train a neural network to play Super Mario Bros. We implement the Reptile algorithm using the Super Mario Bros Gym library and TensorFlow in Python, creating a neural network model with a single convolutional layer, a flatten layer, and a dense layer. We define the optimizer and use the Reptile class to create an instance of the Reptile meta-learning algorithm. We train the model using multiple tasks and episodes, choosing actions using the current weights of the neural network model, taking those actions in the environment, and updating the model weights using the Reptile algorithm. We evaluate the performance of the algorithm by printing the total reward for each episode. In addition, we compare the performance of the Reptile algorithm approach to two other popular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Educational Games and Gamification · Human Pose and Action Recognition
MethodsLib
