Solving Rubik's Cube Without Tricky Sampling
Yicheng Lin, Siyu Liang

TL;DR
This paper introduces a novel reinforcement learning algorithm that enables a neural network to solve the 2x2x2 Rubik's Cube from fully scrambled states without relying on near-solved state sampling or tree search, achieving over 99.4% success.
Contribution
The paper presents a new RL method using policy gradients and neural networks to solve the Rubik's Cube directly from scrambled states, bypassing traditional sampling and search techniques.
Findings
Achieved over 99.4% success rate on 2x2x2 Rubik's Cube.
Solved the cube using only a policy network without tree search.
Demonstrated effectiveness in a sparse-reward environment.
Abstract
The Rubiks Cube, with its vast state space and sparse reward structure, presents a significant challenge for reinforcement learning (RL) due to the difficulty of reaching rewarded states. Previous research addressed this by propagating cost-to-go estimates from the solved state and incorporating search techniques. These approaches differ from human strategies that start from fully scrambled cubes, which can be tricky for solving a general sparse-reward problem. In this paper, we introduce a novel RL algorithm using policy gradient methods to solve the Rubiks Cube without relying on near solved-state sampling. Our approach employs a neural network to predict cost patterns between states, allowing the agent to learn directly from scrambled states. Our method was tested on the 2x2x2 Rubiks Cube, where the cube was scrambled 50,000 times, and the model successfully solved it in over 99.4%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Mathematical and Theoretical Analysis
