Solving Rubik's Cube Without Tricky Sampling

Yicheng Lin; Siyu Liang

arXiv:2411.19583·cs.LG·December 2, 2024

Solving Rubik's Cube Without Tricky Sampling

Yicheng Lin, Siyu Liang

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning algorithm that enables a neural network to solve the 2x2x2 Rubik's Cube from fully scrambled states without relying on near-solved state sampling or tree search, achieving over 99.4% success.

Contribution

The paper presents a new RL method using policy gradients and neural networks to solve the Rubik's Cube directly from scrambled states, bypassing traditional sampling and search techniques.

Findings

01

Achieved over 99.4% success rate on 2x2x2 Rubik's Cube.

02

Solved the cube using only a policy network without tree search.

03

Demonstrated effectiveness in a sparse-reward environment.

Abstract

The Rubiks Cube, with its vast state space and sparse reward structure, presents a significant challenge for reinforcement learning (RL) due to the difficulty of reaching rewarded states. Previous research addressed this by propagating cost-to-go estimates from the solved state and incorporating search techniques. These approaches differ from human strategies that start from fully scrambled cubes, which can be tricky for solving a general sparse-reward problem. In this paper, we introduce a novel RL algorithm using policy gradient methods to solve the Rubiks Cube without relying on near solved-state sampling. Our approach employs a neural network to predict cost patterns between states, allowing the agent to learn directly from scrambled states. Our method was tested on the 2x2x2 Rubiks Cube, where the cube was scrambled 50,000 times, and the model successfully solved it in over 99.4%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Mathematical and Theoretical Analysis