Self-Supervision is All You Need for Solving Rubik's Cube

Kyo Takano

arXiv:2106.03157·cs.LG·May 24, 2023

Self-Supervision is All You Need for Solving Rubik's Cube

Kyo Takano

PDF

Open Access 1 Repo

TL;DR

This paper presents a simple deep learning approach that uses self-supervision to efficiently solve combinatorial puzzles like Rubik's Cube, outperforming previous methods with less data and training.

Contribution

It introduces a self-supervised deep learning method that achieves near-optimal solutions for combinatorial puzzles, reducing complexity and training requirements.

Findings

01

Outperforms DeepCubeA in solution quality and efficiency

02

Requires less training data for comparable results

03

Scales effectively with model size and data volume

Abstract

Existing combinatorial search methods are often complex and require some level of expertise. This work introduces a simple and efficient deep learning method for solving combinatorial problems with a predefined goal, represented by Rubik's Cube. We demonstrate that, for such problems, training a deep neural network on random scrambles branching from the goal state is sufficient to achieve near-optimal solutions. When tested on Rubik's Cube, 15 Puzzle, and 7 $\times$ 7 Lights Out, our method outperformed the previous state-of-the-art method DeepCubeA, improving the trade-off between solution optimality and computational cost, despite significantly less training data. Furthermore, we investigate the scaling law of our Rubik's Cube solver with respect to model size and training data volume.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kyo-takano/EfficientCube
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Software Engineering Research · Metaheuristic Optimization Algorithms Research