Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN   Outspeeds Data Augmentation by Smaller Sample -- Application in Gomoku   Reinforcement Learning

Chi-Hang Suen; Eduardo Alonso (City; University of London)

arXiv:2301.04746·cs.LG·May 17, 2023

Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN Outspeeds Data Augmentation by Smaller Sample -- Application in Gomoku Reinforcement Learning

Chi-Hang Suen, Eduardo Alonso (City, University of London)

PDF

Open Access

TL;DR

This paper introduces SLAP, a novel, model-independent method that enhances learning speed and reduces sample requirements in CNNs and reinforcement learning, demonstrated through Gomoku game experiments.

Contribution

SLAP is a new protocol that produces consistent outputs across transformations, significantly speeding up CNN training and reducing sample needs without data augmentation.

Findings

01

SLAP improved CNN convergence speed by 83%.

02

SLAP reduced training samples by a factor of 8 in Gomoku reinforcement learning.

03

SLAP achieved similar performance to data augmentation in reinforcement learning.

Abstract

To replace data augmentation, this paper proposed a method called SLAP to intensify experience to speed up machine learning and reduce the sample size. SLAP is a model-independent protocol/function to produce the same output given different transformation variants. SLAP improved the convergence speed of convolutional neural network learning by 83% in the experiments with Gomoku game states, with only one eighth of the sample size compared with data augmentation. In reinforcement learning for Gomoku, using AlphaGo Zero/AlphaZero algorithm with data augmentation as baseline, SLAP reduced the number of training samples by a factor of 8 and achieved similar winning rate against the same evaluator, but it was not yet evident that it could speed up reinforcement learning. The benefits should at least apply to domains that are invariant to symmetry or certain transformations. As future work,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Machine Learning and Data Classification

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings