Random forests, sound symbolism and Pokemon evolution

Alexander James Kilpatrick; Aleksandra Cwiek; Shigeto Kawahara

arXiv:2301.01948·cs.LG·January 6, 2023

Random forests, sound symbolism and Pokemon evolution

Alexander James Kilpatrick, Aleksandra Cwiek, Shigeto Kawahara

PDF

TL;DR

This paper demonstrates that random forests trained on Pokemon names can effectively classify evolutionary stages based on sound symbolism, outperforming human participants in accuracy, and introduces a novel cross-validation method to address overfitting.

Contribution

It presents a new application of machine learning to sound symbolism and evolution, and introduces a novel cross-validation technique for better model generalization.

Findings

01

Random forests classify Pokemon evolution with high accuracy.

02

Models outperform humans in classifying evolutionary stages.

03

A new cross-validation method reduces overfitting in sound symbolism models.

Abstract

This study constructs machine learning algorithms that are trained to classify samples using sound symbolism, and then it reports on an experiment designed to measure their understanding against human participants. Random forests are trained using the names of Pokemon, which are fictional video game characters, and their evolutionary status. Pokemon undergo evolution when certain in-game conditions are met. Evolution changes the appearance, abilities, and names of Pokemon. In the first experiment, we train three random forests using the sounds that make up the names of Japanese, Chinese, and Korean Pokemon to classify Pokemon into pre-evolution and post-evolution categories. We then train a fourth random forest using the results of an elicitation experiment whereby Japanese participants named previously unseen Pokemon. In Experiment 2, we reproduce those random forests with name length…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.