A Comparison of Large Language Model and Human Performance on Random Number Generation Tasks
Rachel M. Harrison

TL;DR
This study compares ChatGPT-3.5's ability to generate random number sequences with human performance, revealing that the model more effectively avoids predictable patterns, which has implications for cognitive science research.
Contribution
The paper adapts a human RNGT for LLMs and demonstrates that ChatGPT-3.5 exhibits fewer predictable patterns than humans in random number generation tasks.
Findings
ChatGPT-3.5 shows lower repeat frequencies than humans.
ChatGPT-3.5 avoids sequential patterns more effectively.
Further research will explore different models and prompting techniques.
Abstract
Random Number Generation Tasks (RNGTs) are used in psychology for examining how humans generate sequences devoid of predictable patterns. By adapting an existing human RNGT for an LLM-compatible environment, this preliminary study tests whether ChatGPT-3.5, a large language model (LLM) trained on human-generated text, exhibits human-like cognitive biases when generating random number sequences. Initial findings indicate that ChatGPT-3.5 more effectively avoids repetitive and sequential patterns compared to humans, with notably lower repeat frequencies and adjacent number frequencies. Continued research into different models, parameters, and prompting methodologies will deepen our understanding of how LLMs can more closely mimic human random generation behaviors, while also broadening their applications in cognitive and behavioral science research.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling
