MNIST-Gen: A Modular MNIST-Style Dataset Generation Using Hierarchical Semantics, Reinforcement Learning, and Category Theory

Pouya Shaeri; Arash Karimi; Ariane Middel

arXiv:2507.11821·cs.LG·July 17, 2025

MNIST-Gen: A Modular MNIST-Style Dataset Generation Using Hierarchical Semantics, Reinforcement Learning, and Category Theory

Pouya Shaeri, Arash Karimi, Ariane Middel

PDF

Open Access

TL;DR

MNIST-Gen is an innovative framework that automates the creation of customizable, hierarchical, and semantically meaningful MNIST-style datasets using reinforcement learning, CLIP, and category theory principles.

Contribution

It introduces a modular, adaptive system for generating tailored datasets with minimal manual effort, integrating semantic understanding and hierarchical categorization.

Findings

01

Achieved 85% automatic categorization accuracy.

02

Demonstrated 80% time savings over manual dataset creation.

03

Generated novel datasets: Tree-MNIST and Food-MNIST.

Abstract

Neural networks are often benchmarked using standard datasets such as MNIST, FashionMNIST, or other variants of MNIST, which, while accessible, are limited to generic classes such as digits or clothing items. For researchers working on domain-specific tasks, such as classifying trees, food items, or other real-world objects, these data sets are insufficient and irrelevant. Additionally, creating and publishing a custom dataset can be time consuming, legally constrained, or beyond the scope of individual projects. We present MNIST-Gen, an automated, modular, and adaptive framework for generating MNIST-style image datasets tailored to user-specified categories using hierarchical semantic categorization. The system combines CLIP-based semantic understanding with reinforcement learning and human feedback to achieve intelligent categorization with minimal manual intervention. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques