Exploring the Performance of ML/DL Architectures on the MNIST-1D Dataset
Michael Beebe, GodsGift Uzor, Manasa Chepuri, Divya Sree Vemula, Angel Ayala

TL;DR
This paper evaluates various neural network architectures on the MNIST-1D dataset, demonstrating that advanced models like TCN and DCNN outperform simpler ones, highlighting the importance of inductive biases and hierarchical features in small structured datasets.
Contribution
The study extends the benchmarking of MNIST-1D by systematically evaluating ResNet, TCN, and DCNN architectures, showing their effectiveness in small-scale sequential data tasks.
Findings
TCN and DCNN outperform simpler models on MNIST-1D
ResNet shows significant performance improvements
Advanced architectures achieve near-human accuracy
Abstract
Small datasets like MNIST have historically been instrumental in advancing machine learning research by providing a controlled environment for rapid experimentation and model evaluation. However, their simplicity often limits their utility for distinguishing between advanced neural network architectures. To address these challenges, Greydanus et al. introduced the MNIST-1D dataset, a one-dimensional adaptation of MNIST designed to explore inductive biases in sequential data. This dataset maintains the advantages of small-scale datasets while introducing variability and complexity that make it ideal for studying advanced architectures. In this paper, we extend the exploration of MNIST-1D by evaluating the performance of Residual Networks (ResNet), Temporal Convolutional Networks (TCN), and Dilated Convolutional Neural Networks (DCNN). These models, known for their ability to capture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)
