Evaluation of Categorical Generative Models -- Bridging the Gap Between Real and Synthetic Data
Florence Regol, Anja Kroon, Mark Coates

TL;DR
This paper proposes a realistic, scalable evaluation method for categorical generative models that assesses their ability to learn complex distributions by progressively testing on smaller, more challenging probability spaces.
Contribution
It introduces a novel evaluation framework based on ground truth and high-dimensional binning, tailored for realistic categorical data modeling tasks.
Findings
The method effectively distinguishes model performance across different difficulty levels.
State-of-the-art models are evaluated, revealing their strengths and limitations.
The approach provides a more reliable assessment of generative models in practical scenarios.
Abstract
The machine learning community has mainly relied on real data to benchmark algorithms as it provides compelling evidence of model applicability. Evaluation on synthetic datasets can be a powerful tool to provide a better understanding of a model's strengths, weaknesses, and overall capabilities. Gaining these insights can be particularly important for generative modeling as the target quantity is completely unknown. Multiple issues related to the evaluation of generative models have been reported in the literature. We argue those problems can be avoided by an evaluation based on ground truth. General criticisms of synthetic experiments are that they are too simplified and not representative of practical scenarios. As such, our experimental setting is tailored to a realistic generative task. We focus on categorical data and introduce an appropriately scalable evaluation method. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistics Education and Methodologies · Music and Audio Processing · Time Series Analysis and Forecasting
