Stylized Structural Patterns for Improved Neural Network Pre-training
Farnood Salehi, Vandit Sharma, Amirhossein Askari Farsangi, Tun\c{c} Ozan Ayd{\i}n

TL;DR
This paper introduces a novel synthetic data generation method using neural fractals and reverse stylization, significantly reducing the domain gap and improving model performance in vision tasks when real data is scarce.
Contribution
It presents an improved neural fractal formulation and a reverse stylization technique to create more effective synthetic datasets for neural network pre-training.
Findings
Lowered domain gap between synthetic and real images using KID.
11% reduction in FID for diffusion models trained on synthetic data.
Over 10% accuracy improvement on ImageNet-100 with ViT-S.
Abstract
Modern deep learning models in computer vision require large datasets of real images, which are difficult to curate and pose privacy and legal concerns, limiting their commercial use. Recent works suggest synthetic data as an alternative, yet models trained with it often underperform. This paper proposes a two-step approach to bridge this gap. First, we propose an improved neural fractal formulation through which we introduce a new class of synthetic data. Second, we propose reverse stylization, a technique that transfers visual features from a small, license-free set of real images onto synthetic datasets, enhancing their effectiveness. We analyze the domain gap between our synthetic datasets and real images using Kernel Inception Distance (KID) and show that our method achieves a significantly lower distributional gap compared to existing synthetic datasets. Furthermore, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
