On the use of LLMs to generate a dataset of Neural Networks

Nadia Daoudi; Jordi Cabot

arXiv:2602.04388·cs.LG·February 5, 2026

On the use of LLMs to generate a dataset of Neural Networks

Nadia Daoudi, Jordi Cabot

PDF

Open Access

TL;DR

This paper presents a method using large language models to automatically generate a diverse, validated dataset of neural networks for benchmarking and evaluating tools that verify, refactor, and migrate neural architectures.

Contribution

The authors introduce a novel approach leveraging LLMs to create a publicly available, diverse neural network dataset for systematic evaluation of neural network tools.

Findings

01

Generated 608 neural network samples covering diverse architectures

02

Validated network correctness using static analysis and symbolic tracing

03

Dataset supports benchmarking neural network verification and adaptation tools

Abstract

Neural networks are increasingly used to support decision-making. To verify their reliability and adaptability, researchers and practitioners have proposed a variety of tools and methods for tasks such as NN code verification, refactoring, and migration. These tools play a crucial role in guaranteeing both the correctness and maintainability of neural network architectures, helping to prevent implementation errors, simplify model updates, and ensure that complex networks can be reliably extended and reused. Yet, assessing their effectiveness remains challenging due to the lack of publicly diverse datasets of neural networks that would allow systematic evaluation. To address this gap, we leverage large language models (LLMs) to automatically generate a dataset of neural networks that can serve as a benchmark for validation. The dataset is designed to cover diverse architectural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Software Engineering Research