SIMCODE: A Benchmark for Natural Language to ns-3 Network Simulation Code Generation

Tasnim Ahmed; Mirza Mohammad Azwad; Salimur Choudhury

arXiv:2507.11014·cs.NI·July 16, 2025

SIMCODE: A Benchmark for Natural Language to ns-3 Network Simulation Code Generation

Tasnim Ahmed, Mirza Mohammad Azwad, Salimur Choudhury

PDF

Open Access

TL;DR

This paper introduces SIMCODE, a benchmark for evaluating large language models' ability to generate ns-3 network simulation code from natural language, highlighting current capabilities and challenges.

Contribution

We present SIMCODE, the first comprehensive benchmark with 400 tasks for assessing LLMs in domain-specific network simulation code generation.

Findings

01

GPT-4.1 outperforms other models in accuracy

02

Execution accuracy remains modest and improvable

03

Missing headers and API mismatches are common errors

Abstract

Large language models (LLMs) have demonstrated remarkable capabilities in code generation across various domains. However, their effectiveness in generating simulation scripts for domain-specific environments like ns-3 remains underexplored. Despite the growing interest in automating network simulations, existing tools primarily focus on interactive automation over rigorous evaluation. To facilitate systematic evaluation, we introduce SIMCODE, the first benchmark to evaluate LLMs' ability to generate ns-3 simulation code from natural language. SIMCODE includes 400 tasks across introductory, intermediate, and advanced levels, with solutions and test cases. Using SIMCODE, we evaluate three prominent LLMs, Gemini-2.0, GPT-4.1, and Qwen-3, across six prompt techniques. Furthermore, investigating task-specific fine-tuning's impact reveals that while GPT-4.1 outperforms others, execution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications · Service-Oriented Architecture and Web Services