VectraYX-Nano: A 42M-Parameter Spanish Cybersecurity Language Model with Curriculum Learning and Native Tool Use

Juan S. Santillana

arXiv:2605.13989·cs.CL·May 22, 2026

TL;DR

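This paper introduces VectraYX-Nano, a 41.95M-parameter Spanish cybersecurity language model trained from scratch with a three-phase curriculum, native tool use via the Model Context Protocol (MCP), and a new 170M-token corpus, VectraYX-Sec-ES. The paper reports competitive performance and efficient deployment.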

Contribution

The paper presents a new Spanish cybersecurity language model with curriculum phases, native tool invocation, and detailed empirical analysis of training strategies and corpus effects.

Findings

01

Curriculum training with replay yields a monotonic loss descent across the three phases (9.80 -> 3.17 -> 3.00 -> 2.16).

02

Lower-perplexity bootstraps can lead to worse conversational behavior.

03

Rebalancing the tool-use ratio enhances the model's tool integration capabilities.
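
The curriculum-with-replay idea in the findings above can be sketched as a sampling mixture: while training on a later phase, a share of batches is still drawn from earlier phases. The phase sizes below come from the abstract; the function names, the fixed `replay_fraction` of 0.2, and the proportional split across earlier phases are assumptions made for illustration, not the paper's documented schedule.

```python
import random

# Phase sizes in millions of tokens, as reported in the abstract.
PHASE_TOKENS = {"conversational": 42, "cybersecurity": 118, "offensive_tooling": 10}
PHASES = list(PHASE_TOKENS)  # curriculum order


def replay_mixture(current_phase, replay_fraction=0.2):
    """Return sampling weights over phases while training `current_phase`.

    ASSUMPTION: a fixed share of batches (`replay_fraction`) is replayed from
    earlier phases, split in proportion to their token counts. The actual
    replay schedule used for VectraYX-Nano is not stated in the text above.
    """
    idx = PHASES.index(current_phase)
    earlier = PHASES[:idx]
    if not earlier:
        # The first phase has nothing to replay.
        return {current_phase: 1.0}
    total_earlier = sum(PHASE_TOKENS[p] for p in earlier)
    weights = {
        p: replay_fraction * PHASE_TOKENS[p] / total_earlier for p in earlier
    }
    weights[current_phase] = 1.0 - replay_fraction
    return weights


def sample_phase(weights, rng):
    """Pick the phase that the next batch is drawn from."""
    names = list(weights)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)
    mix = replay_mixture("offensive_tooling")
    print(mix)
    print([sample_phase(mix, rng) for _ in range(5)])
```

Under these assumptions, the smallest phase (offensive tooling) still sees mostly its own data, while the two earlier phases share the replay budget so that earlier skills keep receiving gradient signal.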

Abstract

We present VectraYX-Nano, a 41.95M-parameter decoder-only language model trained from scratch in Spanish for cybersecurity, with a Latin-American regional focus and native tool invocation via the Model Context Protocol (MCP). The model has four contributions. (i) Corpus: VectraYX-Sec-ES, a 170M-token Spanish corpus assembled by an eight-VM distributed pipeline at ~$25 USD of cloud compute and split into three curriculum phases (conversational 42M, cybersecurity 118M, offensive tooling 10M). (ii) Architecture: a 42M Transformer decoder with GQA, QK-Norm, RMSNorm, SwiGLU, RoPE and z-loss, paired with a domain-balanced 16,384-token byte-fallback BPE. (iii) Curriculum with replay across the three phases yields a monotonic loss descent (9.80 -> 3.17 -> 3.00 -> 2.16); after SFT (loss 1.74) the v2 bootstrap-ablation reference attains a conversational gate of 0.775 +/- 0.043 on B5 over N=4…
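
Two of the architecture components named in the abstract, RMSNorm and z-loss, are compact enough to illustrate directly. The following is a minimal, dependency-free sketch of their standard formulations, not the authors' implementation; the function names, the `eps` value, and the `z_coef` of 1e-4 are assumptions.

```python
import math


def rms_norm(x, gain, eps=1e-6):
    """RMSNorm: rescale a vector by its root mean square, then apply a gain.

    Unlike LayerNorm, no mean is subtracted and no bias is added.
    """
    mean_square = sum(v * v for v in x) / len(x)
    scale = 1.0 / math.sqrt(mean_square + eps)
    return [g * v * scale for v, g in zip(x, gain)]


def log_sum_exp(logits):
    """Numerically stable log of the softmax normalizer Z."""
    m = max(logits)
    return m + math.log(sum(math.exp(v - m) for v in logits))


def cross_entropy_with_z_loss(logits, target, z_coef=1e-4):
    """Cross-entropy plus the auxiliary z-loss term z_coef * (log Z)^2.

    ASSUMPTION: z_coef=1e-4 is a commonly used value, not one taken from
    the paper. The penalty discourages log Z from drifting far from zero.
    """
    log_z = log_sum_exp(logits)
    ce = log_z - logits[target]
    z_loss = z_coef * log_z ** 2
    return ce + z_loss, ce, z_loss


if __name__ == "__main__":
    print(rms_norm([3.0, 4.0], gain=[1.0, 1.0]))
    print(cross_entropy_with_z_loss([2.0, 0.5, -1.0], target=0))
```

The z-loss term leaves the predicted distribution unchanged for a given set of logits; it only adds a small penalty that keeps the softmax normalizer well scaled during training.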

Code & Models

Repositories

vectrayx/vectrayx-nano-paper (GitHub)
