The ATOM Report: Measuring the Open Language Model Ecosystem

Nathan Lambert; Florian Brand

arXiv:2604.07190·cs.CY·April 9, 2026

The ATOM Report: Measuring the Open Language Model Ecosystem

Nathan Lambert, Florian Brand

PDF

TL;DR

This paper provides a detailed snapshot of the open language model ecosystem, highlighting adoption trends, regional shifts, and key metrics for leading models like Llama and Qwen.

Contribution

It offers the first comprehensive analysis of open language model adoption, regional dominance shifts, and ecosystem dynamics based on diverse data sources.

Findings

01

Chinese models overtook U.S. models in summer 2025

02

The ecosystem includes ~1.5K mainline open models

03

Analysis covers downloads, derivatives, market share, and performance

Abstract

We present a comprehensive adoption snapshot of the leading open language models and who is building them, focusing on the ~1.5K mainline open models from the likes of Alibaba's Qwen, DeepSeek, Meta's Llama, that are the foundation of an ecosystem crucial to researchers, entrepreneurs, and policy advisors. We document a clear trend where Chinese models overtook their counterparts built in the U.S. in the summer of 2025 and subsequently widened the gap over their western counterparts. We study a mix of Hugging Face downloads and model derivatives, inference market share, performance metrics and more to make a comprehensive picture of the ecosystem.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.