Identifying Disruptive Models in the Open-Source LLM Community
Xiaoting Wei, Lele Kang, Xuelian Pan, Jiannan Yang

TL;DR
This paper introduces the Model Disruption Index (MDI) to identify models that significantly influence subsequent development in the open-source LLM community, revealing a mostly consolidative ecosystem with some models acting as disruptive catalysts.
Contribution
It presents a novel metric, MDI, for detecting disruptive models in large-scale open-source LLMs, and analyzes their characteristics and emergence patterns.
Findings
Most open-source LLM models are consolidative, not disruptive.
Disruptive models tend to be large-scale and involve finetuning.
Disruptive positions are more likely among large models.
Abstract
The rapid growth of open-source large language models (LLMs) has created a complex ecosystem of model inheritance and reuse. However, existing research has focused mainly on descriptive analyses of lineage evolution, with limited attention to identifying which models play a disruptive role in shaping subsequent development. Using metadata from 2,556,240 models on Hugging Face, this study reconstructs a large-scale lineage network and introduces the Model Disruption Index (MDI) to distinguish between models that reinforce existing technological trajectories and those that become new bases for later development. The results show that most models in the open-source LLM community are consolidative rather than disruptive, reflecting a highly concentrated and path-dependent evolutionary structure. Further analyses suggest that disruptive positions are more likely to emerge among large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
