Founder effects shape the evolutionary dynamics of multimodality in open LLM families
Manuel Cebrian

TL;DR
This study analyzes how multimodal capabilities emerge and spread within open large language model families, revealing founder effects and rapid within-lineage expansion that shape their evolutionary dynamics.
Contribution
It provides the first large-scale quantitative analysis of multimodality evolution in open LLM families, highlighting founder effects and lineage-specific expansion patterns.
Findings
Multimodality is rare within major LLM families until 2024-2025.
Most vision-language models appear as new roots without recorded parents.
Multimodal capabilities expand primarily within existing VLM lineages.
Abstract
Large language model (LLM) families are improving rapidly, yet it remains unclear how quickly multimodal capabilities emerge and propagate within open families. Using the ModelBiome AI Ecosystem dataset of Hugging Face model metadata and recorded lineage fields (>1.8x10^6 model entries), we quantify multimodality over time and along recorded parent-to-child relations. Cross-modal tasks are widespread in the broader ecosystem well before they become common within major open LLM families: within these families, multimodality remains rare through 2023 and most of 2024, then increases sharply in 2024-2025 and is dominated by image-text vision-language tasks. Across major families, the first vision-language model (VLM) variants typically appear months after the first text-generation releases, with lags ranging from ~1 month (Gemma) to more than a year for several families and ~26 months for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Computational and Text Analysis Methods · Multimodal Machine Learning Applications
