# Cosmos 1.0: a multidimensional map of the emerging technology frontier

**Authors:** Xian Gong, Paul X. McCarthy, Colin Griffith, Claire McFarland, Marian-Andrei Rizoiu

PMC · DOI: 10.1038/s41597-025-06125-y · 2025-11-19

## TL;DR

The paper introduces Cosmos 1.0, a dataset mapping emerging technologies using embeddings and indices to analyze their landscape.

## Contribution

A novel methodology for mapping technologies with contextual embeddings and new indices like Technology Awareness and Generality.

## Key findings

- Cosmos 1.0 includes 23,544 technology-adjacent entities with hierarchical structure and eight external indices.
- 100 emerging technologies were manually verified and categorized into seven thematic and three meta clusters.
- Indices like Technology Awareness and Age of Tech help assess the maturity and relevance of emerging technologies.

## Abstract

This paper introduces the Cosmos 1.0 dataset and describes a novel methodology for creating and mapping a universe of technologies, adjacent concepts, and entities. We utilise various source data that contain a rich diversity and breadth of contemporary knowledge. The Cosmos 1.0 dataset comprises 23,544 technology-adjacent entities (TA23k) with a hierarchical structure and eight categories of external indices. Each entity is represented by a 100-dimensional contextual embedding vector, which we use to assign it to seven thematic tech-clusters (TC7) and three meta tech-clusters (TC3). We manually verify 100 emerging technologies (ET100). This dataset is enriched with additional indices specifically developed to assess the landscape of emerging technologies, including the Technology Awareness Index, Generality Index, Deeptech, and Age of Tech Index. The dataset incorporates extensive metadata sourced from Wikipedia and linked data from third-party sources such as Crunchbase, Google Books, OpenAlex and Google Scholar, which are used to validate the relevance and accuracy of the constructed indices.

## Full-text entities

- **Diseases:** t-SNE (MESH:D020243)
- **Chemicals:** graphene (MESH:D006108)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12630975/full.md

---
Source: https://tomesphere.com/paper/PMC12630975