Cosmos 1.0: a multidimensional map of the emerging technology frontier
Xian Gong, Paul X. McCarthy, Colin Griffith, Claire McFarland, Marian-Andrei Rizoiu

TL;DR
This paper presents Cosmos 1.0, a multidimensional map of emerging technologies built on a dataset of 23,544 technology-adjacent entities, with metadata, embeddings, and indices for analyzing the technology landscape and identifying emerging trends.
Contribution
Introduces Cosmos 1.0, a novel dataset and methodology for mapping and analyzing the technology frontier using diverse data sources and advanced embedding techniques.
Findings
Dataset includes 23,544 technology-adjacent entities with hierarchical structure.
Manual verification of 100 emerging technologies confirms dataset relevance.
Developed indices to assess technology awareness, generality, and maturity.
Abstract
This paper introduces the Cosmos 1.0 dataset and describes a novel methodology for creating and mapping a universe of technologies, adjacent concepts, and entities. We utilise various source data that contain a rich diversity and breadth of contemporary knowledge. The Cosmos 1.0 dataset comprises 23,544 technology-adjacent entities (TA23k) with a hierarchical structure and eight categories of external indices. Each entity is represented by a 100-dimensional contextual embedding vector, which we use to assign it to seven thematic tech-clusters (TC7) and three meta tech-clusters (TC3). We manually verify 100 emerging technologies (ET100). This dataset is enriched with additional indices specifically developed to assess the landscape of emerging technologies, including the Technology Awareness Index, Generality Index, Deeptech, and Age of Tech Index. The dataset incorporates extensive…
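To make the embedding-to-cluster step in the abstract concrete, here is a minimal sketch of grouping 100-dimensional vectors into seven clusters. The abstract does not say which clustering algorithm the authors use, so plain k-means is an assumption here, and the vectors are random stand-ins, not Cosmos 1.0 data. The names `nearest`, `kmeans`, and `embeddings` are hypothetical.

```python
import random

def nearest(vec, centroids):
    """Return the index of the centroid closest to vec (squared Euclidean distance)."""
    dists = [sum((a - b) ** 2 for a, b in zip(vec, c)) for c in centroids]
    return dists.index(min(dists))

def kmeans(vectors, k, iters=20, seed=0):
    """Plain Lloyd's k-means (an assumed method, not the paper's); returns (labels, centroids)."""
    rng = random.Random(seed)
    # Initialise centroids from k distinct input vectors.
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assignment step: each vector goes to its nearest centroid.
        labels = [nearest(v, centroids) for v in vectors]
        # Update step: each centroid moves to the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties out
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

# Synthetic stand-in for entity embeddings: 70 random 100-dimensional vectors.
rng = random.Random(42)
DIM, K = 100, 7  # dimensions and cluster count taken from the abstract
embeddings = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(70)]
labels, centroids = kmeans(embeddings, K)
```

Each entry of `labels` is a cluster index from 0 to 6, playing the role of a TC7 assignment. With real embeddings, a library implementation and a principled choice of cluster count would likely be preferable to this hand-rolled loop.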
Taxonomy
Topics: Big Data and Digital Economy · University-Industry-Government Innovation Models · Intellectual Property and Patents
