Dataset on global trade networks of COVID-19 medical products
Marcell T. Kurbucz, Tibor Keresztély, Szabolcs Szikszai, András Sugár, Zsuzsanna Banász

TL;DR
This paper introduces a dataset tracking global trade of COVID-19 medical products in 2019 and 2020, structured as a multilevel network for analysis.
Contribution
The novel contribution is a multilevel network dataset of global trade in eight categories of medical products related to the pandemic.
Findings
The dataset includes trade flows for 224 countries across eight product categories.
Trade dynamics are represented as a multilevel network with product categories as layers.
An R script is provided to convert the dataset into a format for network analysis and visualization.
Abstract
This paper presents a comprehensive dataset on the global trade dynamics of COVID-19-related medical products for the years 2019 and 2020. The dataset, derived from the BACI database, focuses on eight distinct product categories identified by six-digit codes. The trade flow data for 224 countries is structured as a multilevel network, with countries as nodes and product categories as layers. Directed edges represent trading activities, and edge weights are determined by the difference in exported values between 2019 and 2020. The dataset is provided in an edges-and-nodes format. Additionally, the associated R script transforms the data into the MuxViz R package format, facilitating further analysis and visualization of the dataset. The dataset is valuable for researchers in the field of foreign trade or medical products, and for decision-makers in these fields, whether at company or…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Economic and Technological Innovation · Complex Network Analysis Techniques
Specifications TableSubjectEconomics, Econometrics and Finance, Data Science, Social SciencesSpecific subject areaEconomics, Econometrics, Data Mining and Statistical Analysis, HealthType of dataRaw data.Comma-separated values files (csv) and an R script (R). Additional files generated after running the R script include edge files (edges) and space- and semicolon-separated files (txt).Data collectionWe obtained international trading data for 2019 and 2020 from the BACI database [1,2]. Utilizing this data, we identified COVID-19-related medical products and their eight categories based on information from the World Integrated Trade Solution's webpage [3]. Subsequently, we calculated the differences in traded values for each product over the investigated two years. Finally, we employed an R script to transform the acquired network into a MuxViz-compatible [4,5] multilevel network, with each layer representing one of the eight investigated product categories.Data source locationTrading data were collected from the BACI database [1,2]. COVID-19-related medical products and their eight categories were identified based on information from the World Integrated Trade Solution's webpage [3].Data accessibilityRepository name: Mendeley DataData identification number: 10.17632/b7svp82sdjDirect URL to data: https://data.mendeley.com/datasets/b7svp82sdjInstructions for accessing these data: Freely available on Mendeley Data [6].Related research articleM.T. Kurbucz, A. Sugár, T. Keresztély, Analysis of the international trade networks of COVID-19 medical products. Appl. Netw. Sci. 8 (2023) 58. https://doi.org/10.1007/s41109-023-00586-z [7]
Value of the Data
1
- •This dataset provides a comprehensive exploration of global trade dynamics in COVID-19-related medical products for 2019 and 2020, offering valuable insights into the evolution of international connections during this crucial period.
- •Focused on eight specific medical product categories identified by six-digit codes, the dataset enables a targeted analysis, providing insights into trade dynamics within distinct sectors.
- •With a unique multilevel network structure portraying countries as nodes and product categories (as layers), the dataset visually captures the interconnectedness in trade relationships (as edges), enhancing the understanding of global connections.
- •Presented in an edges-and-nodes format (where the nodes are the countries and the edges are the trade between them) with an accompanying R script, the dataset ensures accessibility and facilitates easy transformation into the MuxViz R package, that handles multilevel network format, empowering researchers for in-depth analysis and visualization.
- •The dataset is useful for researchers and decision-makers in foreign trade and medical products, at both company and national levels.
- •It offers versatile applicability and the MuxViz format, which operates through a user-friendly graphical user interface (GUI).
Background
2
Motivated by the unforeseen challenges posed by the COVID-19 pandemic [8], this dataset was created to explore the complexities of global trade networks for medical products during this period. By the end of April 2020, more than 80 countries had introduced export restrictions on critical medical products [9]. The economic and geopolitical impact of COVID-19 and related restrictions on world trade has been examined in a number of studies (see, e.g., [7,10,11]). The aim of this dataset is to deepen our knowledge in this domain by providing a comprehensive dataset in both a simple network and multilevel structure.
Data Description
3
Single-layer network format
3.1
The dataset consists of two comma-separated values (csv) files, edges.csv and nodes.csv, which store the edge and node information for a single-layer trading network, respectively. In this network, the nodes represent the 224 countries included in the dataset, and the edges represent the differences in exported values from 2019 to 2020, expressed in thousands of USD. The nodes.csv file includes the node ID, the three-letter country codes, and the corresponding geolocations. The edges.csv file contains the exporter and importer node IDs, the eight product categories of COVID-19 medical products, as well as the differences in exported values. The variables of both files are detailed in Table 1.Table 1. Single-layer network format.Table 1. VariableDescriptionTypeSourcenodeIDNode (country) ID.Integernodes.csv**nodeLabelThree-letter country codes (ISO 3166-1 alpha-3).Stringnodes.csv**nodeLatLatitude of the node (country).Floatnodes.csv**nodeLongLongitude of the node (country).Floatnodes.csv**prodCatProduct categories. A: Medical test kits; B: Disinfectants and sterilization products; C: Other medical consumables; D: Other medical devices and equipment; E: Other medical-related goods; F: Oxygen therapy equipment and pulse oximeters; G: Protective garments; H: Vehicles. Further details can be found in the Data Collection section.Stringedges.csv**eNodeIDExporter node (country) ID.Integeredges.csv**iNodeIDImporter node (country) ID.Integeredges.csv**expValDiffDifferences in exported values from 2019 to 2020 in thousand USD.Floatedges.csv
Table 2 summarizes the exported value differences (expValDiff) in each product category (prodCat) of the dataset.Table 2. Descriptive statistics of export value differences by product categories.Table 2prodCatNumber of EdgesMinQ1MedianMeanQ3MaxStDevA11,127-3.99E+06-8.71E+005.95E-012.30E+038.11E+012.15E+065.95E+04B8,755-5.65E+06-9.56E+013.61E+002.73E+035.00E+021.73E+068.16E+04C9,989-2.33E+05-7.05E+01-1.05E-013.36E+025.24E+018.27E+051.21E+04D9,875-6.32E+05-5.89E+011.01E-013.24E+028.39E+011.08E+061.88E+04E8,016-3.16E+05-3.31E+013.75E-022.90E+024.21E+016.80E+051.09E+04F8,289-2.13E+05-2.24E+011.40E+008.95E+021.12E+025.29E+051.48E+04G9,390-5.87E+04-3.33E+003.72E+009.48E+031.71E+021.81E+072.36E+05H4,255-1.56E+05-1.85E+02-2.72E+00-4.32E+024.05E+018.24E+045.71E+03
According to Table 2, we can classify medical products into three categories based on their exported value growth. The traded value of protective garments (G), medical test kits (A), and disinfectants and sterilization products (B) increased the most. The average increase in these categories was between 2,303 and 9,482 thousand USD. This is followed by categories C to F, in which the average increase was between approximately 290 and 895 thousand USD. Finally, the international trade of vehicles (H) decreased on average during the investigated time period by 432 thousand USD. The most substantial growth observed in the entire dataset occurred within the protective garments (G) category, between China and the United States. In this category, China escalated its export value to the United States by a staggering 18,072,670 thousand USD. A detailed analysis of the dataset can be found in Kurbucz et al. (2023) [7].
Multilevel network format
3.2
After running the genMux.R script, a new folder named muxNet will be generated with the following directory structure (see Fig. 1).Fig. 1. Directory structure.Fig. 1
Files contained in the muxNet folder do not have headers. Files 1.edges, 2.edges, ..., 8.edges contain the trade network data for the eight product categories (A-H). Each file includes eNodeID, iNodeID, and expValDiff columns, respectively (see Table 1). The configuration file (config.txt) consists of eight rows, each denoting the complete path of an edge file, the title of the edge file (A-H), and the complete path of the layout.txt file. The layout file (layout.txt) includes nodeID, nodeLabel, nodeLat, and nodeLong columns, respectively (see Table 1).
Experimental Design, Materials and Methods
4
Data collection
4.1
Data regarding the trade of COVID-19-related medical products is sourced from the BACI database [1,2].1 BACI annually provides bilateral trade flow data for 200 countries at the product level. Aligning with Kurbucz [12], our focus is specifically on COVID-19-related medical products identified by six-digit codes (HS-6). These are further categorized into eight distinct product categories as follows [3]:
- A:Medical test kits (HS-6: 300215, 382100, 382200, 902780);
- B:Disinfectants and sterilization products (HS-6: 220710, 220890, 284700, 300490, 380894, 841920);
- C:Other medical consumables (HS-6: 280440, 300510, 300590, 300670, 340111, 340120, 392329, 392690, 481890, 901831, 901832);
- D:Other medical devices and equipment (HS-6: 732490, 841319, 901811, 901812, 901890, 902212, 902519, 902780, 902820);
- E:Other medical-related goods (HS-6: 731100, 761300, 842139, 940290);
- F:Oxygen therapy equipment and pulse oximeters (HS-6: 901819, 901839, 901920, 902680);
- G:Protective garments (HS-6: 392620, 401511, 401519, 401590, 481850, 611610, 621010, 621050, 621600, 630790, 650500, 900490, 902000);
- H:Vehicles (HS-6: 870590, 871310, 871390).
Aggregate values for exported medical products are grouped by the abovementioned product categories (A-H) for the years 2019 and 2020. For each product category, the total exported values from country to country in 2019 and 2020 are denoted as and , respectively.
Data processing
4.2
The trading data is represented as a multilevel network (see, e.g., [13]). Multilevel networks include multiple layers that can contain a subset of all available nodes and edges. In our case, the eight product categories form eight layers, nodes are the countries, and directed edges represent their trading activities (from the exporting country to the importing country) in the given product category. The weight of the edges within a product category is determined based on the difference between the values exported in 2020 and 2019 as follows:
where is the aggregated exported value from country to in 2020, measured in thousand United States dollars (USD), .
Formally, the trading data is represented by a graph (G) which is a tuple defined by the sets of nodes ( ), edges ( ), and eight layers ( ) as follows:
where . The applied multilevel data structure is illustrated in Fig. 2.Fig. 2. Applied multilevel data structure (Source: Kurbucz et al. [7]).Fig. 2(Remarks: : node , : edge from node to . Each edge has a weight.)
Note that the directed version of strength (i.e., weighted degree) centrality [14,15] can be employed to assess the magnitude of the change in the total imported and exported values of products for each country between 2019 and 2020, as follows:2
where and and individually measure the in-degree and out-degree centrality of the node , respectively.
Generating multilevel format
4.3
This dataset contains an R script, named genMux.R, that transforms the source edges.csv and nodes.csv into a MuxViz [4,5] R package-compatible dataset. During the run, the R script must be in the same folder as the aforementioned two files. This script is operating system-agnostic, and if it is missing, it will automatically install its only dependency, the rstudioapi R package.
Illustrative Example
4.4
MuxViz [4,5] offers a widespread set of indicators and visualization tools for in-depth analysis of the trading network for COVID-19 medical products. Fig. 3 exemplifies the visualization of strength centralities in MuxViz, presented for each layer of the multilayer network separately.3Fig. 3Growth in the import and export of various product categories (A-H), 2019-2020 (Source: Kurbucz et al. [7]).Fig. 3(Remarks: node color: , from white to red, node size: , from small to big.)
Fig. 3 uses color coding to represent the change in imports of the eight product categories between 2019 and 2020 ( ). White countries indicate a decrease or small increase in imports, while red indicates a relatively large increase. For product categories A to G, more countries appear red than white, suggesting a general rise in imports. Product category G (protective garments) shows the most significant increase, with only a few countries not experiencing a substantial import rise. In contrast, the trade network for healthcare-related vehicles (product category H) suggests a global decline in net exports.
Focusing on the size of the country markers, the change in exports of countries in the given product category from 2019 to 2020 is shown ( ). Smaller markers indicate a decrease or a relatively smaller increase in exports, while larger markers indicate a larger increase. Across all product categories, Europe and Asia have the largest markers, indicating that these regions exported more products than other continents. Regardless of product category, only a few countries show exceptionally high export values compared to other countries. Among these countries, China showed the greatest increase in net exports for all product categories except B and H.
Limitations
Not applicable.
Ethics Statement
The authors have read and follow the ethical requirements for publication in Data in Brief and confirming that the current work does not involve human subjects, animal experiments, or any data collected from social media platforms*.*
Credit Author Statement
Marcell T. Kurbucz, Tibor Keresztély, Szabolcs Szikszai, András Sugár and Zsuzsanna Banász conceptualized the work and contributed to the writing and editing of the manuscript, as well as data visualization. Marcell T. Kurbucz wrote the software and supervised the research.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1G. Gaulier, S. Zignago, BACI: International Trade Database at the Product-Level. The 1994-2007 Version. Working Papers 2010-23, CEPII, 2010. http://www.cepii.fr/CEPII/fr/publications/wp/abstract.asp?No Doc=2726.
- 2CEPII, BACI data. http://www.cepii.fr/CEPII/en/bdd_modele/bdd_modele_item.asp?id=37, 2023 (accessed 29 January 2024).
- 3World Integrated Trade Solution (WITS), Trade Statistics Medical Product related to COVID-19 (HS 6-digit). https://wits.worldbank.org/trade/covid-19-medical-products.aspx, (accessed 29 January 2024).
- 4De Domenico M.Porter M.A.Arenas A.Mux Viz: a tool for multilayer analysis and visualization of networks J. Complex Netw.32201515917610.1093/comnet/cnu 038 · doi ↗
- 5De Domenico M.Multilayer Networks: Analysis and Visualization-Introduction to mux Viz with Rfirst ed.2022 Springer Cham Switzerland 10.1007/978-3-030-75718-2 · doi ↗
- 6M.T. Kurbucz, Global Trading Network of COVID-19 Medical Products Between 2019 and 2020, Mendeley Data V 1. 10.17632/b 7svp 82sdj, https://data.mendeley.com/datasets/b 7svp 82sdj 2024 (accessed 04 February 2024). · doi ↗
- 7Kurbucz M.T.Sugár A.Keresztély T.Analysis of the international trade networks of COVID-19 medical products Appl. Netw. Sci.8120235810.1007/s 41109-023-00586-z · doi ↗
- 8Grassia M.Mangioni G.Schiavo S.Traverso S.Unintended) Consequences of export restrictions on medical goods during the Covid-19 pandemic J. Complex Netw.1012022 cnab 04510.1093/comnet/cnab 045 · doi ↗
