# Mapping the evolution of cross-Strait relations via global news big data (2014–2023): An analysis integrating GDELT and machine learning

**Authors:** Shengjie Shi, Belen Chen, Ziyi Guang, Derong Kong

PMC · DOI: 10.1371/journal.pone.0342755 · PLOS One · 2026-02-13

## TL;DR

This study uses global news data and machine learning to analyze how cross-Strait relations have been portrayed in media from 2014 to 2023.

## Contribution

The study introduces a novel integration of GDELT data and machine learning to examine media framing of cross-Strait relations.

## Key findings

- Media coverage of cross-Strait relations has shifted toward negative and conflict-focused narratives.
- Western and Taiwanese media dominate agenda-setting, while Mainland Chinese outlets are less visible.
- Topic modeling reveals a thematic evolution from low-politics to high-politics narratives emphasizing conflict and rivalry.

## Abstract

This study investigates global news media representations of cross-Strait relations from 2014 to 2023 using the GDELT database, framed within the mediatization of politics. Combining large-scale event data and computational text analysis, it offers a multi-level analysis of representational patterns, structural inequalities, and framing dynamics. Six indicators track longitudinal trends including four continuous indices and two event type distributions, show attention spikes during major political events and a discursive shift toward negative, conflict-focused coverage, even as low-intensity communicative events dominate. Structurally, a source-domain analysis of 50 high-impact outlets employs a six-dimensional Deviation Index to evaluate differences in visibility and event production preferences. Results reveal a concentrated discourse, with Western and Taiwanese media occupying more central positions in agenda-setting processes, with Mainland Chinese outlets appearing comparatively less visible. Textually, topic modeling uncovers five key frames, reflecting a discursive thematic evolution from event-driven, low-politics coverage to high-politics narratives emphasizing conflicts, ideological divides, and great-power rivalry. Complementary sentiment analysis of news headlines using large language model-based tools indicates the persistent dominance of negative actor-sentiment framing in global coverage. Overall, the study underscores an increasingly securitized and asymmetrical pattern of global media representation of cross-Strait relations. Theoretically, the study extends the applicability of mediatization of politics framework to the analysis of cross-Strait communication. Methodologically, it illustrates integrating large-scale event data with machine-learning techniques to examine international news framing in a highly politicized geopolitical context.

## Full-text entities

- **Diseases:** LLM (MESH:D007806)
- **Species:** Homo sapiens (human, species) [taxon 9606], Helianthus annuus (common sunflower, species) [taxon 4232]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12904412/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12904412/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12904412/full.md

---
Source: https://tomesphere.com/paper/PMC12904412