BioMedJImpact: A Comprehensive Dataset and LLM Pipeline for AI Engagement and Scientific Impact Analysis of Biomedical Journals
Ruiyu Wang, Yuzhang Xie, Xiao Hu, Carl Yang, Jiaying Lu

TL;DR
BioMedJImpact introduces a large-scale biomedical dataset with an innovative LLM pipeline to analyze how collaboration and AI engagement influence journal impact, validated through human evaluation and applicable for scientometric studies.
Contribution
The paper presents a comprehensive biomedical dataset and a novel three-stage LLM pipeline for measuring AI engagement, enabling scalable impact analysis.
Findings
Higher collaboration correlates with greater citation impact.
AI engagement increasingly influences journal prestige.
Validated LLM pipeline accurately detects AI relevance.
Abstract
Assessing journal impact is central to scholarly communication, yet existing open resources rarely capture how collaboration structures and artificial intelligence (AI) research jointly shape venue prestige in biomedicine. We present BioMedJImpact, a large-scale, biomedical-oriented dataset designed to advance journal-level analysis of scientific impact and AI engagement. Built from 1.74 million PubMed Central articles across 2,744 journals, BioMedJImpact integrates bibliometric indicators, collaboration features, and LLM-derived semantic indicators for AI engagement. Specifically, the AI engagement feature is extracted through a reproducible three-stage LLM pipeline that we propose. Using this dataset, we analyze how collaboration intensity and AI engagement jointly influence scientific impact across pre- and post-pandemic periods (2016-2019, 2020-2023). Two consistent trends emerge:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · scientometrics and bibliometrics research · Academic Publishing and Open Access
