Evolution of semantic networks in biomedical texts
Lucy R. Chai, Danielle S. Bassett

TL;DR
This study investigates the hierarchical structure of semantic networks in scientific texts, revealing how their organization evolves during manuscript revisions and varies with collaboration, reflecting a balance between complexity and efficiency.
Contribution
It demonstrates that semantic networks in scientific articles exhibit Rentian scaling and that the scaling exponent changes systematically during revisions and across collaborative contexts.
Findings
Semantic networks display clear Rentian scaling.
The Rent exponent varies across manuscript revisions.
Final Rent exponent negatively correlates with number of authors.
Abstract
Language is hierarchically organized: words are built into phrases, sentences, and paragraphs to represent complex ideas. Here we ask whether the organization of language in written text displays the fractal hierarchical architecture common in systems optimized for efficient information transmission. We test the hypothesis that the expositional structure of scientific research articles displays Rentian scaling, and that the exponent of the scaling law changes as the article's information transmission capacity changes. Using 32 scientific manuscripts - each containing between three and 26 iterations of revision - we construct semantic networks in which nodes represented unique words in each manuscript, and edges connect nodes if two words appeared within the same 5-word window. We show that these semantic networks display clear Rentian scaling, and that the Rent exponent varies over the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
