Comparison of the language networks from literature and blogs
Sabina \v{S}i\v{s}ovi\'c, Sanda Martin\v{c}i\'c-Ip\v{s}i\'c, Ana, Me\v{s}trovi\'c

TL;DR
This study compares linguistic co-occurrence networks from literature and blogs, revealing structural differences at both global and local levels using network analysis metrics.
Contribution
It introduces a comparative analysis of linguistic networks from different text genres, highlighting structural differences based on network metrics and selectivity.
Findings
Differences in network density and clustering between literature and blogs.
Distinct in-degree and out-degree distributions for the two text types.
Selectivity analysis reveals genre-specific network structures.
Abstract
In this paper we present the comparison of the linguistic networks from literature and blog texts. The linguistic networks are constructed from texts as directed and weighted co-occurrence networks of words. Words are nodes and links are established between two nodes if they are directly co-occurring within the sentence. The comparison of the networks structure is performed at global level (network) in terms of: average node degree, average shortest path length, diameter, clustering coefficient, density and number of components. Furthermore, we perform analysis on the local level (node) by comparing the rank plots of in and out degree, strength and selectivity. The selectivity-based results point out that there are differences between the structure of the networks constructed from literature and blogs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
