Derivative of a hypergraph as a tool for linguistic pattern analysis
Angeles Criado-Alonso, David Aleja, Miguel Romance, Regino Criado

TL;DR
This paper introduces a novel hypergraph derivative model for analyzing linguistic patterns, enabling detection of author style, language level, and text similarity, with applications in stylometry and forensic linguistics.
Contribution
It presents a new mathematical framework using hypergraphs and derivatives to analyze mesoscopic text relationships, advancing linguistic pattern analysis methods.
Findings
Effective in identifying author style
Assists in language level assessment
Potential for plagiarism detection
Abstract
The search for linguistic patterns, stylometry and forensic linguistics have in the theory of complex networks, their structures and associated mathematical tools, allies with which to model and analyze texts. In this paper we present a new model supported by several mathematical structures such as the hypergraphs or the concept of derivative graph to introduce a new methodology able to analyze the mesoscopic relationships between sentences, paragraphs, chapters and texts, focusing not only in a quantitative index but also in a new mathematical structure that will be of singular help to both: detecting the style of an author and determining the language level of a text. In addition, these new mathematical structures may be useful to detect similarity and dissimilarity in texts and, eventually, even plagiarism.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling
