Approaches to the classification of complex systems: Words, texts, and more
Andrij Rovenchak

TL;DR
This paper explores the classification of complex systems like texts and genomes using statistical physics analogies, introducing parameters such as entropy, temperature, and chemical potential to analyze their structure and differences.
Contribution
It develops novel methods to classify complex systems by applying physical analogies to linguistic and genomic data, expanding the toolkit for analyzing such systems.
Findings
Parameters like entropy and temperature can classify texts and genomes.
Linguistic analogies enable the application of statistical physics concepts.
Entropy's role as a discriminating parameter is context-dependent.
Abstract
The Chapter starts with introductory information about quantitative linguistics notions, like rank--frequency dependence, Zipf's law, frequency spectra, etc. Similarities in distributions of words in texts with level occupation in quantum ensembles hint at a superficial analogy with statistical physics. This enables one to define various parameters for texts based on this physical analogy, including "temperature", "chemical potential", entropy, and some others. Such parameters provide a set of variables to classify texts serving as an example of complex systems. Moreover, texts are perhaps the easiest complex systems to collect and analyze. Similar approaches can be developed to study, for instance, genomes due to well-known linguistic analogies. We consider a couple of approaches to define nucleotide sequences in mitochondrial DNAs and viral RNAs and demonstrate their possible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Machine Learning in Bioinformatics
MethodsGumbel Softmax · Differentiable Neural Architecture Search
