Networks of motifs from sequences of symbols

Roberta Sinatra; Daniele Condorelli; Vito Latora

arXiv:1002.0668·q-bio.MN·October 27, 2010

Networks of motifs from sequences of symbols

Roberta Sinatra, Daniele Condorelli, Vito Latora

PDF

TL;DR

This paper presents a novel method to transform sequences of symbols into weighted directed networks of motifs, enabling analysis of complex data across biological, social, and dynamical systems.

Contribution

The paper introduces a new network-based approach to analyze symbol sequences by identifying motifs and their significant co-occurrences, facilitating diverse applications.

Findings

01

Networks of motifs can correlate sequences with biological functions.

02

Method detects hot topics in social dialogs.

03

Characterizes trajectories of dynamical systems.

Abstract

We introduce a method to convert an ensemble of sequences of symbols into a weighted directed network whose nodes are motifs, while the directed links and their weights are defined from statistically significant co-occurences of two motifs in the same sequence. The analysis of communities of networks of motifs is shown to be able to correlate sequences with functions in the human proteome database, to detect hot topics from online social dialogs, to characterize trajectories of dynamical systems, and might find other useful applications to process large amount of data in various fields.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.