A Generalized Hamming Distance of Sequence Patterns
Pengyu Liu, Jingzhou Na

TL;DR
This paper introduces a generalized Hamming distance for sequence patterns, considering equivalence classes under symbol relabeling, and provides methods to compute maximal and pairwise distances.
Contribution
It defines a new distance measure for sequence patterns based on equivalence classes and offers formulas for maximal and pairwise distances.
Findings
Derived the maximal distance for sets of sequence patterns.
Provided exact calculation methods for the distance between two sequence patterns.
Extended Hamming distance to equivalence classes of sequences.
Abstract
We define sequence patterns of length and level to be equivalence classes of sequences that have elements from the set of integer symbols with no restriction on repetition, where the equivalence relation is induced by symbol relabeling without swapping positions of symbols. We define a distance for a set of sequence patterns of length and level by generalizing the Hamming distance between sequences. We compute the maximal distance for sequence patterns of length and level and demonstrate how to calculate the exact distance between a pair of length- level- sequence patterns.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Algorithms and Data Compression · graph theory and CDMA systems
