TL;DR
This paper introduces extended-alphabet finite-context models (xaFCMs), a generalization of FCMs that predicts multiple symbols simultaneously, improving efficiency and accuracy in applications like DNA analysis and ECG biometric identification.
Contribution
The paper presents xaFCMs, a novel extension of FCMs that models multiple symbols at once, reducing memory and computation while maintaining or improving accuracy.
Findings
xaFCMs require less memory and computational time.
xaFCMs achieve equal or better accuracy than traditional FCMs.
Successful application in DNA sequence comparison and ECG biometric identification.
Abstract
The Normalized Relative Compression (NRC) is a recent dissimilarity measure, related to the Kolmogorov Complexity. It has been successfully used in different applications, like DNA sequences, images or even ECG (electrocardiographic) signal. It uses a compressor that compresses a target string using exclusively the information contained in a reference string. One possible approach is to use finite-context models (FCMs) to represent the strings. A finite-context model calculates the probability distribution of the next symbol, given the previous symbols. In this paper, we introduce a generalization of the FCMs, called extended-alphabet finite-context models (xaFCM), that calculates the probability of occurrence of the next symbols, given the previous symbols. We perform experiments on two different sample applications using the xaFCMs and the NRC measure: ECG biometric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
