# Using data-compressors for statistical analysis of problems on homogeneity testing and classification

**Authors:** Boris Ryabko, Andrey Guskov, Irina Selivanova

arXiv: 1701.04028 · 2017-01-17

## TL;DR

This paper shows how data compressors can be used to build methods of classical mathematical statistics for homogeneity testing and classification, placing compression-based text analysis within the framework of mathematical statistics.

## Contribution

It introduces an approach for applying data compressors, within the framework of mathematical statistics, to homogeneity testing and classification.

## Key findings

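- Data compressors can serve as the basis for statistical analysis, specifically homogeneity testing and classification.
- Several methods of classical mathematical statistics can be developed from applications of data compressors.
- The approach brings compressor-based text analysis, which is often developed outside mathematical statistics, into that framework.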
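
One common way to turn a compressor into a classifier, given here only as a rough sketch and not as the procedure defined in the paper, is to assign a sample to the class whose reference text makes it cheapest to encode. The example assumes Python's standard `zlib` module as the compressor; the helper names `compressed_len`, `extra_cost`, and `classify`, and the toy reference texts in `REFERENCES`, are hypothetical.

```python
import zlib
from typing import Mapping


def compressed_len(data: bytes) -> int:
    """Length in bytes of `data` after zlib compression at level 9."""
    return len(zlib.compress(data, 9))


def extra_cost(reference: bytes, sample: bytes) -> int:
    """Extra compressed bytes needed when `sample` is appended to `reference`."""
    return compressed_len(reference + sample) - compressed_len(reference)


def classify(sample: bytes, references: Mapping[str, bytes]) -> str:
    """Return the label whose reference text yields the smallest extra cost."""
    return min(references, key=lambda label: extra_cost(references[label], sample))


# Toy reference texts (assumed for illustration only, not data from the paper).
REFERENCES = {
    "letters": b"the quick brown fox jumps over the lazy dog " * 20,
    "digits": b"3141592653589793238462643383279502884197 " * 20,
}

if __name__ == "__main__":
    print(classify(b"the lazy dog jumps over the quick brown fox", REFERENCES))
    print(classify(b"2653589793238462643383", REFERENCES))
```

A real experiment would likely need much longer reference texts and a stronger compressor; with short inputs, header overhead and byte rounding can blur the differences between classes.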

## Abstract

Nowadays data compressors are applied to many problems of text analysis, but many such applications are developed outside of the framework of mathematical statistics. In this paper we overcome this obstacle and show how several methods of classical mathematical statistics can be developed based on applications of the data compressors.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1701.04028/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/1701.04028/full.md

---
Source: https://tomesphere.com/paper/1701.04028