# Discovering bursts revisited: guaranteed optimization of the model   parameters

**Authors:** Nikolaj Tatti

arXiv: 1902.01727 · 2019-02-06

## TL;DR

This paper revisits burst detection in data mining, proposing an optimized likelihood-based parameter selection method and efficient approximation algorithms, leading to improved detection of previously missed bursts.

## Contribution

It introduces a likelihood optimization framework for burst detection parameters and provides efficient approximation schemes for practical implementation.

## Key findings

- Optimized parameter selection improves burst detection accuracy.
- Proposed algorithms efficiently approximate the optimal parameters.
- Empirical results show detection of previously missed bursts.

## Abstract

One of the classic data mining tasks is to discover bursts, time intervals, where events occur at abnormally high rate. In this paper we revisit Kleinberg's seminal work, where bursts are discovered by using exponential distribution with a varying rate parameter: the regions where it is more advantageous to set the rate higher are deemed bursty. The model depends on two parameters, the initial rate and the change rate. The initial rate, that is, the rate that is used when there are no burstiness was set to the average rate over the whole sequence. The change rate is provided by the user.   We argue that these choices are suboptimal: it leads to worse likelihood, and may lead to missing some existing bursts. We propose an alternative problem setting, where the model parameters are selected by optimizing the likelihood of the model. While this tweak is trivial from the problem definition point of view, this changes the optimization problem greatly. To solve the problem in practice, we propose efficient ($1 + \epsilon$) approximation schemes. Finally, we demonstrate empirically that with this setting we are able to discover bursts that would have otherwise be undetected.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.01727/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1902.01727/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/1902.01727/full.md

---
Source: https://tomesphere.com/paper/1902.01727