# Information Source Detection with Limited Time Knowledge

**Authors:** Xuecheng Liu, Luoyi Fu, Bo Jiang, Xiaojun Lin, Xinbing Wang

arXiv: 1905.12913 · 2019-06-02

## TL;DR

This paper proposes an efficient infection source detection method using limited timestamp data and network topology, providing provable guarantees and demonstrating superior performance on synthetic and real datasets.

## Contribution

It introduces a linear integer programming approach for source detection on trees with performance guarantees under limited timestamps, extending to general graphs.

## Key findings

- The estimator is always on a candidate path, improving efficiency.
- Performance guarantees show bounded error distance on regular trees.
- Experimental results confirm superior accuracy on synthetic and real data.

## Abstract

This paper investigates the problem of utilizing network topology and partial timestamps to detect the information source in a network. The problem incurs prohibitive cost under canonical maximum likelihood estimation (MLE) of the source due to the exponential number of possible infection paths. Our main idea of source detection, however, is to approximate the MLE by an alternative infection path based estimator, the essence of which is to identify the most likely infection path that is consistent with observed timestamps. The source node associated with that infection path is viewed as the estimated source $\hat{v}$. We first study the case of tree topology, where by transforming the infection path based estimator into a linear integer programming, we find a reduced search region that remarkably improves the time efficiency. Within this reduced search region, the estimator $\hat{v}$ is provably always on a path which we term as \emph{candidate path}. This notion enables us to analyze the distribution of $d(v^{\ast},\hat{v})$, the error distance between $\hat{v}$ and the true source $v^{\ast}$, on arbitrary tree, which allows us to obtain for the first time, in the literature provable performance guarantee of the estimator under limited timestamps. Specifically, on the infinite $g$-regular tree with uniform sampled timestamps, we get a refined performance guarantee in the sense of a constant bounded $d(v^{\ast},\hat{v})$. By virtue of time labeled BFS tree, the estimator still performs fairly well when extended to more general graphs. Experiments on both synthetic and real datasets further demonstrate the superior performance of our proposed algorithms.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.12913/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1905.12913/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/1905.12913/full.md

---
Source: https://tomesphere.com/paper/1905.12913