# High-resolution home location prediction from tweets using deep learning   with dynamic structure

**Authors:** Meysam Ghaffari, Ashok Srinivasan, Xiuwen Liu

arXiv: 1902.03111 · 2019-07-09

## TL;DR

This paper presents a deep learning approach with a dynamic two-phase structure to accurately predict high-resolution home locations from social media tweets, significantly outperforming previous methods in accuracy and error reduction.

## Contribution

The paper introduces a novel two-phase deep learning framework combining random forests and neural networks for high-resolution home location prediction from social media data.

## Key findings

- Achieved over 90% accuracy on a large dataset.
- Reduced high-resolution prediction error from over 21% to less than 8%.
- Outperformed existing high-resolution methods in accuracy.

## Abstract

Timely and high-resolution estimates of the home locations of a sufficiently large subset of the population are critical for effective disaster response and public health intervention, but this is still an open problem. Conventional data sources, such as census and surveys, have a substantial time lag and cannot capture seasonal trends. Recently, social media data has been exploited to address this problem by leveraging its large user-base and real-time nature. However, inherent sparsity and noise, along with large estimation uncertainty in home locations, have limited their effectiveness. Consequently, much of previous research has aimed only at a coarse spatial resolution, with accuracy being limited for high-resolution methods. In this paper, we develop a deep-learning solution that uses a two-phase dynamic structure to deal with sparse and noisy social media data. In the first phase, high recall is achieved using a random forest, producing more balanced home location candidates. Then two deep neural networks are used to detect home locations with high accuracy. We obtained over 90% accuracy for large subsets on a commonly used dataset. Compared to other high-resolution methods, our approach yields up to 60% error reduction by reducing high-resolution home prediction error from over 21% to less than 8%. Systematic comparisons show that our method gives the highest accuracy both for the entire sample and for subsets. Evaluation on a real-world public health problem further validates the effectiveness of our approach.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.03111/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1902.03111/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/1902.03111/full.md

---
Source: https://tomesphere.com/paper/1902.03111