A divide-and-conquer approach for spatio-temporal analysis of large house price data from Greater London
Kapil Gupta, Soudeep Deb

TL;DR
This paper introduces a divide-and-conquer Gaussian process approach for efficient spatio-temporal analysis of large house price datasets, enabling detailed insights and accurate predictions in London’s real estate market.
Contribution
It presents a novel parallelizable methodology combining data partitioning and Wasserstein barycenters for scalable spatio-temporal modeling of large datasets.
Findings
Identified key amenities influencing house prices.
Revealed trend patterns and price-emission relationships.
Achieved high predictive accuracy with computational efficiency.
Abstract
Statistical research in real estate markets, particularly in understanding the spatio-temporal dynamics of house prices, has garnered significant attention in recent times. Although Bayesian methods are common in spatio-temporal modeling, standard Markov chain Monte Carlo (MCMC) techniques are usually slow for large datasets such as house price data. To tackle this problem, we propose a divide-and-conquer spatio-temporal modeling approach. This method involves partitioning the data into multiple subsets and applying an appropriate Gaussian process model to each subset in parallel. The results from each subset are then combined using the Wasserstein barycenter technique to obtain the global parameters for the original problem. The proposed methodology allows for multiple observations per spatial and time unit, thereby offering added benefits for practitioners. As a real-life application,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
