# Statistical inference of probabilistic origin-destination demand using   day-to-day traffic data

**Authors:** Wei Ma, Zhen Qian

arXiv: 1901.10068 · 2024-12-20

## TL;DR

This paper introduces a novel statistical framework for estimating probabilistic origin-destination demand and network flow distributions using extensive day-to-day traffic data, accounting for various sources of variability.

## Contribution

It develops an iterative generalized least squares method with Lasso regularization to estimate O-D demand mean and covariance, improving interpretability and computational efficiency.

## Key findings

- Framework accurately estimates O-D demand mean and variance.
- Method converges quickly on multiple network scales.
- Lasso regularization balances bias and variance in covariance estimation.

## Abstract

Recent transportation network studies on uncertainty and reliability call for modeling the probabilistic O-D demand and probabilistic network flow. Making the best use of day-to-day traffic data collected over many years, this paper develops a novel theoretical framework for estimating the mean and variance/covariance matrix of O-D demand considering the day-to-day variation induced by travelers' independent route choices. It also estimates the probability distributions of link/path flow and their travel cost where the variance stems from three sources, O-D demand, route choice and unknown errors. The framework estimates O-D demand mean and variance/covariance matrix iteratively, also known as iterative generalized least squares (IGLS) in statistics. Lasso regularization is employed to obtain sparse covariance matrix for better interpretation and computational efficiency. Though the probabilistic O-D estimation (ODE) works with a much larger solution space than the deterministic ODE, we show that its estimator for O-D demand mean is no worse than the best possible estimator by an error that reduces with the increase in sample size. The probabilistic ODE is examined on two small networks and two real-world large-scale networks. The solution converges quickly under the IGLS framework. In all those experiments, the results of the probabilistic ODE are compelling, satisfactory and computationally plausible. Lasso regularization on the covariance matrix estimation leans to underestimate most of variance/covariance entries. A proper Lasso penalty ensures a good trade-off between bias and variance of the estimation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.10068/full.md

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/1901.10068/full.md

## References

100 references — full list in the complete paper: https://tomesphere.com/paper/1901.10068/full.md

---
Source: https://tomesphere.com/paper/1901.10068