MobilityMirror: Bias-Adjusted Transportation Datasets
Luke Rodriguez, Babak Salimi, Haoyue Ping, Julia Stoyanovich, Bill, Howe

TL;DR
This paper presents a method for creating bias-adjusted synthetic transportation datasets that preserve utility while removing discriminatory causal relationships, facilitating privacy protection and fair data sharing.
Contribution
It introduces a causality-based algorithm for bias removal in origin-destination data, enabling privacy-preserving and equitable transportation data publication.
Findings
The adjusted datasets retain utility comparable to original data.
Biases related to discrimination and contractual constraints are effectively removed.
The method is validated on real bike share and taxi datasets from Seattle and New York.
Abstract
We describe customized synthetic datasets for publishing mobility data. Private companies are providing new transportation modalities, and their data is of high value for integrative transportation research, policy enforcement, and public accountability. However, these companies are disincentivized from sharing data not only to protect the privacy of individuals (drivers and/or passengers), but also to protect their own competitive advantage. Moreover, demographic biases arising from how the services are delivered may be amplified if released data is used in other contexts. We describe a model and algorithm for releasing origin-destination histograms that removes selected biases in the data using causality-based methods. We compute the origin-destination histogram of the original dataset then adjust the counts to remove undesirable causal relationships that can lead to discrimination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
