Don't Hash Me Like That: Exposing and Mitigating Hash-Induced Unfairness in Local Differential Privacy
Berkay Kemal Balioglu, Alireza Khodaie, Mehmet Emre Gursoy

TL;DR
This paper investigates how hash functions in local differential privacy protocols can cause unfairness among users and proposes a new method, F-OLH, to mitigate this issue effectively.
Contribution
It reveals the unfairness caused by hash function choices in LDP and introduces F-OLH, a novel approach to enforce fairness in hash function selection.
Findings
Hash functions can cause significant disparities in vulnerability to attacks.
F-OLH effectively reduces hash-induced unfairness.
F-OLH maintains acceptable computational overheads.
Abstract
Local differential privacy (LDP) has become a widely accepted framework for privacy-preserving data collection. In LDP, many protocols rely on hash functions to implement user-side encoding and perturbation. However, the security and privacy implications of hash function selection have not been previously investigated. In this paper, we expose that the hash functions may act as a source of unfairness in LDP protocols. We show that although users operate under the same protocol and privacy budget, differences in hash functions can lead to significant disparities in vulnerability to inference and poisoning attacks. To mitigate hash-induced unfairness, we propose Fair-OLH (F-OLH), a variant of OLH that enforces an entropy-based fairness constraint on hash function selection. Experiments show that F-OLH is effective in mitigating hash-induced unfairness under acceptable time overheads.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy, Security, and Data Protection · Internet Traffic Analysis and Secure E-voting · Privacy-Preserving Technologies in Data
