# Don't Look at the Data! How Differential Privacy Reconfigures the Practices of Data Science

**Authors:** Jayshree Sarathy, Sophia Song, Audrey Haque, Tania Schlatter, Salil Vadhan

arXiv: 2302.11775 · 2023-02-24

## TL;DR

Through interviews with 19 data practitioners using a differential privacy (DP) analysis prototype, this paper finds that DP can widen access to sensitive datasets but introduces challenges at every stage of the data science workflow and raises ethics and governance questions.

## Contribution

The paper provides empirical insight into how data practitioners who are not experts in differential privacy perceive and use it, and offers suggestions for better integrating it into data science workflows.

## Key findings

- DP is promising for providing wider access to sensitive datasets
- DP introduces challenges into every stage of the data science workflow
- Ethics and governance questions arise when socializing data scientists around new privacy constraints

## Abstract

Across academia, government, and industry, data stewards are facing increasing pressure to make datasets more openly accessible for researchers while also protecting the privacy of data subjects. Differential privacy (DP) is one promising way to offer privacy along with open access, but further inquiry is needed into the tensions between DP and data science. In this study, we conduct interviews with 19 data practitioners who are non-experts in DP as they use a DP data analysis prototype to release privacy-preserving statistics about sensitive data, in order to understand perceptions, challenges, and opportunities around using DP. We find that while DP is promising for providing wider access to sensitive datasets, it also introduces challenges into every stage of the data science workflow. We identify ethics and governance questions that arise when socializing data scientists around new privacy constraints and offer suggestions to better integrate DP and data science.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2302.11775/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/2302.11775/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/2302.11775/full.md

---
Source: https://tomesphere.com/paper/2302.11775