# A Framework for Understanding Sources of Harm throughout the Machine   Learning Life Cycle

**Authors:** Harini Suresh, John V. Guttag

arXiv: 1901.10002 · 2021-12-03

## TL;DR

This paper presents a comprehensive framework identifying seven sources of harm in the machine learning life cycle to improve understanding, communication, and mitigation of potential negative societal impacts.

## Contribution

It introduces a novel framework that systematically categorizes sources of harm across data collection, development, and deployment stages of ML.

## Key findings

- Identifies seven distinct sources of downstream harm in ML.
- Facilitates better communication about harm sources.
- Supports targeted mitigation strategies.

## Abstract

As machine learning (ML) increasingly affects people and society, awareness of its potential unwanted consequences has also grown. To anticipate, prevent, and mitigate undesirable downstream consequences, it is critical that we understand when and how harm might be introduced throughout the ML life cycle. In this paper, we provide a framework that identifies seven distinct potential sources of downstream harm in machine learning, spanning data collection, development, and deployment. In doing so, we aim to facilitate more productive and precise communication around these issues, as well as more direct, application-grounded ways to mitigate them.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.10002/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1901.10002/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/1901.10002/full.md

---
Source: https://tomesphere.com/paper/1901.10002