The Identity Fragmentation Bias

Tesary Lin; Sanjog Misra

arXiv:2008.12849 · econ.EM · August 28, 2023 · 1 cites

Open Access

TL;DR

This paper investigates the bias caused by fragmented consumer identity data across multiple devices, revealing complex biases that can distort behavioral estimates and evaluating correction methods.

Contribution

It provides a formal framework to analyze identity fragmentation bias, showing it can cause unpredictable biases including upward bias and sign reversals.

Findings

01. Bias can be unbounded and unpredictable

02. Standard correction methods have varying effectiveness

03. Experimental settings can also exhibit bias reversals

Abstract

Consumers interact with firms across multiple devices, browsers, and machines; these interactions are often recorded with different identifiers for the same consumer. The failure to correctly match different identities leads to a fragmented view of exposures and behaviors. This paper studies the identity fragmentation bias, referring to the estimation bias resulting from using fragmented data. Using a formal framework, we decompose the contributing factors of the estimation bias caused by data fragmentation and discuss the direction of bias. Contrary to conventional wisdom, this bias cannot be signed or bounded under standard assumptions. Instead, upward biases and sign reversals can occur even in experimental settings. We then compare several corrective measures, and discuss their respective advantages and caveats.
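The claim that the bias cannot be signed can be made concrete with a toy calculation. The sketch below is an illustration only and is not taken from the paper: the data values, the two-device setup, and the helper `ols_slope` are all assumptions chosen so the arithmetic is easy to check by hand. It regresses purchases on ad exposures, first at the consumer level and then on fragmented device-level records, under two hypothetical ways activity might be split across devices.

```python
def ols_slope(xs, ys):
    """Slope from a simple least-squares regression of ys on xs (with intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var


# Hypothetical consumer-level data (assumed, noiseless): three consumers,
# each with a total number of ad exposures across all of their devices.
exposures = [1, 2, 3]
n = len(exposures)

# Benchmark: purchases = 1 * exposures, observed with identities fully matched.
purchases = [1.0 * x for x in exposures]
consumer_slope = ols_slope(exposures, purchases)

# Scenario A (assumed): every consumer sees ads on one device and buys on
# another. Unmatched, each consumer becomes two records:
#   (exposures, 0 purchases) and (0 exposures, purchases).
split_x = exposures + [0] * n
split_y = [0.0] * n + purchases
split_slope = ols_slope(split_x, split_y)

# Scenario B (assumed): purchases = 2 + 1 * exposures, all activity happens on
# one device, and the second device is an idle record with (0, 0).
purchases_b = [2.0 + x for x in exposures]
idle_x = exposures + [0] * n
idle_y = purchases_b + [0.0] * n
idle_slope = ols_slope(idle_x, idle_y)

print("consumer-level slope:", consumer_slope)
print("fragmented, ads and purchases on different devices:", split_slope)
print("fragmented, idle second device:", idle_slope)
```

In both scenarios the consumer-level slope is 1. The first fragmented regression returns a negative slope and the second returns a slope above 1, so in this toy setup the same true effect appears as a sign reversal or as an upward bias depending only on how activity falls across identifiers.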

Taxonomy

Topics: Authorship Attribution and Profiling · Opinion Dynamics and Social Influence · Privacy, Security, and Data Protection