Detecting Privileged Documents by Ranking Connected Network Entities

Jianping Zhang; Han Qin; Nathaniel Huber-Fliflet

arXiv:2512.08073·cs.IR·December 10, 2025

Detecting Privileged Documents by Ranking Connected Network Entities

Jianping Zhang, Han Qin, Nathaniel Huber-Fliflet

PDF

Open Access

TL;DR

This paper introduces a link analysis method that constructs a network of human entities from email metadata to identify privileged documents by ranking entities based on their interactions with legal professionals.

Contribution

It proposes a novel network-based scoring algorithm that leverages entity interactions to improve privileged document detection in email data.

Findings

01

Effective ranking of legal entities for privileged document detection

02

Improved identification accuracy over baseline methods

03

Demonstrated success on experimental email datasets

Abstract

This paper presents a link analysis approach for identifying privileged documents by constructing a network of human entities derived from email header metadata. Entities are classified as either counsel or non-counsel based on a predefined list of known legal professionals. The core assumption is that individuals with frequent interactions with lawyers are more likely to participate in privileged communications. To quantify this likelihood, an algorithm assigns a score to each entity within the network. By utilizing both entity scores and the strength of their connections, the method enhances the identification of privileged documents. Experimental results demonstrate the algorithm's effectiveness in ranking legal entities for privileged document detection.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Topic Modeling · Advanced Graph Neural Networks