Classification Protocols with Minimal Disclosure

Jinshuo Dong; Jason Hartline; Aravindan Vijayaraghavan

arXiv:2209.02690·cs.CR·September 7, 2022

TL;DR

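This paper introduces a multi-party classification protocol, motivated by e-discovery, in which the requesting party receives all responsive documents and the sending party discloses only the non-responsive documents needed to prove that. The formal guarantees cover the case where a linear classifier correctly partitions the documents.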

Contribution

It identifies a multi-party classification protocol that discloses only the non-responsive documents necessary to prove that all responsive documents have been received, and shows that the protocol can be embedded in a machine learning framework for automated labeling.

Findings

01

Protocol delivers all responsive documents while disclosing the minimum of non-responsive ones

02

Embeds into machine learning for automated labeling

03

Equivalent to standard one-party classification when that problem satisfies independence of irrelevant alternatives

Abstract

We consider multi-party protocols for classification that are motivated by applications such as e-discovery in court proceedings. We identify a protocol that guarantees that the requesting party receives all responsive documents and the sending party discloses the minimal amount of non-responsive documents necessary to prove that all responsive documents have been received. This protocol can be embedded in a machine learning framework that enables automated labeling of points and the resulting multi-party protocol is equivalent to the standard one-party classification problem (if the one-party classification problem satisfies a natural independence-of-irrelevant-alternatives property). Our formal guarantees focus on the case where there is a linear classifier that correctly partitions the documents.
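
The formal guarantees above rest on one assumption: some linear classifier correctly partitions the documents. The sketch below illustrates only that assumption, not the paper's protocol. It uses the classic perceptron rule to search for a separating hyperplane over toy feature vectors. The function names, the two-dimensional feature vectors, and the `max_epochs` cutoff are hypothetical choices made for this illustration.

```python
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[float]


def find_linear_separator(
    points: Sequence[Vector],
    labels: Sequence[int],
    max_epochs: int = 1000,
) -> Optional[Tuple[List[float], float]]:
    """Perceptron search for (w, b) with label * (w . x + b) > 0 for all points.

    Labels are +1 (responsive) or -1 (non-responsive). Returns None if no
    separator is found within max_epochs passes, which is a heuristic
    cutoff and not a proof that the data is inseparable.
    """
    w = [0.0] * len(points[0])
    b = 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in zip(points, labels):
            score = sum(wi * xi for wi, xi in zip(w, x)) + b
            if y * score <= 0:  # misclassified or on the boundary
                w = [wi + y * xi for wi, xi in zip(w, x)]
                b += y
                mistakes += 1
        if mistakes == 0:
            return w, b
    return None


def classify(w: Vector, b: float, x: Vector) -> int:
    """Label a point with the linear classifier (w, b)."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else -1


# Toy documents as 2-D feature vectors (made-up data).
docs = [(2.0, 1.0), (3.0, 2.0), (0.0, 1.0), (1.0, 3.0)]
labels = [1, 1, -1, -1]
separator = find_linear_separator(docs, labels)

# An XOR-style labeling admits no linear separator, so the search gives up.
xor_docs = [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0), (1.0, 0.0)]
xor_labels = [1, 1, -1, -1]
no_separator = find_linear_separator(xor_docs, xor_labels)
```

On the first dataset the search succeeds, so the setting assumed by the formal guarantees holds. On the XOR-style dataset it returns `None`, which is the kind of input those guarantees do not address.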
