A Dataset on Malicious Paper Bidding in Peer Review

Steven Jecmen; Minji Yoon; Vincent Conitzer; Nihar B. Shah; Fei Fang

arXiv:2207.02303·cs.CR·March 14, 2023·1 cites

A Dataset on Malicious Paper Bidding in Peer Review

Steven Jecmen, Minji Yoon, Vincent Conitzer, Nihar B. Shah, Fei Fang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new dataset of malicious and honest paper bids from a mock conference, analyzes bidding strategies, and evaluates baseline detection algorithms to improve peer review integrity.

Contribution

It provides the first publicly available dataset on malicious paper bidding, along with analysis and baseline detection methods for future research.

Findings

01

Different bidding strategies can successfully manipulate paper assignments.

02

Simple detection algorithms show baseline performance in identifying malicious bids.

03

The dataset enables benchmarking for malicious bidding detection methods.

Abstract

In conference peer review, reviewers are often asked to provide "bids" on each submitted paper that express their interest in reviewing that paper. A paper assignment algorithm then uses these bids (along with other data) to compute a high-quality assignment of reviewers to papers. However, this process has been exploited by malicious reviewers who strategically bid in order to unethically manipulate the paper assignment, crucially undermining the peer review process. For example, these reviewers may aim to get assigned to a friend's paper as part of a quid-pro-quo deal. A critical impediment towards creating and evaluating methods to mitigate this issue is the lack of any publicly-available data on malicious paper bidding. In this work, we collect and publicly release a novel dataset to fill this gap, collected from a mock conference activity where participants were instructed to bid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sjecmen/malicious_bidding_dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Imbalanced Data Classification Techniques · Expert finding and Q&A systems