A Dataset on Malicious Paper Bidding in Peer Review
Steven Jecmen, Minji Yoon, Vincent Conitzer, Nihar B. Shah, Fei Fang

TL;DR
This paper introduces a new dataset of malicious and honest paper bids from a mock conference, analyzes bidding strategies, and evaluates baseline detection algorithms to improve peer review integrity.
Contribution
It provides the first publicly available dataset on malicious paper bidding, along with analysis and baseline detection methods for future research.
Findings
Different bidding strategies can successfully manipulate paper assignments.
Simple detection algorithms show baseline performance in identifying malicious bids.
The dataset enables benchmarking for malicious bidding detection methods.
Abstract
In conference peer review, reviewers are often asked to provide "bids" on each submitted paper that express their interest in reviewing that paper. A paper assignment algorithm then uses these bids (along with other data) to compute a high-quality assignment of reviewers to papers. However, this process has been exploited by malicious reviewers who strategically bid in order to unethically manipulate the paper assignment, crucially undermining the peer review process. For example, these reviewers may aim to get assigned to a friend's paper as part of a quid-pro-quo deal. A critical impediment towards creating and evaluating methods to mitigate this issue is the lack of any publicly-available data on malicious paper bidding. In this work, we collect and publicly release a novel dataset to fill this gap, collected from a mock conference activity where participants were instructed to bid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Imbalanced Data Classification Techniques · Expert finding and Q&A systems
