A machine learning model to identify corruption in M\'exico's public procurement contracts
Andr\'es Aldana, Andrea Falc\'on-Cort\'es, Hern\'an Larralde

TL;DR
This paper introduces a machine learning ensemble model called hyper-forest to identify corrupt public procurement contracts in Mexico, emphasizing relationships between buyers and suppliers as key predictors.
Contribution
The paper presents a novel ensemble machine learning model for detecting corruption in public procurement, highlighting the importance of relational features over contract-specific ones.
Findings
The model correctly detects most corrupt and non-corrupt contracts.
Relational features between buyers and suppliers are the most critical predictors.
The method is adaptable to other countries' data.
Abstract
The costs and impacts of government corruption range from impairing a country's economic growth to affecting its citizens' well-being and safety. Public contracting between government dependencies and private sector instances, referred to as public procurement, is a fertile land of opportunity for corrupt practices, generating substantial monetary losses worldwide. Thus, identifying and deterring corrupt activities between the government and the private sector is paramount. However, due to several factors, corruption in public procurement is challenging to identify and track, leading to corrupt practices going unnoticed. This paper proposes a machine learning model based on an ensemble of random forest classifiers, which we call hyper-forest, to identify and predict corrupt contracts in M\'exico's public procurement data. This method's results correctly detect most of the corrupt and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCorruption and Economic Development
