Who With Whom? Learning Optimal Matching Policies
Yagan Hazard, Toru Kitagawa

TL;DR
This paper introduces a method to learn optimal matching policies in two-sided matching problems using empirical optimal transport, aiming to improve welfare outcomes based on observable characteristics.
Contribution
It formulates the matching policy learning as an empirical optimal transport problem and derives welfare regret bounds for the estimated policies.
Findings
The proposed method effectively learns welfare-enhancing matching policies.
Application to job search data shows improved welfare performance.
Theoretical bounds on policy convergence are established.
Abstract
There are many economic contexts where the productivity and welfare performance of institutions and policies depend on who matches with whom. Examples include caseworkers and job seekers in job search assistance programs, medical doctors and patients, teachers and students, attorneys and defendants, and tax auditors and taxpayers, among others. Although reallocating individuals through a change in matching policy can be less costly than training personnel or introducing a new program, methods for learning optimal matching policies and their statistical performance are less studied than methods for other policy interventions. This paper develops a method to learn welfare optimal matching policies for two-sided matching problems in which a planner matches individuals based on the rich set of observable characteristics of the two sides. We formulate the learning problem as an empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems · Experimental Behavioral Economics Studies · Auction Theory and Applications
