Confidence-weighted integration of human and machine judgments for superior decision-making
Felipe Y\'a\~nez, Xiaoliang Luo, Omar Valerio Minero, Bradley C. Love

TL;DR
This paper introduces a Bayesian logistic regression method to effectively combine human and machine judgments, leveraging confidence calibration and diversity to improve decision-making in tasks like image classification and neuroscience forecasting.
Contribution
It presents a simple, scalable approach for integrating confidence-weighted judgments from humans and machines, enhancing team performance beyond individual capabilities.
Findings
Combining human and machine judgments improves accuracy.
The method is effective across diverse tasks.
Confidence calibration and diversity are key to success.
Abstract
Large language models (LLMs) can surpass humans in certain forecasting tasks. What role does this leave for humans in the overall decision process? One possibility is that humans, despite performing worse than LLMs, can still add value when teamed with them. A human and machine team can surpass each individual teammate when team members' confidence is well-calibrated and team members diverge in which tasks they find difficult (i.e., calibration and diversity are needed). We simplified and extended a Bayesian approach to combining judgments using a logistic regression framework that integrates confidence-weighted judgments for any number of team members. Using this straightforward method, we demonstrated its effectiveness in both image classification and neuroscience forecasting tasks. Combining human judgments with one or more machines consistently improved overall team performance. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making · Human-Automation Interaction and Safety · Forecasting Techniques and Applications
MethodsLogistic Regression
