# Combining Crowd and Machines for Multi-predicate Item Screening

**Authors:** Evgeny Krivosheev, Fabio Casati, Marcos Baez, Boualem Benatallah

arXiv: 1904.00714 · 2019-04-02

## TL;DR

This paper presents hybrid algorithms combining crowd and machine classifiers for efficient multi-predicate item screening, optimizing performance and cost across various classification scenarios.

## Contribution

It introduces novel methods for managing cost-accuracy trade-offs and combining classifiers without prior accuracy knowledge, improving screening efficiency.

## Key findings

- Significant cost and accuracy improvements over existing algorithms.
- Effective management of classifier accuracy estimation and ensemble training.
- Demonstrated applicability across multiple domains.

## Abstract

This paper discusses how crowd and machine classifiers can be efficiently combined to screen items that satisfy a set of predicates. We show that this is a recurring problem in many domains, present machine-human (hybrid) algorithms that screen items efficiently and estimate the gain over human-only or machine-only screening in terms of performance and cost. We further show how, given a new classification problem and a set of classifiers of unknown accuracy for the problem at hand, we can identify how to manage the cost-accuracy trade off by progressively determining if we should spend budget to obtain test data (to assess the accuracy of the given classifiers), or to train an ensemble of classifiers, or whether we should leverage the existing machine classifiers with the crowd, and in this case how to efficiently combine them based on their estimated characteristics to obtain the classification. We demonstrate that the techniques we propose obtain significant cost/accuracy improvements with respect to the leading classification algorithms.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.00714/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/1904.00714/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/1904.00714/full.md

---
Source: https://tomesphere.com/paper/1904.00714