Evaluation of Interactive Machine Learning Systems

Nadia Boukhelifa; Anastasia Bezerianos; Evelyne Lutton

arXiv:1801.07964·cs.AI·January 25, 2018

Evaluation of Interactive Machine Learning Systems

Nadia Boukhelifa, Anastasia Bezerianos, Evelyne Lutton

PDF

TL;DR

This paper discusses the challenges in evaluating interactive machine learning systems, emphasizing the importance of combining algorithmic and human-centered assessments to better understand co-adaptation and improve system design.

Contribution

It advocates for a dual validation approach and illustrates this with a visual analytics application, highlighting the benefits of integrating human-centered evaluation with algorithm analysis.

Findings

01

Human-centered evaluation complements algorithmic analysis.

02

Combining validation methods addresses the 'black-box' issue.

03

Human-computer interaction methods are crucial for future research.

Abstract

The evaluation of interactive machine learning systems remains a difficult task. These systems learn from and adapt to the human, but at the same time, the human receives feedback and adapts to the system. Getting a clear understanding of these subtle mechanisms of co-operation and co-adaptation is challenging. In this chapter, we report on our experience in designing and evaluating various interactive machine learning applications from different domains. We argue for coupling two types of validation: algorithm-centered analysis, to study the computational behaviour of the system; and human-centered evaluation, to observe the utility and effectiveness of the application for end-users. We use a visual analytics application for guided search, built using an interactive evolutionary approach, as an exemplar of our work. Our observation is that human-centered design and evaluation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.