Evaluation of Interactive Machine Learning Systems
Nadia Boukhelifa, Anastasia Bezerianos, Evelyne Lutton

TL;DR
This paper discusses the challenges in evaluating interactive machine learning systems, emphasizing the importance of combining algorithmic and human-centered assessments to better understand co-adaptation and improve system design.
Contribution
It advocates for a dual validation approach and illustrates this with a visual analytics application, highlighting the benefits of integrating human-centered evaluation with algorithm analysis.
Findings
Human-centered evaluation complements algorithmic analysis.
Combining validation methods addresses the 'black-box' issue.
Human-computer interaction methods are crucial for future research.
Abstract
The evaluation of interactive machine learning systems remains a difficult task. These systems learn from and adapt to the human, but at the same time, the human receives feedback and adapts to the system. Getting a clear understanding of these subtle mechanisms of co-operation and co-adaptation is challenging. In this chapter, we report on our experience in designing and evaluating various interactive machine learning applications from different domains. We argue for coupling two types of validation: algorithm-centered analysis, to study the computational behaviour of the system; and human-centered evaluation, to observe the utility and effectiveness of the application for end-users. We use a visual analytics application for guided search, built using an interactive evolutionary approach, as an exemplar of our work. Our observation is that human-centered design and evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
