Interactive Graphics for Visually Diagnosing Forest Classifiers in R

Natalia da Silva; Dianne Cook; Eun-Kyung Lee

arXiv:1704.02502·stat.ML·February 25, 2026·Comput. Stat.·2 cites

Interactive Graphics for Visually Diagnosing Forest Classifiers in R

Natalia da Silva, Dianne Cook, Eun-Kyung Lee

PDF

Open Access

TL;DR

This paper presents interactive R graphics tools for diagnosing and understanding forest classifiers, enabling detailed exploration of model complexity, variable importance, and uncertainty in ensemble models.

Contribution

It introduces interactive visualization methods in R for analyzing forest classifiers, enhancing interpretability of ensemble models like random forests and projection pursuit forests.

Findings

01

Interactive graphics reveal insights into class structure and variable importance.

02

Tools help assess model complexity and prediction uncertainty.

03

Applicable to various bagged ensemble methods.

Abstract

This paper describes structuring data and constructing plots to explore forest classification models interactively. A forest classifier is an example of an ensemble, produced by bagging multiple trees. The process of bagging and combining results from multiple trees, produces numerous diagnostics which, with interactive graphics, can provide a lot of insight into class structure in high dimensions. Various aspects are explored in this paper, to assess model complexity, individual model contributions, variable importance and dimension reduction, and uncertainty in prediction associated with individual observations. The ideas are applied to the random forest algorithm, and to the projection pursuit forest, but could be more broadly applied to other bagged ensembles. Interactive graphics are built in R, using the ggplot2, plotly, and shiny packages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Analysis with R · Hydrological Forecasting Using AI