Lexidate: Model Evaluation and Selection with Lexicase

Jose Guadalupe Hernandez, Anil Kumar Saini, and Jason H. Moore

arXiv:2406.12006 · cs.NE · June 19, 2024

Open Access

TL;DR

Lexidate is a lexicase-based validation method for automated machine learning that selects models using multiple prediction values rather than a single averaged score, reducing training time while keeping accuracy comparable to 10-fold cross-validation.

Contribution

The paper presents lexidate, a novel validation approach using lexicase selection with multiple prediction values, improving efficiency in automated model selection.

Findings

01. Lexidate reduces training time compared to 10-fold CV.

02. Final model accuracy is comparable to 10-fold CV on most tasks.

03. Lexidate produces pipelines of similar or lower complexity.

Abstract

Automated machine learning streamlines the task of finding effective machine learning pipelines by automating model training, evaluation, and selection. Traditional evaluation strategies, like cross-validation (CV), generate one value that averages the accuracy of a pipeline's predictions. This single value, however, may not fully describe the generalizability of the pipeline. Here, we present Lexicase-based Validation (lexidate), a method that uses multiple, independent prediction values for selection. Lexidate splits training data into a learning set and a selection set. Pipelines are trained on the learning set and make predictions on the selection set. The predictions are graded for correctness and used by lexicase selection to identify parent pipelines. Compared to 10-fold CV, lexidate reduces the training time. We test the effectiveness of three lexidate configurations within the…
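The selection step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the 30% selection fraction, and the hard-coded toy predictions (standing in for pipelines trained on the learning set) are assumptions made for illustration. It shows the three pieces the abstract names: splitting data into a learning set and a selection set, grading each prediction for correctness, and using standard lexicase selection on the per-sample grades to pick a parent.

```python
import random


def split_learning_selection(rows, selection_fraction, rng):
    """Shuffle rows and split them into (learning_set, selection_set).

    The fraction is an illustrative assumption, not a value from the paper.
    """
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_selection = max(1, round(len(shuffled) * selection_fraction))
    return shuffled[n_selection:], shuffled[:n_selection]


def grade_predictions(predictions, labels):
    """Return one value per selection-set sample: 1 if correct, else 0."""
    return [1 if p == y else 0 for p, y in zip(predictions, labels)]


def lexicase_select(score_vectors, rng):
    """Pick one parent index with standard lexicase selection.

    score_vectors[i][j] is the grade of candidate i on test case j.
    Cases are visited in random order; only candidates with the best
    grade on the current case survive to the next case.
    """
    candidates = list(range(len(score_vectors)))
    cases = list(range(len(score_vectors[0])))
    rng.shuffle(cases)
    for case in cases:
        best = max(score_vectors[i][case] for i in candidates)
        candidates = [i for i in candidates if score_vectors[i][case] == best]
        if len(candidates) == 1:
            break
    # Any remaining ties are broken at random.
    return rng.choice(candidates)


rng = random.Random(0)

# Split ten sample indices into a learning set and a selection set.
learning, selection = split_learning_selection(range(10), 0.3, rng)

# Hypothetical selection-set labels and predictions from three pipelines.
# In practice each pipeline would first be trained on the learning set.
labels = [1, 0, 1, 1]
predictions = [
    [1, 0, 1, 1],  # pipeline 0: correct on every case
    [1, 1, 0, 1],  # pipeline 1: correct on cases 0 and 3
    [0, 0, 0, 0],  # pipeline 2: correct on case 1 only
]
scores = [grade_predictions(p, labels) for p in predictions]
parent = lexicase_select(scores, rng)
print("selected parent pipeline:", parent)
```

In this toy setup pipeline 0 is selected regardless of case order, because it is correct on every selection-set sample while each other pipeline is eliminated by at least one case. A single averaged accuracy would rank the pipelines the same way here, but the per-sample grades are what let lexicase selection favor pipelines that succeed on different subsets of cases.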

Taxonomy

Topics: Natural Language Processing Techniques · Semantic Web and Ontologies

Methods: Sparse Evolutionary Training