Algorithm Selection with Zero Domain Knowledge via Text Embeddings

Stefan Szeider

arXiv:2604.19753·cs.AI·April 23, 2026

Algorithm Selection with Zero Domain Knowledge via Text Embeddings

Stefan Szeider

PDF

TL;DR

This paper introduces ZeroFolio, a domain-agnostic algorithm selection method using pretrained text embeddings, outperforming traditional feature-based approaches across multiple problem domains.

Contribution

It presents a feature-free, text embedding-based pipeline for algorithm selection that works across diverse domains without domain-specific training.

Findings

01

Outperforms random forest with hand-crafted features in 10 of 11 scenarios

02

Achieves perfect performance with two-seed voting in all scenarios

03

Key design choices include inverse-distance weighting and Manhattan distance

Abstract

We propose a feature-free approach to algorithm selection that replaces hand-crafted instance features with pretrained text embeddings. Our method, ZeroFolio, proceeds in three steps: it reads the raw instance file as plain text, embeds it with a pretrained embedding model, and selects an algorithm via weighted k-nearest neighbors. The key to our approach is the observation that pretrained embeddings produce representations that distinguish problem instances without any domain knowledge or task-specific training. This allows us to apply the same three-step pipeline (serialize, embed, select) across diverse problem domains with text-based instance formats. We evaluate our approach on 11 ASlib scenarios spanning 7 domains (SAT, MaxSAT, QBF, ASP, CSP, MIP, and graph problems). Our experiments show that this approach outperforms a random forest trained on hand-crafted features in 10 of 11…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.