An A*-algorithm for the Unordered Tree Edit Distance with Custom Costs

Benjamin Paaßen

arXiv:2108.00953·cs.AI·February 11, 2022

TL;DR

This paper introduces three new A* heuristics for computing the unordered tree edit distance with custom cost functions. On two chemical data sets, custom costs make the A* computation faster and reduce the error of a 5-nearest-neighbor regressor that predicts chemical properties.

Contribution

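Existing A* heuristics for the unordered tree edit distance assume unit costs for deletions, insertions, and replacements. The paper presents three novel heuristics that work with custom cost functions, so that domain knowledge can be injected into the distance.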

Findings

01

Custom costs make the A* computation faster on two chemical data sets.

02

Custom costs reduce the error of a 5-nearest-neighbor regressor that predicts chemical properties.

03

On these data, polynomial edit distances achieve results similar to those of the unordered tree edit distance.
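
The second finding concerns a 5-nearest-neighbor regressor that works on edit distances. As a minimal sketch of how such a regressor could operate on precomputed distances, the Python function below averages the targets of the k closest training items. The function name `knn_regress` and the toy numbers are illustrative assumptions, not taken from the paper or its repository.

```python
def knn_regress(distances_to_train, train_targets, k=5):
    """Predict a target as the mean target of the k nearest training items.

    distances_to_train[i] is the (precomputed) distance between the query
    and training item i, e.g. a tree edit distance.
    """
    if len(distances_to_train) != len(train_targets):
        raise ValueError("need exactly one distance per training target")
    if not 1 <= k <= len(train_targets):
        raise ValueError("k must be between 1 and the number of training items")
    # Indices of the training items, sorted by increasing distance to the query.
    order = sorted(range(len(train_targets)), key=lambda i: distances_to_train[i])
    nearest = order[:k]
    return sum(train_targets[i] for i in nearest) / k


# Hypothetical toy data: distances from one query tree to seven training
# trees, and the property value attached to each training tree.
distances = [3.0, 0.5, 1.0, 4.0, 2.0, 1.5, 6.0]
targets = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0, 70.0]
print(knn_regress(distances, targets, k=5))  # 34.0
```

Because the regressor only needs distances, any edit distance can be swapped in without changing the prediction code, which is what makes a comparison between distances straightforward.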

Abstract

The unordered tree edit distance is a natural metric to compute distances between trees without intrinsic child order, such as representations of chemical molecules. While the unordered tree edit distance is MAX SNP-hard in principle, it is feasible for small cases, e.g., via an A* algorithm. Unfortunately, current heuristics for the A* algorithm assume unit costs for deletions, insertions, and replacements, which limits our ability to inject domain knowledge. In this paper, we present three novel heuristics for the A* algorithm that work with custom cost functions. In experiments on two chemical data sets, we show that custom costs make the A* computation faster and improve the error of a 5-nearest-neighbor regressor that predicts chemical properties. We also show that, on these data, polynomial edit distances can achieve results similar to those of the unordered tree edit distance.
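
To illustrate what a custom cost function and a lower bound under custom costs can look like, here is a minimal Python sketch. It is not one of the paper's three heuristics. It ignores the tree structure entirely and finds the cheapest way to pair up, delete, and insert node labels. Assuming the costs satisfy the triangle inequality, so that the distance equals the cost of the cheapest valid node mapping, this value can never exceed the unordered tree edit distance, because every valid tree mapping is also a label pairing. All names (`label_assignment_lower_bound`, `indel_cost`, `replace_cost`) and the cost values are hypothetical.

```python
from functools import lru_cache


def label_assignment_lower_bound(x_labels, y_labels,
                                 delete_cost, insert_cost, replace_cost):
    """Cheapest pairing of node labels, ignoring all ancestor constraints.

    Exhaustive search with memoization; only meant for very small inputs.
    """
    m, n = len(x_labels), len(y_labels)

    @lru_cache(maxsize=None)
    def best(i, used):
        # `used` is a bitmask of the nodes of y that already have a partner.
        if i == m:
            # Every node of y that is still unpaired has to be inserted.
            return sum(insert_cost(y_labels[j])
                       for j in range(n) if not (used >> j) & 1)
        # Option 1: delete node i of x.
        result = delete_cost(x_labels[i]) + best(i + 1, used)
        # Option 2: replace node i of x with any unpaired node j of y.
        for j in range(n):
            if not (used >> j) & 1:
                candidate = (replace_cost(x_labels[i], y_labels[j])
                             + best(i + 1, used | (1 << j)))
                result = min(result, candidate)
        return result

    return best(0, 0)


# Hypothetical custom costs: hydrogen atoms are cheap to delete or insert.
def indel_cost(label):
    return 0.5 if label == "H" else 1.0


def replace_cost(a, b):
    return 0.0 if a == b else 1.0


x = ["C", "O", "H", "H"]  # node labels of the first tree
y = ["C", "N", "H"]       # node labels of the second tree
print(label_assignment_lower_bound(x, y, indel_cost, indel_cost, replace_cost))  # 1.5
```

With unit costs the same call yields 2.0 (one deletion plus one replacement), so the custom costs change the value by making the surplus hydrogen cheaper to remove. A production version would replace the exhaustive search with a polynomial assignment solver.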

Code & Models

Repositories

https://gitlab.com/bpaassen/uted
PyTorch · Official