Automatic Prediction of the Performance of Every Parser

Ergun Bi\c{c}ici

arXiv:2407.05116·cs.CL·July 9, 2024

Automatic Prediction of the Performance of Every Parser

Ergun Bi\c{c}ici

PDF

Open Access

TL;DR

This paper introduces MTPPS-PPP, a universal machine learning model that predicts parser performance across languages and parsers using only textual and structural features, aiding in parser selection and understanding text complexity.

Contribution

The novel MTPPS-PPP system predicts parser performance without language or parser-specific data, outperforming previous methods in accuracy and versatility.

Findings

01

Achieves 0.0678 MAE and 0.85 RAE in performance prediction.

02

Outperforms textual feature-based methods and matches parser-specific approaches.

03

Effective across different languages, domains, and learning settings.

Abstract

We present a new parser performance prediction (PPP) model using machine translation performance prediction system (MTPPS), statistically independent of any language or parser, relying only on extrinsic and novel features based on textual, link structural, and bracketing tree structural information. This new system, MTPPS-PPP, can predict the performance of any parser in any language and can be useful for estimating the grammatical difficulty when understanding a given text, for setting expectations from parsing output, for parser selection for a specific domain, and for parser combination systems. We obtain SoA results in PPP of bracketing $F_{1}$ with better results over textual features and similar performance with previous results that use parser and linguistic label specific information. Our results show the contribution of different types of features as well as rankings of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuzzy Logic and Control Systems · Algorithms and Data Compression · Machine Learning in Bioinformatics

MethodsMasked autoencoder · Regularized Autoencoders