Automatic Prediction of the Performance of Every Parser
Ergun Bi\c{c}ici

TL;DR
This paper introduces MTPPS-PPP, a universal machine learning model that predicts parser performance across languages and parsers using only textual and structural features, aiding in parser selection and understanding text complexity.
Contribution
The novel MTPPS-PPP system predicts parser performance without language or parser-specific data, outperforming previous methods in accuracy and versatility.
Findings
Achieves 0.0678 MAE and 0.85 RAE in performance prediction.
Outperforms textual feature-based methods and matches parser-specific approaches.
Effective across different languages, domains, and learning settings.
Abstract
We present a new parser performance prediction (PPP) model using machine translation performance prediction system (MTPPS), statistically independent of any language or parser, relying only on extrinsic and novel features based on textual, link structural, and bracketing tree structural information. This new system, MTPPS-PPP, can predict the performance of any parser in any language and can be useful for estimating the grammatical difficulty when understanding a given text, for setting expectations from parsing output, for parser selection for a specific domain, and for parser combination systems. We obtain SoA results in PPP of bracketing with better results over textual features and similar performance with previous results that use parser and linguistic label specific information. Our results show the contribution of different types of features as well as rankings of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuzzy Logic and Control Systems · Algorithms and Data Compression · Machine Learning in Bioinformatics
MethodsMasked autoencoder · Regularized Autoencoders
