Maximum Spanning Trees Are Invariant to Temperature Scaling in Graph-based Dependency Parsing

Stefan Grünewald

arXiv:2106.08159 · cs.CL · June 16, 2021

Open Access

TL;DR

This paper proves that temperature scaling, a common calibration method, does not affect the output of maximum spanning tree algorithms in neural graph-based dependency parsers, indicating the need for alternative calibration techniques.

Contribution

The paper proves that applying temperature scaling to a parser's head-selection scores cannot change the dependency tree obtained by maximum spanning tree decoding.
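The intuition behind this result can be sketched in a few lines. The notation below is an assumption introduced here for illustration (s_{ij} for the score of token i as head of token j, T > 0 for the temperature, y for a candidate tree) and is not necessarily the notation or the exact argument used in the paper.

```latex
% Temperature-scaled log-probability of head i for dependent j
\log p_T(i \mid j) = \frac{s_{ij}}{T} - \log Z_j(T),
\qquad
Z_j(T) = \sum_{k \neq j} \exp\!\left(\frac{s_{kj}}{T}\right)

% Score of a tree y, where y(j) denotes the head assigned to token j
\operatorname{score}_T(y)
  = \sum_{j} \log p_T\bigl(y(j) \mid j\bigr)
  = \frac{1}{T} \sum_{j} s_{y(j)\,j} \;-\; \sum_{j} \log Z_j(T)

% The second sum does not depend on y and 1/T > 0, hence
\arg\max_{y} \operatorname{score}_T(y) = \arg\max_{y} \sum_{j} s_{y(j)\,j}
```

Because every token receives exactly one head in any tree, the normalizer terms contribute the same constant to every candidate, and dividing by a positive T preserves the ranking of trees.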

Findings

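1. Temperature scaling does not change the maximum spanning tree extracted by a graph-based dependency parser.

2. Calibrating such parsers in a way that improves parsing accuracy requires techniques other than temperature scaling.

3. Miscalibration in these parsers, in particular overconfident predictions, therefore remains to be addressed by other methods.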

Abstract

Modern graph-based syntactic dependency parsers operate by predicting, for each token within a sentence, a probability distribution over its possible syntactic heads (i.e., all other tokens) and then extracting a maximum spanning tree from the resulting log-probabilities. Nowadays, virtually all such parsers utilize deep neural networks and may thus be susceptible to miscalibration (in particular, overconfident predictions). In this paper, we prove that temperature scaling, a popular technique for post-hoc calibration of neural networks, cannot change the output of the aforementioned procedure. We conclude that other techniques are needed to tackle miscalibration in graph-based dependency parsers in a way that improves parsing accuracy.
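The claim can be checked empirically on a toy sentence. The following is a minimal sketch, not the author's code: the helper names (`log_softmax`, `is_tree`, `best_tree`) are hypothetical, the scores are random numbers rather than parser outputs, and the maximum spanning tree is found by brute-force enumeration instead of a dedicated algorithm such as Chu-Liu/Edmonds, which is only feasible for very short sentences.

```python
import itertools
import math
import random

NEG_INF = float("-inf")


def log_softmax(scores, temperature):
    """Log of softmax(scores / temperature); masked (-inf) entries stay -inf."""
    scaled = [s / temperature for s in scores]
    top = max(scaled)
    log_z = top + math.log(sum(math.exp(s - top) for s in scaled))
    return [s - log_z for s in scaled]


def is_tree(heads):
    """heads[d - 1] is the head of token d; tokens are 1..n and 0 is the root."""
    for start in range(1, len(heads) + 1):
        seen = set()
        node = start
        while node != 0:          # walk up until the artificial root is reached
            if node in seen:      # revisiting a token means there is a cycle
                return False
            seen.add(node)
            node = heads[node - 1]
    return True


def best_tree(scores, temperature):
    """Brute-force argmax over all trees of the summed head log-probabilities."""
    n = len(scores)
    log_probs = [log_softmax(row, temperature) for row in scores]
    candidates = (
        heads
        for heads in itertools.product(range(n + 1), repeat=n)
        if all(h != d for d, h in enumerate(heads, start=1)) and is_tree(heads)
    )
    return max(
        candidates,
        key=lambda heads: sum(log_probs[d][h] for d, h in enumerate(heads)),
    )


# Toy example: 4 tokens, random scores, self-attachment masked out with -inf.
random.seed(0)
n_tokens = 4
scores = [
    [NEG_INF if h == d else random.gauss(0.0, 2.0) for h in range(n_tokens + 1)]
    for d in range(1, n_tokens + 1)
]

trees = {t: best_tree(scores, t) for t in (0.5, 1.0, 2.0, 10.0)}
for temperature, tree in trees.items():
    print(f"T={temperature}: heads={tree}")
```

Under these assumptions, the printed head assignment should be the same for every temperature, barring exact ties or floating-point near-ties between candidate trees. The individual log-probabilities do change with T; only the ranking of complete trees is unaffected.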

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification