$\textit{GeoHard}$: Towards Measuring Class-wise Hardness through   Modelling Class Semantics

Fengyu Cai; Xinran Zhao; Hongming Zhang; Iryna Gurevych; Heinz Koeppl

arXiv:2407.12512·cs.CL·July 18, 2024

$\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics

Fengyu Cai, Xinran Zhao, Hongming Zhang, Iryna Gurevych, Heinz Koeppl

PDF

Open Access 1 Video

TL;DR

This paper introduces GeoHard, a novel method for measuring class-wise hardness in NLP tasks by modeling class geometry in semantic space, revealing insights into data difficulty and improving model training.

Contribution

It pioneers the concept of class-wise hardness, proposes GeoHard for its measurement, and demonstrates its effectiveness and generality across multiple datasets and models.

Findings

01

GeoHard outperforms previous instance-level metrics by over 59% in correlation.

02

Class-wise hardness is consistent across datasets, models, and human judgment.

03

Understanding class-wise hardness aids in improving task learning.

Abstract

Recent advances in measuring hardness-wise properties of data guide language models in sample selection within low-resource scenarios. However, class-specific properties are overlooked for task setup and learning. How will these properties influence model learning and is it generalizable across datasets? To answer this question, this work formally initiates the concept of $class-wise hardness$ . Experiments across eight natural language understanding (NLU) datasets demonstrate a consistent hardness distribution across learning paradigms, models, and human judgment. Subsequent experiments unveil a notable challenge in measuring such class-wise hardness with instance-level metrics in previous works. To address this, we propose $GeoHard$ for class-wise hardness measurement by modeling class geometry in the semantic embedding space. $GeoHard$ surpasses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

$\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics· underline

Taxonomy

TopicsSoftware Engineering Research · Parallel Computing and Optimization Techniques