Imbalances in Neurosymbolic Learning: Characterization and Mitigating Strategies

Kaifu Wang; Efthymia Tsamoura; Dan Roth

arXiv:2407.10000·cs.LG·December 18, 2025

TL;DR

This paper investigates learning imbalances in neurosymbolic learning, revealing how symbolic components influence errors across classes, and proposes strategies to estimate label distributions and mitigate these imbalances, improving performance.

Contribution

The paper characterizes the impact of symbolic components on class-specific risks in neurosymbolic learning (NSL) and introduces practical algorithms to estimate label marginals and reduce learning imbalances.

Findings

01. Learning imbalances are significantly affected by the symbolic component σ.

02. The proposed methods improve performance by up to 14% over baseline models.

03. The techniques are effective in both the training and testing phases.

Abstract

We study one of the most popular problems in **neurosymbolic learning** (NSL), that of learning neural classifiers given only the result of applying a symbolic component $σ$ to the gold labels of the elements of a vector $x$. The gold labels of the elements in $x$ are unknown to the learner. We make multiple contributions, theoretical and practical, to address a problem that has not been studied so far in this context, that of characterizing and mitigating *learning imbalances*, i.e., major differences in the errors that occur when classifying instances of different classes (aka **class-specific risks**). Our theoretical analysis reveals a unique phenomenon: that $σ$ can greatly impact learning imbalances. This result sharply contrasts with previous research on supervised and weakly supervised learning, which only studies learning imbalances under data…
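To make the setting concrete, the following is a minimal sketch, not the paper's algorithm. It assumes a hypothetical symbolic component `sigma_max` (the maximum of the hidden labels) and uniformly distributed hidden labels. It enumerates all label vectors and measures, for each class, how much the observed output of $σ$ reveals about the hidden label of the first element. The function names and the choice of $σ$ are illustrative assumptions, and the quantity computed is a simple proxy for how informative the supervision is, not the class-specific risk defined in the paper.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product


def sigma_max(labels):
    # Hypothetical symbolic component (an assumption for illustration):
    # the learner only observes the maximum of the hidden labels.
    return max(labels)


def posterior_of_first_label(num_classes, vector_len, sigma):
    """Return {s: {j: P(y_1 = j | sigma(y) = s)}} under uniform hidden labels."""
    counts = defaultdict(lambda: defaultdict(int))
    for y in product(range(num_classes), repeat=vector_len):
        counts[sigma(y)][y[0]] += 1
    posterior = {}
    for s, by_label in counts.items():
        total = sum(by_label.values())
        posterior[s] = {j: Fraction(c, total) for j, c in by_label.items()}
    return posterior


def mean_posterior_on_true_label(num_classes, vector_len, sigma):
    """For each class j, average P(y_1 = j | sigma(y)) over vectors with y_1 = j."""
    posterior = posterior_of_first_label(num_classes, vector_len, sigma)
    totals = defaultdict(Fraction)
    for y in product(range(num_classes), repeat=vector_len):
        totals[y[0]] += posterior[sigma(y)][y[0]]
    # Number of label vectors that share a fixed first label.
    per_class = num_classes ** (vector_len - 1)
    return {j: totals[j] / per_class for j in range(num_classes)}


if __name__ == "__main__":
    for label, score in mean_posterior_on_true_label(3, 2, sigma_max).items():
        print(label, score)
```

With three classes and two elements, this sketch gives 23/45 for classes 0 and 1 and 3/5 for class 2: even though the hidden labels are perfectly balanced, the output of $σ$ is more informative about some classes than others. This is only meant to illustrate why $σ$ itself, and not just the data distribution, can be a source of imbalance.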

Videos

Imbalances in Neurosymbolic Learning: Characterization and Mitigating Strategies (SlidesLive)

Taxonomy

Topics: Transport Systems and Technology

Methods: Focus