Imbalances in Neurosymbolic Learning: Characterization and Mitigating Strategies
Kaifu Wang, Efthymia Tsamoura, Dan Roth

TL;DR
This paper investigates learning imbalances in neurosymbolic learning, revealing how symbolic components influence errors across classes, and proposes strategies to estimate label distributions and mitigate these imbalances, improving performance.
Contribution
It characterizes the impact of symbolic components on class-specific risks in NSL and introduces practical algorithms to estimate label marginals and reduce learning imbalances.
Findings
Learning imbalances are significantly affected by the symbolic component σ.
Proposed methods improve performance by up to 14% on baseline models.
Techniques are effective in both training and testing phases.
Abstract
We study one of the most popular problems in **neurosymbolic learning** (NSL), that of learning neural classifiers given only the result of applying a symbolic component to the gold labels of the elements of a vector . The gold labels of the elements in are unknown to the learner. We make multiple contributions, theoretical and practical, to address a problem that has not been studied so far in this context, that of characterizing and mitigating *learning imbalances*, i.e., major differences in the errors that occur when classifying instances of different classes (aka **class-specific risks**). Our theoretical analysis reveals a unique phenomenon: that can greatly impact learning imbalances. This result sharply contrasts with previous research on supervised and weakly supervised learning, which only studies learning imbalances under data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTransport Systems and Technology
MethodsFocus
