On Lexical Invariance on Multisets and Graphs
Muhan Zhang

TL;DR
This paper investigates the conditions under which functions on multisets and graphs are invariant to lexical transformations, establishing theoretical characterizations and validating them through synthetic experiments.
Contribution
It provides the first formal characterization of lexical invariance for functions on multisets and graphs, identifying their most expressive forms.
Findings
Functions on multisets depend only on counts of unique elements.
Most expressive graph functions depend on adjacency and difference matrices.
Synthetic experiments verify the theoretical characterizations.
Abstract
In this draft, we study a novel problem, called lexical invariance, using the medium of multisets and graphs. Traditionally in the NLP domain, lexical invariance indicates that the semantic meaning of a sentence should remain unchanged regardless of the specific lexical or word-based representation of the input. For example, ``The movie was extremely entertaining'' would have the same meaning as ``The film was very enjoyable''. In this paper, we study a more challenging setting, where the output of a function is invariant to any injective transformation applied to the input lexical space. For example, multiset {1,2,3,2} is equivalent to multiset {a,b,c,b} if we specify an injective transformation that maps 1 to a, 2 to b and 3 to c. We study the sufficient and necessary conditions for a most expressive lexical invariant (and permutation invariant) function on multisets and graphs, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Algebra and Logic · Rough Sets and Fuzzy Logic · Fuzzy and Soft Set Theory
