Explicit Grammar Semantic Feature Fusion for Robust Text Classification
Azrin Sultana, Firoz Ahmed

TL;DR
This paper introduces a lightweight text classification model that explicitly encodes grammatical and semantic features, outperforming traditional transformer-based models especially in resource-constrained environments.
Contribution
It presents a novel approach combining explicit grammatical rules with semantic embeddings into a unified, compact representation for robust text classification.
Findings
Outperforms baseline models by 2-15% in accuracy
Captures both structural and semantic text properties effectively
Suitable for deployment on edge devices
Abstract
Natural Language Processing enables computers to understand human language by analysing and classifying text efficiently with deep-level grammatical and semantic features. Existing models capture features by learning from large corpora with transformer models, which are computationally intensive and unsuitable for resource-constrained environments. Therefore, our proposed study incorporates comprehensive grammatical rules alongside semantic information to build a robust, lightweight classification model without resorting to full parameterised transformer models or heavy deep learning architectures. The novelty of our approach lies in its explicit encoding of sentence-level grammatical structure, including syntactic composition, phrase patterns, and complexity indicators, into a compact grammar vector, which is then fused with frozen contextual embeddings. These heterogeneous elements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Big Data and Digital Economy
