Digestion Algorithm in Hierarchical Symbolic Forests: A Fast Text Normalization Algorithm and Semantic Parsing Framework for Specific Scenarios and Lightweight Deployment
Kevin You

TL;DR
This paper introduces DAHSF, a fast, lightweight text normalization and semantic parsing framework based on hierarchical symbolic forests, suitable for scenario-specific domains with limited data and requiring quick responses.
Contribution
The paper presents the novel DAHSF algorithm, combining symbolic forests with a multilayer framework to enhance interpretability, efficiency, and data utilization in text normalization and semantic parsing.
Findings
DAHSF runs efficiently on small datasets with minimal memory usage.
Model size and response time are reduced by at least two orders of magnitude.
Successfully applied in the Chinese scripting language platform.
Abstract
Text Normalization and Semantic Parsing have numerous applications in natural language processing, such as natural language programming, paraphrasing, data augmentation, constructing expert systems, text matching, and more. Despite the prominent achievements of deep learning in Large Language Models (LLMs), the interpretability of neural network architectures is still poor, which affects their credibility and hence limits the deployments of risk-sensitive scenarios. In certain scenario-specific domains with scarce data, rapidly obtaining a large number of supervised learning labels is challenging, and the workload of manually labeling data would be enormous. Catastrophic forgetting in neural networks further leads to low data utilization rates. In situations where swift responses are vital, the density of the model makes local deployment difficult and the response time long, which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Model-Driven Software Engineering Techniques
MethodsDigestion Algorithm in Hierarchical Symbolic Forests
