Deep or Simple Models for Semantic Tagging? It Depends on your Data [Experiments]
Jinfeng Li, Yuliang Li, Xiaolan Wang, Wang-Chiew Tan

TL;DR
This study systematically compares deep and simple models for semantic tagging across diverse datasets, revealing that simple models often perform comparably or better depending on dataset characteristics, thus guiding model selection.
Contribution
It provides a comprehensive analysis of when simple models can match or outperform deep models in semantic tagging tasks, based on dataset properties.
Findings
Simple models perform similarly to deep models on large datasets.
Simple models are faster and can outperform deep models on noisy or imbalanced datasets.
Dataset size, label ratio, and cleanliness significantly influence model performance.
Abstract
Semantic tagging, which has extensive applications in text mining, predicts whether a given piece of text conveys the meaning of a given semantic tag. The problem of semantic tagging is largely solved with supervised learning and today, deep learning models are widely perceived to be better for semantic tagging. However, there is no comprehensive study supporting the popular belief. Practitioners often have to train different types of models for each semantic tagging task to identify the best model. This process is both expensive and inefficient. We embark on a systematic study to investigate the following question: Are deep models the best performing model for all semantic tagging tasks? To answer this question, we compare deep models against "simple models" over datasets with varying characteristics. Specifically, we select three prevalent deep models (i.e. CNN, LSTM, and BERT) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
