A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking
Zilin Yuan, Yinghui Li, Yangning Li, Rui Xie, Wei Wu, Hai-Tao Zheng

TL;DR
This paper introduces a curriculum learning strategy based on keyword weight ranking to enhance multi-domain text classification models, especially when domain-specific data is limited, demonstrating improved performance over existing methods.
Contribution
The paper proposes a novel curriculum learning approach utilizing keyword weight ranking to better leverage multi-domain data in text classification tasks.
Findings
Improved classification accuracy on Amazon review and FDU-MTL datasets.
Outperforms state-of-the-art multi-domain classification methods.
Effective in scenarios with limited domain-specific annotated data.
Abstract
Text classification is a very classic NLP task, but it has two prominent shortcomings: On the one hand, text classification is deeply domain-dependent. That is, a classifier trained on the corpus of one domain may not perform so well in another domain. On the other hand, text classification models require a lot of annotated data for training. However, for some domains, there may not exist enough annotated data. Therefore, it is valuable to investigate how to efficiently utilize text data from different domains to improve the performance of models in various domains. Some multi-domain text classification models are trained by adversarial training to extract shared features among all domains and the specific features of each domain. We noted that the distinctness of the domain-specific features is different, so in this paper, we propose to use a curriculum learning strategy based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Sentiment Analysis and Opinion Mining · Topic Modeling
