Learning Section Weights for Multi-Label Document Classification
Maziar Moradi Fard, Paula Sorrolla Bayod, Kiomars Motarjem, Mohammad, Alian Nejadi, Saber Akhondi, Camilo Thorne

TL;DR
This paper introduces Learning Section Weights (LSW), a novel method that assigns importance to different document sections to improve multi-label classification accuracy, especially for scientific articles.
Contribution
The paper proposes LSW, a new approach that learns to weight document sections dynamically, outperforming existing methods in multi-label document classification tasks.
Findings
LSW achieves a 1.3% improvement in macro F1-score on arXiv dataset.
LSW outperforms state-of-the-art methods in multi-label classification.
Experimental results confirm the effectiveness of section weighting.
Abstract
Multi-label document classification is a traditional task in NLP. Compared to single-label classification, each document can be assigned multiple classes. This problem is crucially important in various domains, such as tagging scientific articles. Documents are often structured into several sections such as abstract and title. Current approaches treat different sections equally for multi-label classification. We argue that this is not a realistic assumption, leading to sub-optimal results. Instead, we propose a new method called Learning Section Weights (LSW), leveraging the contribution of each distinct section for multi-label classification. Via multiple feed-forward layers, LSW learns to assign weights to each section of, and incorporate the weights in the prediction. We demonstrate our approach on scientific articles. Experimental results on public (arXiv) and private (Elsevier)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Text Analysis Techniques · Topic Modeling
