Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation
Valentin Leonhard Buchner, Lele Cao, Jan-Christoph Kalo, Vilhelm von, Ehrenheim

TL;DR
This paper introduces Prompt Tuned Embedding Classification (PTEC), a novel method that enhances multi-label industry classification by improving accuracy and efficiency over traditional prompt tuning approaches, especially in domain-specific contexts.
Contribution
The study proposes PTEC, replacing the language head with a classification head, addressing key limitations of text-to-text classification in multi-label tasks, and demonstrates its effectiveness in industrial company classification.
Findings
PTEC outperforms baseline prompt tuning methods in accuracy.
PTEC reduces computational costs during inference.
Model performance remains consistent across well-known and less-known companies.
Abstract
Prompt Tuning is emerging as a scalable and cost-effective method to fine-tune Pretrained Language Models (PLMs), which are often referred to as Large Language Models (LLMs). This study benchmarks the performance and computational efficiency of Prompt Tuning and baselines for multi-label text classification. This is applied to the challenging task of classifying companies into an investment firm's proprietary industry taxonomy, supporting their thematic investment strategy. Text-to-text classification is frequently reported to outperform task-specific classification heads, but has several limitations when applied to a multi-label classification problem where each label consists of multiple tokens: (a) Generated labels may not match any label in the label taxonomy; (b) The fine-tuning process lacks permutation invariance and is sensitive to the order of the provided labels; (c) The model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Natural Language Processing Techniques · Topic Modeling
