Few-shot learning approaches for classifying low resource domain specific software requirements

Anmol Nayak; Hari Prasad Timmapathini; Vidhya Murali; Atul Anil Gohad

arXiv:2302.06951 · cs.CL · February 15, 2023 · 1 citation

TL;DR

This paper investigates few-shot learning methods to classify low-resource automotive software requirements using pre-trained NLP models, demonstrating that certain models perform well with minimal annotated data.

Contribution

It evaluates multiple algorithms for fine-tuning pre-trained models on automotive domain requirements with only 15 samples per category, highlighting effective approaches in low-resource settings.

Findings

01 SciBERT and DeBERTa outperform others at 15 samples

02 Performance gains plateau beyond 50 samples for some models

03 Siamese and T5 models show competitive results with fewer samples

Abstract

With the advent of strong pre-trained natural language processing models such as BERT, DeBERTa, MiniLM, and T5, the amount of data that industries need to fine-tune these models for their niche use cases has dropped drastically (typically to a few hundred annotated samples for reasonable performance). However, even a few hundred annotated samples may not be available in low resource domains like automotive, which often limits the use of such deep learning models in an industrial setting. In this paper, we aim to address the challenge of fine-tuning such pre-trained models with only a few annotated samples, also known as few-shot learning. Our experiments focus on evaluating the performance of a diverse set of algorithms and methodologies to achieve the task of classifying BOSCH automotive domain textual software requirements into 3 categories, while…
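
The few-shot setting described above (a fixed, small number of annotated samples per category, for example 15) can be illustrated with a minimal sketch that draws a balanced k-shot training subset from a labeled pool. This is not the authors' code: the function name `sample_k_shot`, the placeholder labels `category_0` to `category_2`, and the synthetic requirement texts are all assumptions made for illustration, since the listing does not name the three categories.

```python
import random
from collections import defaultdict


def sample_k_shot(examples, k, seed=0):
    """Draw exactly k (text, label) pairs per label from a labeled pool.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Group the pool by label so each category can be sampled separately.
    by_label = defaultdict(list)
    for text, label in examples:
        by_label[label].append((text, label))

    rng = random.Random(seed)  # fixed seed keeps the subset reproducible
    subset = []
    for label in sorted(by_label):
        pool = by_label[label]
        if len(pool) < k:
            raise ValueError(f"label {label!r} has only {len(pool)} examples")
        subset.extend(rng.sample(pool, k))  # sampling without replacement

    rng.shuffle(subset)  # avoid feeding the model one category at a time
    return subset


# Synthetic placeholder pool: 300 fake requirements over 3 made-up categories.
pool = [(f"requirement {i}", f"category_{i % 3}") for i in range(300)]
train = sample_k_shot(pool, k=15)
```

With three categories and k=15, the resulting subset holds 45 examples, which would then be passed to whichever fine-tuning method is being compared.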

Taxonomy

Topics: Software Reliability and Analysis Research · Software Engineering Research · Machine Learning and Data Classification

Methods: Gated Linear Unit · Multi-Head Attention · Attention Is All You Need · Byte Pair Encoding · Adafactor · SentencePiece · Inverse Square Root Schedule · WordPiece · Softmax