Active Learning for Robust and Representative LLM Generation in   Safety-Critical Scenarios

Sabit Hassan; Anthony Sicilia; Malihe Alikhani

arXiv:2410.11114·cs.CL·October 16, 2024

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

Sabit Hassan, Anthony Sicilia, Malihe Alikhani

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel active learning framework with clustering to guide LLM generation, improving the diversity and robustness of safety scenario data for critical applications.

Contribution

It presents a new method combining active learning and clustering to generate more representative safety scenarios without prior distribution knowledge.

Findings

01

Generated 5.4K safety violation scenarios

02

Improved accuracy and F1 scores of models using the data

03

Enhanced diversity and robustness of safety data

Abstract

Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing systems. While Large Language Models (LLMs) can generate valuable data for safety measures, they often exhibit distributional biases, focusing on common scenarios and neglecting rare but critical cases. This can undermine the effectiveness of safety protocols developed using such data. To address this, we propose a novel framework that integrates active learning with clustering to guide LLM generation, enhancing their representativeness and robustness in safety scenarios. We demonstrate the effectiveness of our approach by constructing a dataset of 5.4K potential safety violations through an iterative process involving LLM generation and an active learner model's feedback. Our results show that the proposed framework produces a more representative set of safety scenarios without requiring prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios· underline

Taxonomy

TopicsSoftware Reliability and Analysis Research · Formal Methods in Verification · Advanced Control Systems Design

MethodsSparse Evolutionary Training