Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM)
Anjanava Biswas, Wrick Talukdar

TL;DR
This paper discusses the importance of safety, privacy, and ethical considerations in deploying Large Language Models (LLMs) and proposes a flexible mechanism with trust and safety modules to implement guardrails.
Contribution
It introduces a novel Flexible Adaptive Sequencing mechanism with trust and safety modules for safeguarding LLM deployment.
Findings
Proposed a new guardrail framework for LLM safety.
Addresses privacy and ethical concerns in LLM deployment.
Enhances safety measures for generative AI applications.
Abstract
The AI era has ushered in Large Language Models (LLM) to the technological forefront, which has been much of the talk in 2023, and is likely to remain as such for many years to come. LLMs are the AI models that are the power house behind generative AI applications such as ChatGPT. These AI models, fueled by vast amounts of data and computational prowess, have unlocked remarkable capabilities, from human-like text generation to assisting with natural language understanding (NLU) tasks. They have quickly become the foundation upon which countless applications and software services are being built, or at least being augmented with. However, as with any groundbreaking innovations, the rise of LLMs brings forth critical safety, privacy, and ethical concerns. These models are found to have a propensity to leak private information, produce false information, and can be coerced into generating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI
