LLMs for Multi-Modal Knowledge Extraction and Analysis in   Intelligence/Safety-Critical Applications

Brett Israelsen; Soumalya Sarkar

arXiv:2312.03088·cs.CL·December 7, 2023·1 cites

LLMs for Multi-Modal Knowledge Extraction and Analysis in Intelligence/Safety-Critical Applications

Brett Israelsen, Soumalya Sarkar

PDF

Open Access

TL;DR

This paper reviews recent advances and vulnerabilities of Large Language Models, emphasizing their potential and challenges in deploying them safely in intelligence and safety-critical applications.

Contribution

It synthesizes current research on LLM assessment and vulnerabilities, providing a structured overview to guide safe application in critical domains.

Findings

01

LLMs are approaching human-level performance on benchmarks.

02

Vulnerabilities are categorized into ten high-level groups.

03

Mitigation strategies are reviewed for addressing these vulnerabilities.

Abstract

Large Language Models have seen rapid progress in capability in recent years; this progress has been accelerating and their capabilities, measured by various benchmarks, are beginning to approach those of humans. There is a strong demand to use such models in a wide variety of applications but, due to unresolved vulnerabilities and limitations, great care needs to be used before applying them to intelligence and safety-critical applications. This paper reviews recent literature related to LLM assessment and vulnerabilities to synthesize the current research landscape and to help understand what advances are most critical to enable use of of these technologies in intelligence and safety-critical applications. The vulnerabilities are broken down into ten high-level categories and overlaid onto a high-level life cycle of an LLM. Some general categories of mitigations are reviewed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Reliability and Analysis Research · Text Readability and Simplification · Software Engineering Research