Transforming Computer Security and Public Trust Through the Exploration   of Fine-Tuning Large Language Models

Garrett Crumrine; Izzat Alsmadi; Jesus Guerrero; Yuvaraj Munian

arXiv:2406.00628·cs.CL·August 12, 2024

Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models

Garrett Crumrine, Izzat Alsmadi, Jesus Guerrero, Yuvaraj Munian

PDF

Open Access

TL;DR

This paper investigates how fine-tuning large language models can both reveal vulnerabilities exploited by malicious services called Mallas and inform strategies to enhance AI security and trustworthiness.

Contribution

It introduces fine-tuning approaches to analyze Mallas' exploitation techniques and proposes methods to improve LLM security against malicious misuse.

Findings

01

Fine-tuning reveals Mallas' operational strategies

02

Identifies vulnerabilities in pre-trained language models

03

Suggests safeguards for secure AI deployment

Abstract

Large language models (LLMs) have revolutionized how we interact with machines. However, this technological advancement has been paralleled by the emergence of "Mallas," malicious services operating underground that exploit LLMs for nefarious purposes. Such services create malware, phishing attacks, and deceptive websites, escalating the cyber security threats landscape. This paper delves into the proliferation of Mallas by examining the use of various pre-trained language models and their efficiency and vulnerabilities when misused. Building on a dataset from the Common Vulnerabilities and Exposures (CVE) program, it explores fine-tuning methodologies to generate code and explanatory text related to identified vulnerabilities. This research aims to shed light on the operational strategies and exploitation techniques of Mallas, leading to the development of more secure and trustworthy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data