Act as a Honeytoken Generator! An Investigation into Honeytoken   Generation with Large Language Models

Daniel Reti; Norman Becker; Tillmann Angeli; Anasuya Chattopadhyay,; Daniel Schneider; Sebastian Vollmer; Hans D. Schotten

arXiv:2404.16118·cs.CR·April 26, 2024

Act as a Honeytoken Generator! An Investigation into Honeytoken Generation with Large Language Models

Daniel Reti, Norman Becker, Tillmann Angeli, Anasuya Chattopadhyay,, Daniel Schneider, Sebastian Vollmer, Hans D. Schotten

PDF

Open Access

TL;DR

This paper explores using large language models to automate the creation of diverse honeytokens for cybersecurity, demonstrating their effectiveness and variability across different models and prompt structures.

Contribution

It systematically investigates prompt-based honeytoken generation with LLMs, addressing limitations of existing automated methods and evaluating performance across multiple models.

Findings

01

GPT-3.5 generates honeywords less distinguishable from real passwords.

02

Optimal prompts vary between different LLMs.

03

Large language models can produce a wide variety of honeytokens.

Abstract

With the increasing prevalence of security incidents, the adoption of deception-based defense strategies has become pivotal in cyber security. This work addresses the challenge of scalability in designing honeytokens, a key component of such defense mechanisms. The manual creation of honeytokens is a tedious task. Although automated generators exists, they often lack versatility, being specialized for specific types of honeytokens, and heavily rely on suitable training datasets. To overcome these limitations, this work systematically investigates the approach of utilizing Large Language Models (LLMs) to create a variety of honeytokens. Out of the seven different honeytoken types created in this work, such as configuration files, databases, and log files, two were used to evaluate the optimal prompt. The generation of robots.txt files and honeywords was used to systematically test 210…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Advanced Text Analysis Techniques · AI in Service Interactions