LLM-Based Identification of Infostealer Infection Vectors from Screenshots: The Case of Aurora

Estelle Ruellan; Eric Clay; Nicholas Ascoli

arXiv:2507.23611·cs.CR·August 1, 2025

LLM-Based Identification of Infostealer Infection Vectors from Screenshots: The Case of Aurora

Estelle Ruellan, Eric Clay, Nicholas Ascoli

PDF

Open Access

TL;DR

This paper presents a novel LLM-based method to analyze infection screenshots for identifying malware infection vectors, specifically for Aurora, enabling scalable threat intelligence and early intervention.

Contribution

It introduces a new approach using GPT-4o-mini to analyze screenshots for IoCs, infection vectors, and campaigns, filling a gap in reactive malware analysis.

Findings

01

Extracted 337 URLs and 246 files from 1000 screenshots

02

Identified three distinct malware campaigns

03

Demonstrated the effectiveness of LLMs in threat analysis

Abstract

Infostealers exfiltrate credentials, session cookies, and sensitive data from infected systems. With over 29 million stealer logs reported in 2024, manual analysis and mitigation at scale are virtually unfeasible/unpractical. While most research focuses on proactive malware detection, a significant gap remains in leveraging reactive analysis of stealer logs and their associated artifacts. Specifically, infection artifacts such as screenshots, image captured at the point of compromise, are largely overlooked by the current literature. This paper introduces a novel approach leveraging Large Language Models (LLMs), more specifically gpt-4o-mini, to analyze infection screenshots to extract potential Indicators of Compromise (IoCs), map infection vectors, and track campaigns. Focusing on the Aurora infostealer, we demonstrate how LLMs can process screenshots to identify infection vectors,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection