LLM-Based Identification of Infostealer Infection Vectors from Screenshots: The Case of Aurora
Estelle Ruellan, Eric Clay, Nicholas Ascoli

TL;DR
This paper presents a novel LLM-based method to analyze infection screenshots for identifying malware infection vectors, specifically for Aurora, enabling scalable threat intelligence and early intervention.
Contribution
It introduces a new approach using GPT-4o-mini to analyze screenshots for IoCs, infection vectors, and campaigns, filling a gap in reactive malware analysis.
Findings
Extracted 337 URLs and 246 files from 1000 screenshots
Identified three distinct malware campaigns
Demonstrated the effectiveness of LLMs in threat analysis
Abstract
Infostealers exfiltrate credentials, session cookies, and sensitive data from infected systems. With over 29 million stealer logs reported in 2024, manual analysis and mitigation at scale are virtually unfeasible/unpractical. While most research focuses on proactive malware detection, a significant gap remains in leveraging reactive analysis of stealer logs and their associated artifacts. Specifically, infection artifacts such as screenshots, image captured at the point of compromise, are largely overlooked by the current literature. This paper introduces a novel approach leveraging Large Language Models (LLMs), more specifically gpt-4o-mini, to analyze infection screenshots to extract potential Indicators of Compromise (IoCs), map infection vectors, and track campaigns. Focusing on the Aurora infostealer, we demonstrate how LLMs can process screenshots to identify infection vectors,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection
