Good Enough to Learn: LLM-based Anomaly Detection in ECU Logs without Reliable Labels

Bogdan Bogdan; Arina Cazacu; Laura Vasilie

arXiv:2507.01077·cs.LG·July 3, 2025

Good Enough to Learn: LLM-based Anomaly Detection in ECU Logs without Reliable Labels

Bogdan Bogdan, Arina Cazacu, Laura Vasilie

PDF

Open Access

TL;DR

This paper introduces a decoder-only Large Language Model for anomaly detection in automotive ECU logs, effectively handling unreliable labels and learning from minimal data to improve detection accuracy.

Contribution

It presents a novel decoder-only LLM architecture for ECU anomaly detection, addressing label inconsistency and adaptability across use cases.

Findings

01

Effective anomaly detection in ECU logs using minimal labeled data

02

Entropy regularization increases model uncertainty in known anomalies

03

Scalable approach reduces manual labeling costs

Abstract

Anomaly detection often relies on supervised or clustering approaches, with limited success in specialized domains like automotive communication systems where scalable solutions are essential. We propose a novel decoder-only Large Language Model (LLM) to detect anomalies in Electronic Control Unit (ECU) communication logs. Our approach addresses two key challenges: the lack of LLMs tailored for ECU communication and the complexity of inconsistent ground truth data. By learning from UDP communication logs, we formulate anomaly detection simply as identifying deviations in time from normal behavior. We introduce an entropy regularization technique that increases model's uncertainty in known anomalies while maintaining consistency in similar scenarios. Our solution offers three novelties: a decoder-only anomaly detection architecture, a way to handle inconsistent labeling, and an adaptable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Software System Performance and Reliability · Smart Grid Security and Resilience