An Empirical study on LLM-based Log Retrieval for Software Engineering Metadata Management
Simin Sun, Yuchuan Jin, Miroslaw Staron

TL;DR
This paper presents an LLM-supported method for natural language log retrieval in autonomous driving systems, combining signal logs and videos to improve efficiency and reliability over traditional SQL queries.
Contribution
It introduces a novel LLM-based approach that integrates log data and videos for scenario search, with quantifiable metrics and an API implementation, enhancing retrieval accuracy and usability.
Findings
Improved efficiency in scenario retrieval
Enhanced reliability without relying solely on SQL
Effective integration of logs and videos for visualization
Abstract
Developing autonomous driving systems (ADSs) involves generating and storing extensive log data from test drives, which is essential for verification, research, and simulation. However, these high-frequency logs, recorded over varying durations, pose challenges for developers attempting to locate specific driving scenarios. This difficulty arises due to the wide range of signals representing various vehicle components and driving conditions, as well as unfamiliarity of some developers' with the detailed meaning of these signals. Traditional SQL-based querying exacerbates this challenge by demanding both domain expertise and database knowledge, often yielding results that are difficult to verify for accuracy. This paper introduces a Large Language Model (LLM)-supported approach that combines signal log data with video recordings from test drives, enabling natural language based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Service-Oriented Architecture and Web Services · Data Quality and Management
