Leveraging Large Language Models for Trustworthiness Assessment of Web Applications
Oleksandr Yarotskyi, Jos\'e D'Abruzzo Pereira, Jo\~ao R. Campos

TL;DR
This paper proposes an automated approach using Large Language Models to assess the trustworthiness of web applications by verifying secure coding practices, aiming to improve scalability and objectivity in security evaluations.
Contribution
It introduces a novel empirical methodology leveraging LLMs with various prompt techniques and extends a hierarchical Quality Model to quantify web application trustworthiness.
Findings
Prompt engineering impacts assessment reliability.
Structural context can introduce noise in LLM outputs.
Rule-based prompts enhance trustworthiness evaluation.
Abstract
The widespread adoption of web applications has made their security a critical concern and has increased the need for systematic ways to assess whether they can be considered trustworthy. However, "trust" assessment remains an open problem as existing techniques primarily focus on detecting known vulnerabilities or depend on manual evaluation, which limits their scalability; therefore, evaluating adherence to secure coding practices offers a complementary, pragmatic perspective by focusing on observable development behaviors. In practice, the identification and verification of secure coding practices are predominantly performed manually, relying on expert knowledge and code reviews, which is time-consuming, subjective, and difficult to scale. This study presents an empirical methodology to automate the trustworthiness assessment of web applications by leveraging Large Language Models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Application Security Vulnerabilities · Information and Cyber Security · Security and Verification in Computing
