Seclens: Role-specific Evaluation of LLM's for security vulnerablity detection

Subho Halder; Siddharth Saxena; Kashinath Kadaba Shrish; Thiyagarajan M

arXiv:2604.01637·cs.CR·April 3, 2026

Seclens: Role-specific Evaluation of LLM's for security vulnerablity detection

Subho Halder, Siddharth Saxena, Kashinath Kadaba Shrish, Thiyagarajan M

PDF

TL;DR

This paper introduces SecLens-R, a multi-stakeholder evaluation framework for LLM-based security vulnerability detection, capturing diverse priorities and revealing significant performance variation across stakeholder perspectives.

Contribution

The paper presents a novel multi-dimensional, role-specific evaluation framework for vulnerability detection models, addressing limitations of single-metric benchmarks.

Findings

01

Evaluation reveals substantial variation in model scores across stakeholder profiles.

02

Different models perform variably depending on the stakeholder's priorities.

03

Stakeholder-aware metrics provide deeper insights into model capabilities.

Abstract

Existing benchmarks for LLM-based vulnerability detection compress model performance into a single metric, which fails to reflect the distinct priorities of different stakeholders. For example, a CISO may emphasize high recall of critical vulnerabilities, an engineering leader may prioritize minimizing false positives, and an AI officer may balance capability against cost. To address this limitation, we introduce SecLens-R, a multi-stakeholder evaluation framework structured around 35 shared dimensions grouped into 7 measurement categories. The framework defines five role-specific weighting profiles: CISO, Chief AI Officer, Security Researcher, Head of Engineering, and AI-as-Actor. Each profile selects 12 to 16 dimensions with weights summing to 80, yielding a composite Decision Score between 0 and 100. We apply SecLens-R to evaluate 12 frontier models on a dataset of 406 tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.