Seclens: Role-specific Evaluation of LLM's for security vulnerablity detection
Subho Halder, Siddharth Saxena, Kashinath Kadaba Shrish, Thiyagarajan M

TL;DR
This paper introduces SecLens-R, a multi-stakeholder evaluation framework for LLM-based security vulnerability detection, capturing diverse priorities and revealing significant performance variation across stakeholder perspectives.
Contribution
The paper presents a novel multi-dimensional, role-specific evaluation framework for vulnerability detection models, addressing limitations of single-metric benchmarks.
Findings
Evaluation reveals substantial variation in model scores across stakeholder profiles.
Different models perform variably depending on the stakeholder's priorities.
Stakeholder-aware metrics provide deeper insights into model capabilities.
Abstract
Existing benchmarks for LLM-based vulnerability detection compress model performance into a single metric, which fails to reflect the distinct priorities of different stakeholders. For example, a CISO may emphasize high recall of critical vulnerabilities, an engineering leader may prioritize minimizing false positives, and an AI officer may balance capability against cost. To address this limitation, we introduce SecLens-R, a multi-stakeholder evaluation framework structured around 35 shared dimensions grouped into 7 measurement categories. The framework defines five role-specific weighting profiles: CISO, Chief AI Officer, Security Researcher, Head of Engineering, and AI-as-Actor. Each profile selects 12 to 16 dimensions with weights summing to 80, yielding a composite Decision Score between 0 and 100. We apply SecLens-R to evaluate 12 frontier models on a dataset of 406 tasks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
