Evaluating Nova 2.0 Lite model under Amazon's Frontier Model Safety Framework

Satyapriya Krishna; Matteo Memelli; Tong Wang; Abhinav Mohanty; Claire O'Brien Rajkumar; Payal Motwani; Rahul Gupta; and Spyros Matsoukas

arXiv:2601.19134·cs.CR·January 28, 2026

Evaluating Nova 2.0 Lite model under Amazon's Frontier Model Safety Framework

Satyapriya Krishna, Matteo Memelli, Tong Wang, Abhinav Mohanty, Claire O'Brien Rajkumar, Payal Motwani, Rahul Gupta, and Spyros Matsoukas

PDF

Open Access

TL;DR

This paper evaluates Nova 2.0 Lite's safety and capabilities under Amazon’s Frontier Model Safety Framework, using automated benchmarks, expert red-teaming, and uplift studies across high-risk domains.

Contribution

It provides a comprehensive safety evaluation of Nova 2.0 Lite, a highly capable reasoning model processing text, images, and videos with a 1M token context.

Findings

01

Nova 2.0 Lite meets safety thresholds in high-risk domains

02

Automated benchmarks and red-teaming identify potential risks

03

The evaluation methodology will be extended for future models

Abstract

Amazon published its Frontier Model Safety Framework (FMSF) as part of the Paris AI summit, following which we presented a report on Amazon's Premier model. In this report, we present an evaluation of Nova 2.0 Lite. Nova 2.0 Lite was made generally available from amongst the Nova 2.0 series and is one of its most capable reasoning models. The model processes text, images, and video with a context length of up to 1M tokens, enabling analysis of large codebases, documents, and videos in a single prompt. We present a comprehensive evaluation of Nova 2.0 Lite's critical risk profile under the FMSF. Evaluations target three high-risk domains-Chemical, Biological, Radiological and Nuclear (CBRN), Offensive Cyber Operations, and Automated AI R&D-and combine automated benchmarks, expert red-teaming, and uplift studies to determine whether the model exceeds release thresholds. We summarize our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Safety Systems Engineering in Autonomy · Information and Cyber Security