o3-mini vs DeepSeek-R1: Which One is Safer?

Aitor Arrieta; Miriam Ugarte; Pablo Valle; Jos\'e Antonio Parejo,; Sergio Segura

arXiv:2501.18438·cs.SE·February 3, 2025·3 cites

o3-mini vs DeepSeek-R1: Which One is Safer?

Aitor Arrieta, Miriam Ugarte, Pablo Valle, Jos\'e Antonio Parejo,, Sergio Segura

PDF

Open Access 1 Repo

TL;DR

This paper compares the safety of two large language models, DeepSeek-R1 and o3-mini, using an automated testing tool, revealing that DeepSeek-R1 is significantly less safe than o3-mini.

Contribution

It introduces a systematic safety assessment method using the ASTRAL tool to evaluate LLM safety levels.

Findings

01

DeepSeek-R1 has 12% unsafe responses.

02

o3-mini has 1.2% unsafe responses.

03

The assessment demonstrates a safety gap between the models.

Abstract

The irruption of DeepSeek-R1 constitutes a turning point for the AI industry in general and the LLMs in particular. Its capabilities have demonstrated outstanding performance in several tasks, including creative thinking, code generation, maths and automated program repair, at apparently lower execution cost. However, LLMs must adhere to an important qualitative property, i.e., their alignment with safety and human values. A clear competitor of DeepSeek-R1 is its American counterpart, OpenAI's o3-mini model, which is expected to set high standards in terms of performance, safety and cost. In this technical report, we systematically assess the safety level of both DeepSeek-R1 (70b version) and OpenAI's o3-mini (beta version). To this end, we make use of our recently released automated safety testing tool, named ASTRAL. By leveraging this tool, we automatically and systematically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trust4ai/astral
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging

Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Sparse Evolutionary Training