Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets

Jialong Zuo; Haoyou Deng; Hanyu Zhou; Jiaxin Zhu; Yicheng Zhang; Yiwei Zhang; Yongxin Yan; Kaixing Huang; Weisen Chen; Yongtai Deng; Rui Jin; Nong Sang; Changxin Gao

arXiv:2512.15110·cs.CV·December 22, 2025

Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets

Jialong Zuo, Haoyou Deng, Hanyu Zhou, Jiaxin Zhu, Yicheng Zhang, Yiwei Zhang, Yongxin Yan, Kaixing Huang, Weisen Chen, Yongtai Deng, Rui Jin, Nong Sang, Changxin Gao

PDF

Open Access 2 Datasets

TL;DR

This paper comprehensively evaluates Nano Banana Pro's performance on 14 low-level vision tasks across 40 datasets, revealing its strengths in subjective quality but limitations in quantitative accuracy compared to specialist models.

Contribution

It provides the first extensive zero-shot benchmark of Nano Banana Pro on diverse low-level vision tasks, highlighting its potential and current limitations.

Findings

01

Nano Banana Pro excels in subjective visual quality.

02

It underperforms in reference-based quantitative metrics.

03

Generative stochasticity affects pixel-level consistency.

Abstract

The rapid evolution of text-to-image generation models has revolutionized visual content creation. While commercial products like Nano Banana Pro have garnered significant attention, their potential as generalist solvers for traditional low-level vision challenges remains largely underexplored. In this study, we investigate the critical question: Is Nano Banana Pro a Low-Level Vision All-Rounder? We conducted a comprehensive zero-shot evaluation across 14 distinct low-level tasks spanning 40 diverse datasets. By utilizing simple textual prompts without fine-tuning, we benchmarked Nano Banana Pro against state-of-the-art specialist models. Our extensive analysis reveals a distinct performance dichotomy: while \textbf{Nano Banana Pro demonstrates superior subjective visual quality}, often hallucinating plausible high-frequency details that surpass specialist models, it lags behind in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications